The Credit Union Wire
Global Signal · Outside the Bubble
COINBASE US · earnings

Coinbase AWS Outage Exposes Cloud Concentration Risk

The Coinbase AWS outage 2025 grounded a major crypto exchange after a data-center cooling failure, raising hard questions about cloud vendor concentration risk.

By The Credit Union Wire ·

The Coinbase AWS outage that struck in early May put one of crypto's most-watched platforms offline for hours after multiple chillers failed inside an Amazon Web Services facility, causing a server room to overheat and taking down core trading infrastructure. Coinbase Global, Inc. (NASDAQ: COIN) CEO Brian Armstrong confirmed the incident publicly on X, writing that the disruption was "never acceptable." The failure arrived at a particularly exposed moment: Coinbase had just reported weaker-than-expected quarterly results and disclosed plans to reduce its workforce by roughly 14% as the company pivots toward what Armstrong described as a leaner, AI-native operating model.

AWS Cooling Failure Behind Coinbase Exchange Crash

The mechanics of the failure were unusually specific. According to Armstrong and Coinbase Head of Platform Rob Witoff, the incident began when multiple chillers inside an affected AWS Availability Zone failed, causing thermal stress across the physical infrastructure that Coinbase's exchange systems depend on. While many Coinbase services are architected to tolerate single-zone failures, the core trading platform was not. Witoff explained that the primary exchange systems run inside a single availability zone to minimize latency, a deliberate design choice that optimizes execution speed by keeping matching-engine components and client connections physically close together. Backup and failover systems intended to isolate such failures did not behave as expected during the incident, which forced engineers to execute disaster-recovery procedures manually, extending the duration of the outage beyond what an automated response would have allowed. Armstrong acknowledged the architectural tradeoff directly, noting that "exchanges have unique architectures that optimize for latency and co-location of clients," and said Coinbase would revisit those tradeoffs with a focus on reducing the time required to migrate across availability zones. Customer funds, the company stressed, were not at risk at any point during the disruption. The full account was reported by Benzinga.

Cloud Concentration Risk in Crypto Platform Infrastructure

The incident is a pointed illustration of a structural tension running through much of fintech infrastructure: the same architectural decisions that deliver performance advantages can compress the margin for error during physical failures at the cloud layer. AWS is the dominant cloud provider for financial services applications, including a substantial share of the fintech and crypto platforms that credit unions are increasingly partnering with to offer digital asset services or modernize core adjacent systems. When a single hyperscaler's availability zone experiences a physical failure, the blast radius is determined not by one company's risk posture but by every tenant's architectural choices within that zone. Coinbase's decision to co-locate matching-engine components in a single zone for latency reasons is commercially rational, but it created a single point of failure that a chiller malfunction could expose. For any institution whose fintech partner, core processor, or digital-asset custody vendor runs on AWS infrastructure with similar single-zone designs, the lesson is direct: uptime guarantees in a service-level agreement are only as strong as the vendor's ability to execute an availability-zone migration under real failure conditions. Smaller credit unions evaluating crypto or payments technology partners should examine not just advertised redundancy but tested failover, a distinction the Coinbase incident made visible. Reviewing how community-based institutions approach operational resilience, including technology partnerships, is a theme we return to across our credit union spotlight coverage of community institutions.

What it means for credit unions considering fintech partnerships

For credit unions, the Coinbase AWS outage is not merely a crypto-industry headline. It is a vendor risk scenario playing out in public, with the kind of technical specificity that regulators increasingly expect credit union boards and risk committees to understand. The NCUA's third-party vendor risk guidance and FFIEC cloud service provider oversight frameworks both call for institutions to assess not just whether a vendor has a business continuity plan but whether that plan has been tested, whether recovery time objectives are contractually enforceable, and whether the institution itself has a contingency if the vendor goes dark. A credit union in a community bond, with perhaps a few hundred million dollars in assets and a newly launched crypto wallet service or a cloud-hosted digital lending platform, faces a materially different conversation with its examiners if that platform goes offline during a high-volatility period and member transactions are blocked. The Coinbase situation, where automated failover did not work as designed and manual recovery extended the outage, is precisely the gap regulators want institutions to probe. Boards should be asking vendors for documented and recently tested recovery time data, not just contractual promises. For context on how smaller institutions are building their digital service models, see our profile of Acadiana Medical Credit Union.

What we're watching

Sources cited