0%

Incident Post Mortem: October 27, 2021

30 paź 2021 Czas czytania: 4 min
Obraz baneru artykułu informacyjnego

We’re continuing to learn more about these events, and will continue to update this post with additional details that may be of interest.

The Incident

On the morning of October 27th PT, we experienced a significant increase in traffic. As traffic increased, our engineers were alerted about elevated error rates appearing across a number of services.

The following functionality was affected:

  • Logged-out experience: users that were not logged in experienced errors when visiting coinbase.com or our mobile apps.

  • Coinbase Pro: users were temporarily unable to log in to Coinbase Pro.

  • Transfers: There was a higher rate of cancelled and refunded transfers during this time, as well as delays in processing on-chain money movements. Users may have been unable to see their latest transfer history.

Root Cause Analysis

These issues were caused by two separate but related outages. Both were triggered by system bottlenecks caused by the elevated traffic.

Traffic to Coinbase — 10/27/2021

In the first outage, we observed traffic patterns that were several times greater than previous peaks. This increase in traffic began to overload a datastore responsible for our rewards functionality. As latency increased on this database, related services became saturated and started to deplete resources as well. This resulted in a chain of failures and a more widespread outage.

Query capacity to key database cluster

The second outage was also triggered by a spike in traffic levels. In the early afternoon, engineers were alerted that our payment processing was being similarly overloaded. Unfortunately, an automated maintenance event that was already underway slowed our ability to scale this cluster up to meet with demand, and a set of failures similar to those that occurred during the first outage followed.

Elevated query latency for Payments cluster

In this instance, the servers that power our logged-out experience were also affected. As these servers became overwhelmed, they were unable to serve new traffic and were ultimately marked by our load balancer as unhealthy and removed from its pool, causing coinbase.com to become unavailable to users who were logged out or who were attempting to log in. Other impacted functionality included the ability to buy, sell, and trade in both Coinbase’s retail application as well as Coinbase Pro.

At 2:32pm PT, our services returned to normal operation.

Resolution & Improvements

For the first outage, once the caching changes were deployed, the rewards database was scaled up, and additional replicas became available. Afterwards, our system was able to resume normal operation.

To resolve the second outage, we upgraded the under-capacity payments cluster to a larger instance size and introduced additional read-only replicas.

To prevent similar issues in the future, we are taking several additional actions:

  1. Reorganizing our largest services: we will continue to shard and isolate our largest services to avoid hitting limits like those mentioned previously.

  2. Enhanced load testing: we’re enhancing our load testing framework to be more representative of new traffic patterns that we saw during this event.

  3. Additional scaling: we are further scaling several of our databases that we observed operating close to limits at Wednesday’s elevated traffic levels.

We take the uptime and performance of our infrastructure very seriously, and we’re working hard to support the millions of customers that choose Coinbase to manage their cryptocurrency. If you’re interested in solving scaling challenges like those presented here, come work with us.

was originally published in The Coinbase Blog on Medium, where people are continuing the conversation by highlighting and responding to this story.

Popularne wiadomości

How to Set Up and Use Trust Wallet for Binance Smart Chain
#Bitcoin#Bitcoins#Config+2 więcej tagów

How to Set Up and Use Trust Wallet for Binance Smart Chain

Your Essential Guide To Binance Leveraged Tokens

Your Essential Guide To Binance Leveraged Tokens

How to Sell Your Bitcoin Into Cash on Binance (2021 Update)
#Subscriptions

How to Sell Your Bitcoin Into Cash on Binance (2021 Update)

What is Grid Trading? (A Crypto-Futures Guide)

What is Grid Trading? (A Crypto-Futures Guide)

Rozpocznij darmowy handel z Cryptohopperem!

Bezpłatne użytkowanie - karta kredytowa nie jest wymagana

Zaczynajmy
Cryptohopper appCryptohopper app

Informacja: Cryptohopper nie jest regulowanym podmiotem. Handel kryptowalutami za pomocą botów wiąże się z dużym ryzykiem, a wcześniejsze wyniki nie gwarantują przyszłych rezultatów. Prezentowane zyski na zrzutach ekranu produktu mają charakter ilustracyjny i mogą być zawyżone. Podejmuj handel botami tylko wtedy, gdy posiadasz odpowiednią wiedzę lub skonsultuj się z wykwalifikowanym doradcą finansowym. Cryptohopper nie ponosi odpowiedzialności za (a) jakiekolwiek straty lub szkody, całkowite lub częściowe, wynikające z transakcji z wykorzystaniem naszego oprogramowania lub (b) jakiekolwiek szkody bezpośrednie, pośrednie, specjalne, wynikowe lub przypadkowe. Pamiętaj, że treści dostępne na platformie handlu społecznościowego Cryptohopper są tworzone przez członków społeczności Cryptohopper i nie stanowią porad lub zaleceń ze strony Cryptohopper. Zyski prezentowane na Rynku nie są gwarancją przyszłych wyników. Korzystając z usług Cryptohopper, akceptujesz ryzyko związane z handlem kryptowalutami i zobowiązujesz się do niepociągania Cryptohopper do odpowiedzialności za ewentualne straty. Przed korzystaniem z naszego oprogramowania lub podjęciem jakiejkolwiek działalności handlowej, konieczne jest zapoznanie się z naszymi Warunkami świadczenia usług i oświadczenie dot. ujawniania ryzyka. Skonsultuj się z prawnikami i doradcami finansowymi, aby uzyskać porady dostosowane do Twojej sytuacji.

©2017 - 2024 Copyright by Cryptohopper™ - Wszelkie prawa zastrzeżone.