Sep 20, 2009

Gmail Down? Outage Spreads Confusion


-E-mail Outage Affects 'Majority' of Users; Google Explains, Apologizes
Facebook
Twitter
Reddit
StumbleUpon
More

It only lasted for an hour and a half, but yesterday's Gmail outage was an online hiccup felt 'round the world.
he popular e-mail service offered by Google, went down for a "majority" of users yesterday. By late afternoon, eastern time, the company said, "We've fixed the issue, and Gmail should be back up and running as usual."

Google said it made a mistake during some routine system maintenance, and apologized profusely: "Today's outage was a Big Deal, and we're treating it as such," said Ben Treynor, a Google vice president who is Site Reliability Czar (yes, Googlers have titles like that).

What happened? "We took a small fraction of Gmail's servers offline to perform routine upgrades," wrote Treynor in an explanation on Google's Gmail blog, which the company e-mailed to ABCNews.com. "This isn't in itself a problem -- we do this all the time, and Gmail's web interface runs in many locations and just sends traffic to other locations when one is offline.
"However, as we now know, we had slightly underestimated the load which some recent changes (ironically, some designed to improve service availability) placed on the request routers -- servers which direct web queries to the appropriate Gmail server for response. At about 12:30 pm Pacific a few of the request routers became overloaded and in effect told the rest of the system 'stop sending us traffic, we're too slow!'."

From there it was a cascade effect, and pretty soon most of the system was down.


0 comments: