Google AI
The Times Australia

Times Media Advertising

A Meta outage hit Facebook, Instagram, WhatsApp and more. Here’s what we know so far

  • Written by: Toby Murray, Associate Professor of Cybersecurity, School of Computing and Information Systems, The University of Melbourne

A major outage is affecting users of popular social media and messaging services including Facebook, Instagram and WhatsApp around the globe. All these platforms are run by the social media giant Meta.

As news of the outage spread, we learned that it affected almost all of Meta’s products, including Messenger and Threads, as well as Meta’s business products, such as Facebook Ads Manager and the Messenger API for Instagram.

Most services are beginning to come back online. But what went wrong, and what can we learn from this massive outage?

The scope of the outage

Outages have been reported from the United Kingdom[1] to Canada[2] to the United States[3] and beyond.

The outage was first reported in the US on Wednesday (around 12.30pm in New York, 5.30pm in London, or 4.30am Thursday in Sydney).

Five hours later, Meta posted to X[4] to say it was 99% of the way to resolving the outage.

What might have caused it?

At the moment, there has been no official word on the cause of the outage. However, we can make some educated guesses based on its scope.

From reporting so far, the outage covered not only Meta’s major social media platforms and messaging services, but also some of its business products. It also affected Meta’s Login with Facebook service, which allows users to log in to third-party sites using their Facebook username and password.

Screenshot of a web page with a list of business tools showing statuses such as 'Major disruption' or 'Resolved'.
Meta’s business product status page showed outages in several services. Meta[5]

In other words, there seem to be very few Meta products this outage did not impact.

That suggests that whatever went wrong was a single point of failure[6]: something relied upon by all of Meta’s services, without which the services can’t function.

Design for reliability

These kinds of outages are rare. That’s because major internet platforms are designed to be highly reliable.

The main way reliability is achieved is through replication. When you visit Instagram, for example, your computer connects to a server that sends back your Instagram feed. In fact, Instagram content is not stored on just one computer but is replicated across a massive array of computers known as a content delivery network (or CDN).

Practically all major web platforms, including news sites such as The Conversation, large companies, and online services such as YouTube and Google, use content delivery networks to increase the reliability and efficiency of their websites.

The idea behind a content delivery network is that if one computer in the network has a problem, another can take over in its place. This is what makes the networks reliable.

Content delivery networks also help when websites are under heavy demand. If many people are trying to request the same content, those requests can be spread out between many computers in the network, allowing each to be handled efficiently.

The widespread nature of Meta’s outage suggests it might have happened in a part of Meta’s systems that wasn’t replicated. However, we’ll have to wait for word from Meta on the causes before we will know for sure.

Lessons to be learned

Meta’s outage comes in the wake of the major outage caused earlier this year by CrowdStrike’s Falcon[7] security software. Falcon’s design meant it was deeply entangled with Microsoft Windows. That made Falcon a single point of failure so that, when it crashed, it brought down Windows as well – in spectacular fashion.

A key lesson from this outage was that invasive security software such as Falcon should be re-engineered[8] to operate at arm’s length of Windows. This idea is known as fault isolation, which says that systems should be built as a collection of separate components so that if one component fails it cannot cause the entire system to fail.

This is the reason why modern ships are designed to have multiple internal compartments, with mechanisms to try to make each compartment watertight. That way, if the ship’s hull is breached, water cannot flood the entire ship.

Meta’s outage is a timely reminder of the need to engineer critical systems to maximise their reliability, including minimising central points of failure and employing engineering principles like fault isolation.

Looking ahead

In the meantime, the precise cause of Meta’s outage remains to be determined.

Many people all over the world rely on Meta’s services. These include businesses using Instagram as their primary platform for engaging customers online, or merchants using Facebook Marketplace as a key revenue stream. For many families, WhatsApp has become an indispensable way to keep in contact, especially during times of crisis.

We can only hope Meta will be forthcoming about the causes of this outage and the measures it will put in place to make sure it cannot happen again.

References

  1. ^ the United Kingdom (www.standard.co.uk)
  2. ^ Canada (vancouver.citynews.ca)
  3. ^ United States (www.indystar.com)
  4. ^ posted to X (x.com)
  5. ^ Meta (www.metastatus.com)
  6. ^ single point of failure (en.wikipedia.org)
  7. ^ CrowdStrike’s Falcon (theconversation.com)
  8. ^ re-engineered (theconversation.com)

Read more https://theconversation.com/a-meta-outage-hit-facebook-instagram-whatsapp-and-more-heres-what-we-know-so-far-245834

Times Magazine

Quickest Way of Getting Rid of Your Old Cars in Brisbane?

If you are done searching for a practical solution for quickly getting rid of your old car, this w...

The Human Supplement Craze Has Officially Gone to the Dogs (Literally)

Australians’ appetite for supplements is no longer limited to their own vitamin cabinets. New reta...

AI Guilt: It’s Real — But it is irrational

Artificial intelligence is rapidly becoming one of the most powerful tools ever made available to ...

Australians Are Keeping Their Cars Longer — And It’s Changing The Market

Australia’s car market is undergoing a subtle but important transformation. People are keeping th...

Streaming Fatigue: Australians Overwhelmed By Subscriptions

Streaming was once supposed to simplify entertainment. Instead, many Australians now feel overwhe...

Why Shopping Centres No Longer Feel Exciting

There was a time when going to the shopping centre felt like an event. Families spent entire Satu...

The Times Features

The Blood Test That Could Change Colon Cancer Screening…

A simple blood test that may one day reduce the need for colonoscopies is generating enormous inte...

Recovering at Home After Surgery: The Role of Mobile Re…

Recovering from surgery can be both physically and emotionally challenging. Whether it is a joint ...

Children and Screens: The Growing Health Challenge Faci…

Once upon a time, parents worried that children spent too much time reading books indoors instead ...

FIRE PIT CINEMA. A New Winter Ritual Comes to Canberra

A Winter Night of Mulled Wine, Firelight & Christmas Movies Canberra, Wednesday 27th May - Fo...

Why Professional House Painting in Melbourne Adds Long-…

There is a particular kind of frustration about which Melbourne homeowners rarely talk about openl...

Residential HVAC Systems in Australia: What Homeowners …

Australia’s residential HVAC market is evolving rapidly as households face hotter summers, rising ...

The Biden Administration: Did The Inquiry Establish Who…

Questions surrounding former US President Joe Biden and his health while in office continue to dom...

Nationals move Bill to protect women. Sall Grover inter…

Matt Canavan  All good. Look, well, it's great to be here with my friend and colleague, Alison Pe...

The Human Supplement Craze Has Officially Gone to the D…

Australians’ appetite for supplements is no longer limited to their own vitamin cabinets. New reta...