Hacker News new | ask | show | jobs
by csabakissi 1829 days ago
I can't actually imagine something like that can happen. Single person with a simple change in a config can cause this.
4 comments

That was the bug.

A trivial example would be a bug that replaces the configuration for all customers with the last uploaded. Then when the next customer uploads a new (valid!) config, you have a problem.

Obviously it wasn’t that trivial but the point is: it wasn’t the customer’s configuration change that was the problem but some code that managed the config change.

It's more common than we imagine. That's usually the start of many of the historical network incidents. The important part, as usual, is to make sure the remediations of such incidents focus on how to limit blast radius of small changes, and how to accomplish that without imposing artificial gatekeeping and bureaucracy into the change process.
Welcome to the CentralizedWeb (tm). Unfortunately we had to sunset the Internet you used to use.
A web filled with DDOS attacks and scraping is a web that needs cloudflare and fastly. I’m not sure how to avoid this sorry state of things.
Invisible Internet Project (I2P) is decentralized and defends from such attacks quite well.
How is that project doing? It's been around for years and does not come up often.
Not sure why it's not adopted more widely. It's in constant development and is impressive.
I hadn’t heard of I2P -- sounds like it’s similar to Freenet? (originally forked from it, Wikipedia says)

Can it be used as a CDN for a normal website? How well does it perform?

This attitude is why we have only 2½ search engines on the entire Internet. Only Google, Bing, and Yandex run crawlers. Everybody else is just a reseller for them.

Web crawlers are a feature not a bug. If your site shouldn't be crawled, it doesn't belong on the Internet.

Search engines scrapping your content is not the problem. Competitors scraping your content is.
If you don't want your content crawled, don't put it on the public Internet.

Your profitability is not the Internet's problem.

It’s a very interesting proposition.

If you cannot generate revenue by your internet content, probably you can’t live from generating content for the internet.

The consequence, IMHO, is that the internet would have this amount of content and usefulness.

Newspapers? No. Can’t live from internet news if anyone can copy a reporter’s work and post it on his own site and dilute traffic.

Online selling? Don’t look like a viable business model, as anyone can copy the photos you paid a photographer for, the descriptions you paid someone to write and the reviews your customers wrote. True reviews are priceless, you now? Even more now that an AI can detect computer generated reviews.

Obviously an open and totally money-free internet is nice, but it wouldn’t be the internet people make a living from.

They forgot to test it.
Test “it”? The change in question wasn’t by fastly but a customer of theirs making a config change. It’s possible that this customer did validate their change somehow.

Fastly obviously didn’t test their code (with the bug) enough, but testing of course can never prove the absence of bugs. Testing for a global deployment like a massive CDN happens to a large extent in prod because you don’t have another globe. You can test on a smaller scale but eventually you run into a problem that only shows itself at full scale.

Fastly even say it was a valid change.

> We experienced a global outage due to an undiscovered software bug that surfaced on June 8 when it was triggered by a valid customer configuration change.

in the first sentence

Their change was bad, that was May 12. Since that seemed OK on May 13,14,… there wasn’t much indicating that change would blow up weeks later. For example if they roll it out gradually, they would reach 100% rollout with all lights being green

The customer change was a valid configuration. That was yesterday.

Testing is never complete nor can it be theoretically complete