Hacker News new | ask | show | jobs
by mcintyre1994 1687 days ago
Neat, I didn't know AWS had an AnyCast service like that. The Cloudflare workers issue is a bit disappointing, I remember their blog about eliminating cold starts and how clever it seemed, but I guess it's still not there? On another note have you evaluated fly.io at all for this? This looks right in their wheelhouse of many regions + AnyCast + IPv6, and I could definitely see configuration being simpler with them than AWS. Not sure if they'd meet all your requirements though/how they'd compare on price etc.
2 comments

The cloudflare workers issue should be resolved now, they eliminated cold starts in 2020 (https://blog.cloudflare.com/eliminating-cold-starts-with-clo...) and it seems like he tested it in 2019
Ah thanks, that was the blog I was thinking about but I didn't realise the timing there. That makes sense, I wonder how that would be if they re-tested it now.
The latency that this individual was experiencing with Workers wayyyyy back in 2019 was not caused by cold starts, but rather a bug in the workers runtime which caused rare but long pauses. Specifically, the bug involved a hash map with a broken hash function. Here's me publicly facepalming over the issue at the time: https://twitter.com/KentonVarda/status/1189966124688953344

But that was two years ago, Workers has grown ~an order of magnitude since then (both traffic and team size), and such a bug would be immediately noticed today.

The article author says that after the fix he still wasn't happy with performance, but I don't know what he was experiencing exactly. We've done a lot of other things to improve performance since then; it may very well be better today.

I used cloudflare-workers still this year and only switched a couple of months ago: https://securitytrails.com/domain/url.rw/history/a

The performance was still very bad from a 95 percentile perspective.

This isn't the feedback we get from most customers, so I guess there was something unusual going on. Sorry we weren't able to dig into this while it was still relevant to you.
> https://twitter.com/KentonVarda/status/1189966124688953344

What exactly was the bug though? bool effectively reducing hash(key) to 0 or 1?

Yes. So almost all keys had a hash code of 1, forcing linear search lookups. Insidious since the code still works, it's just slow -- and only really slow in production, after seeming fine in tests that only put a few items in the map.