WebKit Quirks | HN Mirror

    shouldBypassBackForwardCache()
    // Google Docs used to bypass the back/forward cache by serving "Cache-Control: no-store" over HTTPS.
    // We started caching such content in r250437 but the Google Docs index page unfortunately is not currently compatible
    // because it puts an overlay (with class "docs-homescreen-freeze-el-full") over the page when navigating away and fails
    // to remove it when coming back from the back/forward cache

Millions of pages have this bug, because of Safari's broken navigation. Nice that the big players get the browser to fix it for them. For instance, a common issue is you click a button that becomes disabled and shows a spinner while working, before forwarding to a new page. If you click back from the new page, Safari will render the previous page exactly as it was when leaving, so in a broken loading state (instead of starting it from scratch).

marcellus23 1950 days ago

What makes it broken, out of curiosity? Is there a spec anywhere that suggests that behavior is incorrect? Or is it just because it's not what Chrome does?

goranmoomin 1950 days ago

Yeah, I'm interested too. I might be wrong, but AFAIK there aren't any specs on how a browser should implement forward/back buttons, right?

I'm personally getting a ton of mileage out of Safari's much more stable forward/back cache. The fact that you can go back reliably gives me more comfort than other browsers, where going back usually refreshes the page (although I can't really explain why this is so much better). I personally feel that this is more of a web app bug than a browser bug.

capableweb 1950 days ago

The closest you get for how browsers should act regarding history is part of the HTML spec here: https://html.spec.whatwg.org/multipage/history.html

Of course, the exact implementation is not specified; browsers are free to either implement cached behavior, which I think Firefox does as well, or just do a naive refresh.

https://html.spec.whatwg.org/multipage/browsing-the-web.html...

tinus_hn 1949 days ago

I think the closest is the pagehide event

Which appears to be supported in Safari, just like all the other browsers.

gsnedders 1949 days ago

There's a metabug on HTML at https://github.com/whatwg/html/issues/5880 about defining how various platform-exposed features behave in the face of a bfcache (backwards/forwards cache).

m_eiman 1950 days ago

Why shouldn't it show it in the same state? Seems like a reasonable thing to do.

If it was a static page, then sure. But for dynamic pages or SPAs it more often than not leads to going back to a page in a broken state. Other browsers have better heuristics for when this cache is used. So Safari's behavior is unexpected, so much so that even the big players seem to be taken by surprise. I don't really mind either way; the main problem is that it's inconsistent.

It's not a huge deal, but it's just one of many small things making Safari annoying when developing. Especially since it cannot be tested without owning an Apple device.

arghwhat 1950 days ago

GNOME Web is a WebKit browser, to name one example of a non-Apple browser.

From my perspective this is an application bug, and relying on heuristics is a bad idea. If a change should be made, it should be to make it explicitly the web app's task to handle on its own.

capableweb 1950 days ago

> But for dynamic pages or SPAs it more often than not leads to going back to a page in a broken state.

Definitely a page concern, not the browser's. This problem is also easy to fix, and a solution has been known for many years by now: always handle navigation via the URL! (Or, the modern take: via the history API.) A modal opens? The URL should change, and because the URL changed to that specific path, show the modal. Now users can bookmark or go back/forward without any issues of pages being broken.
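
A rough sketch of what "derive the modal from the URL" could look like, assuming a made-up path convention of `/modal/<name>` at the end of the path. The function name `modalFromUrl` and the example URLs are hypothetical, not anything from a real app:

```typescript
// Derive UI state from the URL instead of from in-memory flags.
// Assumption: a path ending in /modal/<name> means "show the modal <name>".
function modalFromUrl(url: string): string | null {
  const match = new URL(url).pathname.match(/\/modal\/([^/]+)\/?$/);
  const name = match?.[1];
  return name === undefined ? null : decodeURIComponent(name);
}

// In a browser (not executed here) the wiring would be roughly:
//   history.pushState({}, "", "/docs/modal/share");    // when opening the modal
//   window.addEventListener("popstate", () => {
//     render(modalFromUrl(window.location.href));      // render() is hypothetical
//   });
```

Because the view is recomputed from the URL on every navigation, going back or forward cannot leave a stale modal on screen.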

m_eiman 1950 days ago

Conceptually I'd say that clicking a link and then clicking Back should be the same as right-clicking the link and opening it in a new window, and then closing that window.

But it wouldn't surprise me if "web apps" make this hard for some reason.

jay_kyburz 1949 days ago

In my mind the forward back buttons apply to the URL. So you are going back and forward in URL history, as if you were typing the URL in fresh each time. In the old days the back forward buttons were right there next to the URL on the toolbar.

It just occurred to me that on mobile, the back button is not associated with the URL at all, so it's not surprising that people don't associate it with that anymore.

I think Safari on desktop hides the URL as well.

tannhaeuser 1950 days ago

No, it's just one of too many things, big and small, making browser web apps annoying. The entire point of a browser was a relatively simple viewer app that renders docs ok on most devices; not an opinionated renderLikeChrome mode. If the basic concept of leaving a page for linked content, then coming back can't be handled without heuristics, then clearly the web app model is broken af.

timw4mail 1950 days ago

Sounds more like an issue with the web apps to me.

unilynx 1950 days ago

There is no spec to conform to to work around these cache issues. (IE was even worse in the past, shutting down the back forward cache if devtools were opened. Have fun debugging that)

But imagine Windows opening an app, drawing the last known interface state and then skipping half of the app startup code. Should apps deal with that too, or would it be considered a Windows bug?

JimDabell 1950 days ago

The spec doesn’t cover this case explicitly, but the general gist of RFC 2616 is very much on the side of “don’t reload things”:

> History mechanisms and caches are different. In particular history mechanisms SHOULD NOT try to show a semantically transparent view of the current state of a resource. Rather, a history mechanism is meant to show exactly what the user saw at the time when the resource was retrieved.

Source: https://tools.ietf.org/html/rfc2616#section-13.13

If a web application depends on the browser reloading the page when the user presses the back button, then I think it’s fair to call that a bug in the web application. It is “trying to show a semantically transparent view of the current state of the resource”, which is explicitly called out as incorrect behaviour by the spec.

my123 1950 days ago

Modern applications for Windows (UWP) and iOS do use tombstones. The app's memory itself is completely suspended/saved to disk, and then the state is restored. The app startup code is _not_ called again.

ksec 1950 days ago

>Especially since it cannot be tested without owning an Apple device.

Yes, very annoying. They don't have to bring Safari to Windows, but at least WebKit on Windows would be nice for testing. In the meantime, there is Otter as a cross-platform browser [1], or you could run GNOME Web under WSL2 on Windows.

[1] https://github.com/OtterBrowser/otter-browser

my123 1950 days ago

You can test GNOME Web (Epiphany) just fine, which does use the same engine.

GranPC 1950 days ago

I'm actually having a similar issue with my web app currently and I'm not sure what the best way to solve it would be. I was thinking of setting a checkbox in an invisible form when the page loads initially, and force a real reload if the checkbox was previously set, but that seems like a terrible hack. Any ideas?

We use this to force a refresh: https://stackoverflow.com/a/13123626/923847

Or you can use that event to fix what's wrong on the page without a refresh if possible (remove a modal, enable the button again etc)
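
For reference, a minimal sketch of both options, assuming the event in question is the standard `pageshow` event, whose `persisted` flag is true when the page was restored from the back/forward cache. The helper name `actionForPageShow` and the `preferReload` switch are made up for illustration:

```typescript
// Only the part of PageTransitionEvent this sketch needs.
interface PageShowLike {
  persisted: boolean; // true when the page came back from the bfcache
}

type BfcacheAction = "none" | "repair" | "reload";

// Decide what to do when the page is shown (hypothetical helper).
function actionForPageShow(event: PageShowLike, preferReload: boolean): BfcacheAction {
  if (!event.persisted) return "none"; // normal load: startup code already ran
  return preferReload ? "reload" : "repair";
}

// In a browser (not executed here) the wiring would be roughly:
//   window.addEventListener("pageshow", (event) => {
//     const action = actionForPageShow(event, false);
//     if (action === "reload") window.location.reload();
//     if (action === "repair") { /* re-enable the button, hide the spinner */ }
//   });
```

The "repair" branch is where you would undo whatever transient state the page was left in, such as a disabled button or a loading overlay.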

GranPC 1950 days ago

Great! Looks like a proper way to do what I was trying to do. Thank you!

My web app is a game, so fixing everything without a refresh is unfortunately pointless for the most part, and forcing a refresh also ensures players are running the latest version if they've been away for a while.

zachrip 1950 days ago

I've actually been dealing with this issue. Does anyone have an easy way to resolve it? It causes some pretty nasty rendering issues in our app.

https://stackoverflow.com/a/13123626/923847

You can force a refresh when the page is navigated back to. Or use that event to fix whatever state you're in.

zachrip 1950 days ago

Full refresh defeats the purpose of a spa unfortunately. I just tested that event and it actually doesn't appear to fire when navigating backwards, only when the page is initially shown...which contradicts the comments in that code.

Maybe you misunderstand the original issue. This is when navigating back to a "new" page, not internally in a SPA. But this issue mainly happens when navigating back to a page with dynamic behavior (typically a SPA or other interactive application).

https://github.com/WebKit/WebKit/blob/f43587ec2416b86eecef50...

jakub_g 1950 days ago

There was a viral tongue-in-cheek tweet this month "how to prevent scrolling in Safari when..."?

1. Buy Zillow. Destroy the company.

2. Redirect your website to zillow.com

(it was more fun than what I wrote)

TonyTrapp 1950 days ago

I really dislike this sort of quirk workarounds, not just in web browsers. It just makes everything complicated. Someone else using the exact same code will get different behaviour, just because they serve it from a different domain. I did similar workarounds before (not related to websites but to specific files), and I felt very bad about it even though it was clear that no more files in that specific format would ever be generated again.

duckerude 1950 days ago

> domain.endsWith("hulu.com")

Huh, does that mean it would also apply on "thisisnothulu.com"?

Most other endsWith calls seem to do e.g. `domain.endsWith(".hulu.com")` to only match subdomains.
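
To make the difference concrete, here is a small sketch of the two styles of check. It is written in TypeScript rather than WebKit's C++, and the helper names `naiveMatch` and `isDomainOrSubdomain` are made up for illustration:

```typescript
// Mirrors the style of `domain.endsWith("hulu.com")`: a plain suffix test.
function naiveMatch(host: string, domain: string): boolean {
  return host.endsWith(domain);
}

// Stricter variant: the host is the domain itself or a real subdomain of it.
function isDomainOrSubdomain(host: string, domain: string): boolean {
  const h = host.toLowerCase();
  const d = domain.toLowerCase();
  return h === d || h.endsWith("." + d);
}
```

With these, `naiveMatch("thisisnothulu.com", "hulu.com")` is true, while `isDomainOrSubdomain("thisisnothulu.com", "hulu.com")` is false and `isDomainOrSubdomain("www.hulu.com", "hulu.com")` is true.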

https://github.com/WebKit/webkit/blob/master/Source/WebCore/...

Sephr 1950 days ago

This is to enable Safari's legacy EME implementation. I wonder if there are any vulnerabilities waiting in those unmaintained legacy codepaths

I first noticed this bug a year ago last February and it's been unchanged ever since.

richdougherty 1949 days ago

Definitely a vulnerability there exploitable in concert with the error in the domain name check.

dillondoyle 1949 days ago

good catch.

just on a quick glance a few lines above might possibly be a good vector to test for ads to get around autoplay sound restrictions. make a domain ending with somethingnetflix.com, iframe it, and maybe figure out if the second link below has a class that allows override to allow autoplay sound without user interaction to something like kWKWebsiteAutoplayPolicyAllow with sound on.

https://github.com/WebKit/WebKit/blob/f43587ec2416b86eecef50... https://github.com/WebKit/WebKit/blob/88278b55563e5ccdc0b341...

richdougherty 1948 days ago

I've created a WebKit bug report for this so they can fix it: https://bugs.webkit.org/show_bug.cgi?id=222130

Sayrus 1950 days ago

Damn, it seems you're right and it applies to any domain ending with this instead of hulu.com subdomains.

From the name of the quirk, I'm not sure this is an issue though.

0x0 1950 days ago

I bet it's fun being responsible for developing and deploying on those sites. Works in CI and dev, but deploying to production makes the browser behave differently! Nice surprise!

londons_explore 1950 days ago

A lot of the domain filters use things like:

topPrivatelyControlledDomain(url.host().toString()).startsWith("google.")

The definition of `topPrivatelyControlledDomain` means that `google.github.io` would qualify, or `google.works.aero`... Pretty much anybody can abuse that to get any of the quirks modes available in this file.

See the full list here:

https://publicsuffix.org/list/public_suffix_list.dat
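
A sketch of why that happens, assuming `topPrivatelyControlledDomain` returns the public suffix plus one more label, as described above. The suffix set below is a tiny hand-picked sample built from the examples in this comment, not the real list, and the function names are made up:

```typescript
// Tiny sample of public suffixes (assumption: real code consults the full list).
const SAMPLE_PUBLIC_SUFFIXES = new Set(["com", "io", "github.io", "aero", "works.aero"]);

// Registrable domain = longest matching public suffix plus one more label.
function registrableDomain(host: string): string | null {
  const labels = host.toLowerCase().split(".");
  for (let i = 0; i < labels.length; i++) {
    const candidate = labels.slice(i).join(".");
    if (SAMPLE_PUBLIC_SUFFIXES.has(candidate)) {
      // i === 0 means the host itself is a public suffix.
      return i === 0 ? null : labels.slice(i - 1).join(".");
    }
  }
  return null;
}

// Mirrors the style of the quoted startsWith("google.") filter.
function looksLikeGoogle(host: string): boolean {
  const domain = registrableDomain(host);
  return domain !== null && domain.startsWith("google.");
}
```

Under these assumptions `looksLikeGoogle` accepts `www.google.com`, but it also accepts `google.github.io` and `google.works.aero`, because in each case the registrable domain starts with `google.`.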

https://github.com/WebKit/WebKit/blob/3def0062f77b82a46fc40c...

firloop 1950 days ago

Wow, there's not only _domain_ specificity, but also HTML _element_ specificity in this quirks list.

    // When panning on an Amazon product image, we're either touching on the #magnifierLens element
    // or its previous sibling.
    auto& element = downcast<Element>(*target);
    if (element.getIdAttribute() == "magnifierLens")
        return true;
    if (auto* sibling = element.nextElementSibling())
        return sibling->getIdAttribute() == "magnifierLens";

andrekandre 1949 days ago

am i crazy or does this seem not scalable?

at what point does it make more sense to just have a wasm "html-lib" provided by a specific site that it can depend on instead of burdening webkit/blink with all these unsavory hacks?

if html-lib was versioned and slimmed down to remove old hacks, it could be small enough to download quickly, and with hashing could ensure other pages that use the same version dont have to re-download again...

at some point we could have other web front ends than html that can represent "apps" on the web better too...

is that a crazy idea?

kgin 1950 days ago

It's like that saying about debt.

When a browser renders your 500 mau site badly it's your problem.

When a browser renders your 50,000,000 mau site badly it's the browser's problem.

IMTDb 1950 days ago

That's the new "you have made it" moment: when they need to change the browser engine for your website.

mappu 1950 days ago

I was surprised to see they're almost exclusively anglosphere websites, i would have guessed a broader variation

capableweb 1949 days ago

The web is surprisingly segregated. It seems there are at least three versions of everything (from my perspective), from websites like the typical social network to utilities people use day to day: English/Spanish/Chinese.

sdflhasjd 1950 days ago

Domain name specific quirks?

What in the world...

robin_reala 1950 days ago

Oh, WebKit are absolutely not the only people doing this. Opera used to with their Presto engine, and I’m pretty sure I’ve seen a similar list in Gecko, though I can’t find it now.

At the end of the day, this is the only way that non-Chrome browsers can meet Google’s hegemony, unless they give up and adopt Chromium itself. The opportunity cost of switching is too low for browser manufacturers not to have these workarounds; if a site is broken for a user, then they’ll change browser.

gsnedders 1949 days ago

> Oh, WebKit are absolutely not the only people doing this. Opera used to with their Presto engine, and I’m pretty sure I’ve seen a similar list in Gecko, though I can’t find it now.

This is the last (shipped) Presto one for desktop: https://web.archive.org/web/20190204112249if_/https://github...

Perhaps more surprisingly, this continued into Chromium-based Opera (though archive.org seems not to have that, but that's the OPRdesktop directory in that repo); this was primarily down to sites with a UA string allowlist, and having to lie to get in. (Chromium-based Edge also has a means to override the UA string.)

sdflhasjd 1950 days ago

Yes, it is a real shame, but then again, a lot of these sites are made by reasonably big companies.

I'm sure trello and such could fix whatever these input quirks are.

Then there's autoplay specific behaviour on facebook, twitter and netflix. Is this really a google hegemony thing, or is this leniency that other sites don't get?

I'm just trying to see if there's similar examples in Blink & Gecko right now.

erichurkman 1950 days ago

> if (host == "trailers.apple.com")

> return true;

Even Apple themselves are not immune.

abrowne 1950 days ago

And icloud.com: https://github.com/WebKit/WebKit/blob/21c441ed8ddc83f3e24ad5...

robin_reala 1950 days ago

I work for a big company. There have been plenty of outstanding bugs in my company’s sites and apps, because the people that care aren’t in teams that own the systems with bugs in, or aren’t in a position to have their voices heard, and that’s before the hydra of ”legacy software” rears its many-consultanted head.

(at least in my org we’re generally better at this now)

ksec 1950 days ago

Oh yes, as it was in Firefox when they had IE quirks. And I am now leaning towards thinking that maybe it is the standard's fault, not the implementation's.

dastx 1950 days ago

> At the end of the day, this is the only way that non-Chrome browsers can meet Google’s hegemony

Except these quirks include a whole lot more than just Google. Some of the domains in there:

1. nytimes

2. twitter

3. ralphlauren

4. baidu

5. warbyparker

6. nfl

7. gizmodo

8. microsoft

I'm not sure what each different piece of code does, but there are many more domains in there.

robin_reala 1950 days ago

By Google’s hegemony, I was talking about developers only testing in Google-developed rendering engines, not their web properties.

dastx 1950 days ago

Aah! Pardon me. That makes more sense.

livre 1950 days ago

This isn't new, I remember Opera (versions 12 and older) used to come with a privileged .js file that would apply patches to websites. It was used mostly to fix those that blocked Opera when they detected its user agent or popular websites that ran code using proprietary functions or css properties (things that only Chrome or Internet Explorer implemented but weren't part of the standard).

shay_ker 1950 days ago

How do these domain-specific quirks get into WebKit? Is it advocated by the companies? By the users? By the devs?

I'm just curious how these changes actually make it to millions of people's browsers

I’d suspect this is driven more by using those sites as internal test cases. Every browser vendor since Firefox broke IE’s total domination has had a “don't break the web” priority to some extent. Using large scale real world examples to validate that has the added bonus of revealing cases where you did, in fact, break a significant (by usage) portion of the web.

jonnypotty 1950 days ago

My thoughts exactly. So fragile. I'm so grateful I don't have to do web dev any more, what a mess.

TazeTSchnitzel 1950 days ago

Anyone who's read a few Old New Thing posts would know that Windows must be full of similar checks.

colejohnson66 1950 days ago

IIRC, the “checking for a solution to the problem” dialogs were added because they (Microsoft) would submit an actual bug report to the developers, and if they offered a solution, Microsoft’s servers would respond with it. I’ve never seen it work, but IIRC, they added it back in the Windows 95 days (when there was a lot less software to deal with).

AshamedCaptain 1950 days ago

That was a Vista thing, definitely not 95. I have seen it working once, but I don't remember what the program was.

colejohnson66 1950 days ago

That was Vista? Wow. I was way off...

chris_wot 1950 days ago

They really need to get rid of this.

gruez 1950 days ago

You can disable it using group policy https://admx.help/?Category=Windows_10_2016&Policy=Microsoft...

anaisbetts 1949 days ago

Windows has tens of thousands of them. However, the vast majority of them unlike this quirks file are very very specifically gated to an explicit version range, file name, product name, etc etc, typically with consent of the app manufacturer.

throwaway744678 1950 days ago

Wow, there's a nice 77-character function name [1]! Yes, with a typo. Hard to keep the lines under 80 characters...

[1] https://github.com/WebKit/WebKit/blob/main/Source/WebCore/pa...

RandallBrown 1950 days ago

I wonder if the typo was done on purpose given the point of the function is to suppress autocorrection.

shadowgovt 1949 days ago

The thing about network effects is this:

When you're making a new technology to interact with existing technology, and following the specification results in your new technology failing to work with what's already there, nobody will care if you blame the standard or everyone's mis-implementation of the standard.

They'll consider your malfunctioning tool as damage and route around it... Until you become big enough that you can be the existing implementation other people have to adapt to.

Every commonly-used browser either has something like this buried in its implementation or has a date-stamp of first release older than everything else out there.

saagarjha 1949 days ago

It's fun (and a little depressing) to look at this list, but it exists in basically every popular project. Unfortunately, the incentives are set up for this to essentially be necessary: if something popular doesn't work in your software, users are going to think your code is the thing that is broken, not the thing they are trying to use. So you have a sort of "tragedy of the commons" where if you keep a principled position, a user is going to switch to your competitor that supports quirks to get the thing to work.

jaywalk 1950 days ago

Interesting to see www.icloud.com in there...

spectramax 1947 days ago

We need something new here. This is not scalable. If a group of engineers can't write a new browser on a sabbatical, then we need to change what a browser (and the spec) should be.

Access to the internet should be simple, not complicated, so that many can participate and control stays out of the hands of big corporations.

tabtab 1949 days ago

Web UI "standards" are a friggen mess. We really need to rethink it all. For one, if we had a standard state-ful GUI markup language, we wouldn't need to reinvent so many common GUI widgets and idioms using bloated libraries based on JS + DOM.

Second, if web standards allowed true absolute positioning of vectors (as an option), then the layout engines could reside on servers, allowing us to choose a layout engine that best fits domain and need.

Note that while existing web standards do have some coordinate based features, they are too inconsistent to rely upon. If they were any good, we wouldn't need PDF viewers.

My goodness a coordinates based layout system would be an enormous step backwards. We’d be back to `m.` subdomains and horizontal scrolling as the norm.

The CSS layout standards are certainly not all ideal for my preferences. But the adaptability they afford in fluid layout is far better for real end users than any kind of absolute layout predetermined on a server.

We’ve reached a point I can use most of the web on my phone without compromise and I’d hate to lose that for some development convenience.

tabtab 1949 days ago

I don't think you understand. The layout engine could be on the server, not "non existent". (Although one could program directly with coordinates if they wanted.) Coordinate based vectors allow the layout engine to be on the server, so that we are not stuck with a one-size-fits-all layout engine.

And what's wrong with the "m." standard as an option? For some jobs it's the right tool.

And what's good for public site phone use may not be the right tool for internal CRUD applications. I don't recommend Google make an email client with the additions I suggest, for example. The existing standards are fine for light-input consumer sites, but lousy for productivity-oriented CRUD. I'm not saying get rid of existing standards.

> I don't think you understand. The layout engine could be on the server, not "non existent".

Oh, I do understand which is why the thing in your quotes isn't something I said. Layout on the server means you're laying out without the context of my device or viewport, and certainly without any change of context like if I switch to dark mode or prefers-reduced-motion, or if my data access changes.

> And what's wrong with the "m." standard as an option? For some jobs it's the right tool.

Instead of getting overly principled about it... one of the reasons it went away was because the heuristics that determined what even is mobile were becoming increasingly wrong. And like I said this resulted in a bunch of ridiculous horizontal scrolling for lots of users.

tabtab 1949 days ago

Re: "Layout on the server means you're laying out without the context of my device or viewport"

Why are you making that assumption? A device can either send its screen size to the server, or a size preference category if it wants to hide details: watch, phone, tablet, laptop, desktop, workstation. (And the user should be able to switch the preference manually.)

Re: "And like I said this resulted in a bunch of ridiculous horizontal scrolling for lots of users."

Without seeing a specific scenario, I cannot comment on possible solutions or standard adjustment proposals. Sometimes people throw the baby out with the bathwater even though the baby was fine.

eyelidlessness 1948 days ago

> Why are you making that assumption? A device can either send its screen size to the server, or a size preference category if it wants to hide details

And what happens when I rotate my device? Or resize my window? Or switch to dark mode? Another request to the server to redraw? What happens when an unknown class of device is encountered? What happens when your assumptions about a known class of device aren’t future proof?

Look I agree that the CSS layout APIs aren’t as good as they could be. But I really don’t think making them less flexible is the solution. If anything they’re not flexible enough (for example many kinds of layout are still very difficult to achieve with dynamic content, even with grid). But what would improve the APIs in my opinion is to design them with common idioms as primitives. Grid has somewhat embraced that by allowing template areas to be named arbitrarily. But the underlying APIs are sprawling and hard to understand even with close attention to the docs/spec.

dbbk 1948 days ago

I really don't think you have thought this through.

lxe 1950 days ago

Wow, this is terrible. Either fix the bugs, introduce non-standard behavior for all sites, or expect these big players to fix their own problems.

They likely can’t. What happens when whatever quirk they used to isolate goes global and breaks workarounds on thousands of other sites they didn’t test? I mean, it’s awful that anything like this exists but it’s pretty likely well past any kind of turning back.

Given WebKit’s lineage I have to wonder if some portion of this was inherited from KHTML.