by crtasm 1578 days ago
Trying gopher://gopherpedia.com with a page linked on HN: https://en.wikipedia.org/wiki/Belling_the_cat

Any ideas why it's missing the first paragraph? I didn't spot a page revision without it.

2 comments

I don't have time to look too closely into it, but my guess is that it's a bug somewhere in the parser: https://github.com/muffinista/gopherpedia.com/blob/main/pars...

The markup behind that article isn't terribly complicated, but the parser isn't perfect either. For example: https://gopherpedia.com/gopherpedia.com/Gustave_Dore has some glitches in the lead sentence.

My suspicion was that it's getting confused by the line above the intro, but I pasted the article's source code into Wikipedia:Sandbox and gopherpedia read the intro just fine. Strange!

Aha! Here we go. Compare:

- https://gopherpedia.com/gopherpedia.com/Belling_the_Cat

- https://gopherpedia.com/gopherpedia.com/Belling%20the%20Cat

Without underscores in the URL, the first paragraph disappears. Bizarre.
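
One way to see why the two links might be handled differently: after percent-decoding, they produce different title strings, even though Wikipedia itself treats spaces and underscores in titles as equivalent. The sketch below is only an illustration in Python, not gopherpedia's actual code; `title_from_url` and `normalize_title` are hypothetical helpers, and the idea that a lookup or cache keyed on the raw title causes the missing paragraph is an assumption the thread does not confirm.

```python
from urllib.parse import unquote, urlsplit


def title_from_url(url: str) -> str:
    """Return the percent-decoded last path segment of the URL."""
    path = urlsplit(url).path
    return unquote(path.rsplit("/", 1)[-1])


def normalize_title(title: str) -> str:
    """Hypothetical normalization: treat spaces and underscores as the same."""
    return "_".join(title.replace("_", " ").split())


a = title_from_url("https://gopherpedia.com/gopherpedia.com/Belling_the_Cat")
b = title_from_url("https://gopherpedia.com/gopherpedia.com/Belling%20the%20Cat")

print(repr(a))  # title with underscores
print(repr(b))  # title with spaces
print(a == b)   # the raw titles differ
print(normalize_title(a) == normalize_title(b))  # they match once normalized
```

If the site keys anything on the raw string, the two forms would be treated as different articles; normalizing before the lookup would make them the same.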

Looks like the first paragraph was a recent edit. I'm not sure how gopherpedia works but perhaps it only syncs periodically.

That was my thinking too but I can't find a revision without it on the first page of results going back to 2019. Why does it look like a recent edit to you?