Hacker News
by typis7 3974 days ago
Where is the information being scraped from?

I looked at the source but I'm hardly a JS/Web expert.

Seems to be from here:

https://meta.wikimedia.org/wiki/Special:RecentChanges

https://wikitech.wikimedia.org/wiki/RCStream

but I only found a list of odd-looking addresses: https://github.com/hatnote/listen-to-wikipedia/search?utf8=%...

1 comment

Those addresses are WebSocket servers that seem to publish Wikipedia changes. The app just subscribes to the changes by opening a WebSocket connection to the appropriate server.
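For anyone unfamiliar with the pattern, here is a rough sketch of what "subscribing" looks like on the client side. This is not code from the repo: the function name, the placeholder URL, and the assumption that each message is a JSON object are all mine, so treat it as an illustration only.

```javascript
// Hypothetical sketch of subscribing to a change feed over a WebSocket.
// Assumptions (not from the repo): each message is one JSON object,
// and the server URL is supplied by the caller.
function subscribeToChanges(url, onChange, WebSocketImpl = globalThis.WebSocket) {
  // Opening the socket is the whole "subscription"; the server pushes from here on.
  const socket = new WebSocketImpl(url);

  socket.onmessage = (event) => {
    let change;
    try {
      change = JSON.parse(event.data);
    } catch (err) {
      return; // skip frames that are not valid JSON
    }
    onChange(change);
  };

  socket.onerror = (err) => console.error("feed error:", err);
  return socket;
}

// Usage (placeholder address, not a real server):
// subscribeToChanges("ws://example.invalid:9000", (change) => console.log(change));
```

The third parameter only exists so the socket implementation can be swapped out (for a test double, or in an environment without a global WebSocket); in a browser you would leave it off.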

Looking at their other repos and the pages you linked, it looks like the project that publishes the changes is https://github.com/hatnote/wikimon, which in turn gets them from an IRC feed described at https://meta.wikimedia.org/wiki/Research:Data#IRC_Feeds