by alright2565 809 days ago
I saw this part:

> Every Discourse Discussion returns data in JSON if you append .json to the URL.

then this:

> Raw data was gathered into a single JSONL file by automating a browser using Playwright.

Kinda seems to me like a whole browser instance isn't necessary for this? I would have been surprised if the .json pattern didn't hold for every page, and it turns out it does also work for the topic list: https://community.openai.com/latest.json

The other place I've seen this sort of API pattern is Reddit. For example, https://www.reddit.com/r/all.json or (randomly chosen) https://www.reddit.com/r/mildlyinfuriating/comments/1bqn3c0/...
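
To make the "no browser needed" point concrete, here is a rough sketch of fetching these endpoints with a plain HTTP request from the Python standard library. The helper names `to_json_url` and `fetch_json`, the User-Agent string, and the timeout are my own placeholders, not anything from the article, and I haven't checked rate limits or what either site requires of clients.

```python
import json
import urllib.request
from urllib.parse import urlsplit, urlunsplit


def to_json_url(page_url: str) -> str:
    """Append .json to the URL path, keeping any query string and fragment."""
    parts = urlsplit(page_url)
    path = parts.path.rstrip("/")  # ".../r/all/" becomes ".../r/all"
    if not path.endswith(".json"):
        path += ".json"
    return urlunsplit((parts.scheme, parts.netloc, path, parts.query, parts.fragment))


def fetch_json(page_url: str, user_agent: str = "json-suffix-demo/0.1"):
    """Fetch the .json variant of a page and parse it (hypothetical helper).

    The explicit User-Agent is an assumption: some sites may reject or
    throttle the default one sent by HTTP libraries.
    """
    request = urllib.request.Request(
        to_json_url(page_url),
        headers={"User-Agent": user_agent},
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)
```

Something like `fetch_json("https://community.openai.com/latest")` should then return the parsed topic list, assuming the endpoint is reachable without authentication. Paging through results and backing off politely are left out of this sketch.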