I built lightrag-snkv, Basically it uses lightRAG https://github.com/HKUDS/LightRAG ,this requires various storage databases like key value store, graph database, vector database, I built single embedded file based database which covers all these requirements: https://github.com/hash-anu/snkv.
And developed ingestor which ingest data from Algolia's HN search API. Here auto ingestion timings can be configured either using server.py or it can be configured by ingester code file or can be configured in env file.
This is very early stage and you might hit issues.
Demo video and quick-start in the README. Let me know if you run into issues.
I am getting used to with this lightrag-snkv, Now I dont need to check hacker news site again, it automatically ingest top ranked posts and I can query and check answers using natural query language.