| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by stevemk14ebr 87 days ago
	Testing on 5GB of data fully resident in ram is a terrible comparison. Things get hard when you're in the hundreds of gigabytes or more.

2 comments

malandin 87 days ago

Thanks a lot for your comment! We agree that a dataset as small as 5 GB may sound strange but it was a conscious decision. Check out our blog post to read more about the methodology of this benchmark itself.

https://blog.serenedb.com/search-benchmark-game-overview

link

MBkkt 85 days ago

TLDR It's not our choice, but it's meaningful. Because this 5GB is single data segment and literally what you will have in Elastic/etc when you have overall TBs of data. See https://www.elastic.co/docs/deploy-manage/production-guidanc... (single shard is one Lucene index that contains multiple data segments)

link