| HN Mirror



	by ddorian43 812 days ago
	There is https://ydb.tech/, an open-source DB that uses erasure coding for replication within a single zone/region.

1 comments

eivanov89 812 days ago

In YDB, block 4+2 erasure coding needs half the disk space of the mirror-3-dc scheme. However, its CPU usage is slightly higher, so mirror-3-dc wins in high-throughput tests. As mentioned in the post, erasure coding might win on tail latency in latency runs, but if your goal is throughput with reasonable latencies, replication might be the better choice.
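
To make the "half the disk space" figure concrete, here is a back-of-the-envelope sketch. It assumes mirror-3-dc stores three full copies and that block 4+2 splits each blob into 4 data parts plus 2 equally sized parity parts; the function names are made up for illustration and are not part of any YDB API.

```python
from fractions import Fraction


def replication_overhead(copies: int) -> Fraction:
    """Raw bytes stored per logical byte when keeping `copies` full replicas."""
    return Fraction(copies)


def erasure_overhead(data_parts: int, parity_parts: int) -> Fraction:
    """Raw bytes stored per logical byte for a data+parity erasure code.

    Assumes all parts are the same size, so the overhead is (k + m) / k.
    """
    return Fraction(data_parts + parity_parts, data_parts)


mirror3 = replication_overhead(3)   # assumed: three full copies
block42 = erasure_overhead(4, 2)    # assumed: 4 data parts + 2 parity parts
ratio = block42 / mirror3

print(f"mirror-3: {mirror3}x, block 4+2: {block42}x, ratio: {ratio}")
```

Under these assumptions the ratio comes out to 1/2: 1.5x raw storage for block 4+2 versus 3x for three full copies. The same ratio applies to bytes written to disk per logical byte.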


pjdesno 812 days ago

If you only care about throughput, just fetch the data, and reads should be the same speed as with triple replication.

For writes, triple replication has to write 2x as much data or more, so it's going to be slower unless your CPUs are horribly slow compared to your drives.


ddorian43 812 days ago

I expect it to save a lot of CPU by needing only 1/3 of the compactions. You might want to run a benchmark on that ;). An example is quickwit (building inverted indexes is very expensive).
