| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by bourse_lee 38 days ago

What drives such a high throughput difference between Quack and Arrow on high-volume operations ?

I'll try to search from source/Github, reply appreciated though, for example:

- when DuckDb bulk exports a table, does Quack benefit from pre-existing compression/encodings/0-copy where Arrow requires decode+re-encode ?

- the post mentions parallel reads, is the level of parallelism the same on Arrow vs Quack here ? Running the high throughput benchmark at resource saturation with increasing number of concurrent bulk-read clients would be more transparent