|
|
|
|
|
by imperio59
157 days ago
|
|
From the author:
> at some point we started benchmarking on wikipedia-scale datasets.
> that’s when things started feeling… slow. So they're talking about this becoming an issue when chunking TBs of data (I assume), not your 1kb random string... |
|
memchunk has a throughput of 164 GB/s. A really fast embedder can deliver maybe 16k embeddings/sec, or ~1.6GB/s (if you assume 100 char sentences)
That's two orders of magnitude difference. Chunking is not the bottleneck.
It might be an architectural issue - you stuff chunks into a MQ, and you want to have full visibility in queue size ASAP - but otherwise it doesn't matter how much you chunk, your embedder will slow you down.
It's still a neat exercise on principle, though :)