by teraflop
160 days ago
Don't get me wrong, it's fun to see performance optimizations like this. But I'd expect that a naive implementation of the same strategy would already take like 0.1% of the time needed to actually generate embeddings for your chunks. So practically, is it really worth the effort of writing a bunch of non-trivial SIMD code to reduce that overhead from 0.1% to 0.001%?
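
As a back-of-the-envelope sketch of that arithmetic (this is just Amdahl's law), the snippet below takes the 0.1% guess above and a hypothetical 100x speedup of the chunking step. Both numbers are illustrative assumptions, not measurements, and the function and variable names are made up for this example.

```python
def overall_speedup(fraction: float, local_speedup: float) -> float:
    """Amdahl's law: end-to-end speedup when only `fraction` of the
    total runtime is made `local_speedup` times faster."""
    return 1.0 / ((1.0 - fraction) + fraction / local_speedup)


# Assumed numbers (guesses from the comment, not benchmarks):
chunk_fraction = 0.001   # chunking is ~0.1% of total pipeline time
simd_speedup = 100.0     # hypothetical: SIMD makes chunking 100x faster

total = overall_speedup(chunk_fraction, simd_speedup)
time_saved = 1.0 - 1.0 / total  # share of the original runtime saved

print(f"end-to-end speedup: {total:.5f}x")
print(f"share of total time saved: {time_saved:.4%}")
```

Under those assumptions the whole pipeline gets faster by roughly a tenth of a percent, which is the point being made: the gain only becomes significant if chunking is a much larger share of the runtime than 0.1%.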
|
So they're talking about this becoming an issue when chunking TBs of data (I assume), not your 1kb random string...