by EffCompute 136 days ago

One thing I'm trying to better understand is where the real limits are.

At this point it feels like the bottleneck is less about raw compute and more about how efficiently data is represented and accessed on the GPU.
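A rough back-of-envelope sketch of why that might be, assuming a brute-force dot-product scan with one query at a time. The GPU figures (30 TFLOP/s, 500 GB/s) and the dataset size are hypothetical placeholders rather than measurements, and the function name is just for illustration:

```python
def scan_time_seconds(n_vectors, dim, bytes_per_component, peak_flops, mem_bandwidth):
    """Estimate compute-bound and memory-bound time for one brute-force scan.

    Each vector costs one multiply and one add per component (2 * dim FLOPs)
    and has to be read from GPU memory once (dim * bytes_per_component bytes).
    """
    flops = 2.0 * n_vectors * dim
    bytes_read = float(n_vectors) * dim * bytes_per_component
    compute_s = flops / peak_flops
    memory_s = bytes_read / mem_bandwidth
    return compute_s, memory_s


# Hypothetical consumer GPU; substitute the real numbers for your card.
PEAK_FLOPS = 30e12      # FLOP/s (assumed, and held fixed across precisions for simplicity)
MEM_BANDWIDTH = 500e9   # bytes/s (assumed)

if __name__ == "__main__":
    for name, width in [("fp32", 4), ("fp16", 2), ("int8", 1)]:
        compute_s, memory_s = scan_time_seconds(
            n_vectors=10_000_000, dim=768, bytes_per_component=width,
            peak_flops=PEAK_FLOPS, mem_bandwidth=MEM_BANDWIDTH,
        )
        print(f"{name}: compute {compute_s * 1e3:.2f} ms, "
              f"memory {memory_s * 1e3:.2f} ms, "
              f"ratio {memory_s / compute_s:.0f}x")
```

Under these assumptions the memory term dominates at every precision, and shrinking the representation cuts the estimated scan time almost linearly. Batching many queries per pass would reuse each loaded vector and change the picture, so treat this as a lower bound on how compute-starved a single-query scan is, not a general result.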

Curious if others have seen similar behavior when pushing large-scale vector search on consumer hardware.