Y
Hacker News
new
|
ask
|
show
|
jobs
by
crucio
813 days ago
how well would this work for 762 dimensional vectors? At 3072, they're starting with such a high number of dimensions that the accuracy loss may not be representative of what others would see
3 comments
esafak
813 days ago
You'd have to look at the precision-recall curves for your data set and make the trade-off. There are studies on this topic.
link
mediaman
813 days ago
Generally, it seems that people are starting to see more problems when making vectors of fewer than 1,000 dimensions binary.
link
blublubdub
812 days ago
seems to work pretty well check this out:
https://huggingface.co/blog/embedding-quantization
link