Hacker News new | ask | show | jobs
by pamelafox 311 days ago
We use text-embedding-3-large, with both quantization and MRL reduction, plus oversampling on the search to compensate for the compression.