Hacker News new | ask | show | jobs
by throwaway1851 1216 days ago
For embeddings, it may be overkill. Smaller BERT-type models can provide good embeddings when fine tuned with a contrastive learning objective. Eg: https://sbert.net.