by 10c8
750 days ago
Generating embeddings is relatively simple with a model and some Python code. There are plenty of models on HuggingFace, along with code examples. all-MiniLM-L6-v2 is a really popular one, if not the most popular (albeit not SotA), and it produces 384-dimensional embeddings: https://huggingface.co/sentence-transformers/all-MiniLM-L6-v...

Edit: A more modern and robust suite of models comes from Nomic, and can generate embeddings with 64 to 768 dimensions (https://huggingface.co/nomic-ai/nomic-embed-text-v1.5). When the author talks about thousands of dimensions, they're probably talking about the OpenAI embedding models.
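For anyone who wants to try it, here's a minimal sketch of what that looks like with the sentence-transformers package (assumes `pip install sentence-transformers`; the model weights are downloaded on first use). The helper names `embed`, `cosine_similarity` and `demo` are just mine for illustration, not part of any library.

```python
import math


def embed(texts, model_name="sentence-transformers/all-MiniLM-L6-v2"):
    """Return one embedding per input string (384 floats each for this model)."""
    # Imported lazily so the rest of this file works without the package installed.
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer(model_name)
    # normalize_embeddings=True scales every vector to unit length.
    return model.encode(texts, normalize_embeddings=True)


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors, in pure Python."""
    a = [float(x) for x in a]
    b = [float(x) for x in b]
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (norm_a * norm_b)


def demo():
    """Embed a few sentences and compare the first one against the others."""
    sentences = [
        "How do I generate text embeddings?",
        "What is the easiest way to turn a sentence into a vector?",
        "The weather is nice today.",
    ]
    vectors = embed(sentences)
    print("dimensions:", len(vectors[0]))
    for sentence, vector in zip(sentences[1:], vectors[1:]):
        print(round(cosine_similarity(vectors[0], vector), 3), sentence)


# demo()  # uncomment to download the model and run the comparison
```

You'd expect the two embedding-related sentences to score closer to each other than to the weather one, but I haven't included exact numbers since they depend on the model version.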