by minimaxir
663 days ago
One of the now-underdiscussed features of embeddings is that you can use any existing statistical modeling technique on them out of the box, and as a bonus avoid the common NLP preprocessing nuances and pitfalls (e.g. stemming) entirely. This post is a good example of why going straight to LLM embeddings for NLP is a pragmatic first step, especially for long documents.
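To make the "out of the box" point concrete, here is a minimal sketch of a plain nearest-centroid classifier run directly on embedding vectors. Everything in it is an assumption for illustration: the vectors are synthetic stand-ins for what a real embedding model would return, and the names `fake_embeddings`, `fit`, `predict`, and the labels `topic_a`/`topic_b` are hypothetical. The only thing it is meant to show is that no tokenizing, stemming, or other text preprocessing sits between the vectors and the model.

```python
import math
import random


def fake_embeddings(center, n, noise, rng):
    """ASSUMPTION: stand-in for a real embedding model.
    Returns n noisy copies of `center`, one vector per 'document'."""
    return [[c + rng.gauss(0.0, noise) for c in center] for _ in range(n)]


def centroid(vectors):
    """Component-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def cosine(a, b):
    """Cosine similarity between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def fit(X, y):
    """Nearest-centroid 'training': store one mean vector per label."""
    by_label = {}
    for vec, label in zip(X, y):
        by_label.setdefault(label, []).append(vec)
    return {label: centroid(vecs) for label, vecs in by_label.items()}


def predict(model, vec):
    """Return the label whose centroid is most cosine-similar to vec."""
    return max(model, key=lambda label: cosine(model[label], vec))


rng = random.Random(0)

# Two hypothetical document classes that occupy different regions of a
# 16-dimensional embedding space.
center_a = [1.0] * 8 + [0.0] * 8
center_b = [0.0] * 8 + [1.0] * 8

X_train = fake_embeddings(center_a, 50, 0.3, rng) + fake_embeddings(center_b, 50, 0.3, rng)
y_train = ["topic_a"] * 50 + ["topic_b"] * 50

X_test = fake_embeddings(center_a, 20, 0.3, rng) + fake_embeddings(center_b, 20, 0.3, rng)
y_test = ["topic_a"] * 20 + ["topic_b"] * 20

model = fit(X_train, y_train)
correct = sum(predict(model, v) == label for v, label in zip(X_test, y_test))
accuracy = correct / len(X_test)
print(f"held-out accuracy: {accuracy:.2f}")
```

With real data you would swap `fake_embeddings` for calls to whatever embedding model you use, and you could just as well swap the nearest-centroid step for logistic regression, clustering, or any other off-the-shelf method that accepts numeric feature vectors.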