| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by theptip 973 days ago

> The 8k context window is new

Hasn’t Claude had this for many months (before they bumped to 100k)?

Edit: ah, you mean new for OSS maybe?

1 comments

simonw 973 days ago

Claude is a large language model, which is a different thing from an embedding model.

link

Der_Einzige 973 days ago

Any large language model generates embedding representations at every layer of the model, and these can be trivially extracted. So, large language models are indeed embedding models.

This leaderboard doesn't compare these custom tailored embedding models vs the obvious thing of average pooling layered with any traditional LLM, which is easily implemented using sentence transformers.

link

anothernewdude 972 days ago

Because 4K+ dimensional embeddings are functionally useless.

link

theptip 973 days ago

Aha, that’s what I missed, thanks!

link