| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by penguinten 1265 days ago
	It uses the OpenAI embedding models to get around the context window problem. You can compress large amounts of text/code into just a few tokens.

3 comments

LunaticDragon 1265 days ago

Can you elaborate on this? What is the input to the GPT in that case? I was under the impression that GPT is given an array of tokens and it produces one token as its output.

link

iamflimflam1 1265 days ago

Would love some more information on how to do this. Can you share details or point to a resource?

link

cipherpro 1265 days ago

https://beta.openai.com/docs/guides/embeddings/use-cases

link

joshka 1265 days ago

I'm curious if can share LoC -> token count metrics?

link