Hacker News new | ask | show | jobs
by jafitc 929 days ago
We already know LLMs are good at summarizing.

Question is how good they are are retaining minute details from extremely long context, say 200k tokens.

That’s the frontier Claude and now GPT-4 Turbo are pushing

1 comments

I guess I’m proposing a new compression, new substitutions, the llm inventing new words to compress common ideas. A bytecode if you will. Compiling the context down.