Hacker News new | ask | show | jobs
by pixl97 929 days ago
We'll need some kind of hybrid system to deal with this. For example the LLM 'indexes' the text it reads and assigns importance weights to parts of it, then as it moves to new text it can check back to these more important parts to ensure its not forgetting things.
1 comments

I would think there is some benefit to synthesizing, and compressing. Summarization is similar in that the heavier weighed text remains and the rest is pruned.

If the same basic information is all over a text, combine it.

We already know LLMs are good at summarizing.

Question is how good they are are retaining minute details from extremely long context, say 200k tokens.

That’s the frontier Claude and now GPT-4 Turbo are pushing

I guess I’m proposing a new compression, new substitutions, the llm inventing new words to compress common ideas. A bytecode if you will. Compiling the context down.