Hacker News new | ask | show | jobs
by wastewastewaste 981 days ago
What is the most capable LLM for summarising stuff like long educational articles perhaps?
4 comments

Anthropic's Claude 2 is great for pasting large text files in, and Flan T5 seems to be excelling at summarization benchmarks: https://huggingface.co/docs/transformers/model_doc/flan-t5.
GPT-4 is usually the answer to “which llm is most/best…”

But if the article is longer than the context window, you can try either RWKV or Claud-200k

Has Claude 200k ever been made available? I've only found ways to access the 100k version.
GPT-4 if it's less than 8,000 tokens (or 32,000 if you have access to that via the API).

Claude for up to 100,000 tokens.

GPT-4