by jerpint 1207 days ago
The summary itself is still generated, but the prompt contains all the context the model needs to do the summarization.
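
To make the "context in the prompt" idea concrete, here is a minimal sketch of how such a prompt might be assembled. The function name `build_summary_prompt`, the `max_chars` budget, and the prompt wording are all hypothetical and chosen for illustration; they are not taken from any particular project. The retrieval step that produces `passages` is assumed to happen elsewhere.

```python
def build_summary_prompt(question: str, passages: list[str], max_chars: int = 4000) -> str:
    """Assemble a prompt that carries the retrieved context inline.

    Hypothetical helper: the model is not finetuned; it only sees the
    passages pasted into the prompt and is asked to summarize from them.
    """
    context_parts = []
    used = 0
    for i, passage in enumerate(passages, start=1):
        entry = f"[{i}] {passage.strip()}"
        # Stop adding passages once the (assumed) character budget is exceeded.
        if used + len(entry) > max_chars:
            break
        context_parts.append(entry)
        used += len(entry)

    context = "\n\n".join(context_parts)
    return (
        "Summarize an answer to the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\n"
        "Summary:"
    )


if __name__ == "__main__":
    print(build_summary_prompt("What is X?", ["X is a thing.", "Y is something else."]))
```

The resulting string would then be sent to whichever completion model is in use; that call is left out because it depends on the provider.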

It's very difficult to otherwise finetune existing LLMs. GPT itself is closed-source and doesn't allow finetuning (except via an opaque API and with limited amounts of data). Open alternatives are either very difficult to load into memory or simply not as expressive as GPT.