Y
Hacker News
new
|
ask
|
show
|
jobs
by
AlexCoventry
172 days ago
No, there's no training going on, here, as far as I can tell. E.g., they use GPT-5 as their base model. Also, AFAICT from a quick skim/search there's no mention of loss functions or derivatives, FWIW.
1 comments
alextheparrot
172 days ago
The derivative being a grad(ient) student sampling scaffolds against evals + qualitative observations: most prompt-based llm papers
link