Y
Hacker News
new
|
ask
|
show
|
jobs
by
astrange
781 days ago
Since LLMs are loosely based on NM models, it seems research on newer sampling methods like Mirostat might help here.