Hacker News new | ask | show | jobs
by astrange 781 days ago
Since LLMs are loosely based on NM models, it seems research on newer sampling methods like Mirostat might help here.