by int_19h 245 days ago
Google's models are just generally more resilient to high temps and high top_p than some others. OTOH you really don't want to run Qwen3 with top_p=1.0...
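For anyone unsure what those two knobs actually do, here is a minimal sketch of temperature scaling followed by top_p (nucleus) truncation over a toy logit vector. The function name `top_p_filter` and the logits are made up for illustration; this is not how any particular vendor implements sampling, just the textbook version.

```python
import math

def top_p_filter(logits, temperature=1.0, top_p=1.0):
    """Return {token_index: prob} after temperature scaling and top-p truncation.

    Hypothetical helper for illustration only; real inference stacks differ.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    # Higher temperature flattens the distribution, lower sharpens it.
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Keep the most likely tokens until their cumulative mass reaches top_p.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cumulative = [], 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break

    # Renormalize over the surviving tokens.
    mass = sum(probs[i] for i in kept)
    return {i: probs[i] / mass for i in kept}

# Toy logits (assumed values, not from any real model).
toy_logits = [5.0, 3.0, 1.0, 0.0, -1.0, -2.0]
print(top_p_filter(toy_logits, temperature=1.0, top_p=1.0))
print(top_p_filter(toy_logits, temperature=1.0, top_p=0.8))
print(top_p_filter(toy_logits, temperature=2.0, top_p=0.8))
```

With top_p=1.0 nothing is cut, so every low-probability tail token stays eligible, and raising the temperature pushes more mass into that tail. A model that tolerates that combination well is what "resilient" means here.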