Y
Hacker News
new
|
ask
|
show
|
jobs
by
int_19h
245 days ago
Google's models are just generally more resilient to high temps and high top_p than some others. OTOH you really don't want to run Qwen3 with top_p=1.0...