by eden-u4 549 days ago
ah, numerical instability in the warmup stage might be the issue then?