Hacker News new | ask | show | jobs
by buildbot 1 day ago
It honestly explains so many issues I have been having, as I used it primarily for ML research (on my personal account, doing things not related to my job I should note). It would literally typo package names and spend huge amounts of time failing to setup simple environments…then do stupid things like set the learning rate to 1e-7, and use the eval set as training data.
3 comments

It burned through all of my tokens in a very short time. I wonder if it their ML mitigations leads to model into deadlocks.
That’s insane. I hope they fix it.
Nothing to fix. This is working as designed.

Using codex for this use case is the fix.

just imagine if they made it sneaky. get things just subtly wrong enough that your training runs just never quite go as well as you think they should.