Hacker News new | ask | show | jobs
by NavinF 1069 days ago
People have tried, but all the local minimums perform about the same so there's no point trying to find a global minimum. A much better strategy is to train multiple models and use all of them at inference time to score better