Hacker News new | ask | show | jobs
by machiaweliczny 656 days ago
YeS, exactly. The trick is to have enough tough data so you find optimal one. I think as we will scale models back to smaller sizes we will discover viable/correct representations