Hacker News new | ask | show | jobs
by m101 189 days ago
Prove it beats models of different architectures trained under identical limited resources?