by sebzim4500 961 days ago
How exactly are you proposing he tests this without access to hundreds of thousands of dollars' worth of compute? Toy models don't work for this kind of thing: small language models behave qualitatively differently from large ones.