Hacker News new | ask | show | jobs
by tarruda 858 days ago
> It's just a matter of throwing compute at it, nothing fancy.

I read somewhere that there was a recent breakthrough that enabled this.

Even if it costs a lot to run inference with 1M token context, it is hard to imagine it would cost anywhere close to a software engineer salary.