by kristjansson 3 days ago
Be sure you compare input tokens to pre-fill rates and output tokens to generation rates.
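As a rough sketch of that arithmetic: the helper name `estimate_latency_s` and all the numbers below are made up for illustration, not measurements of any real system. It just assumes the prompt is processed at the pre-fill rate and the response is produced at the generation rate.

```python
def estimate_latency_s(input_tokens, output_tokens, prefill_tps, decode_tps):
    """Rough end-to-end time in seconds (hypothetical helper).

    Input tokens are divided by the pre-fill rate, output tokens by the
    generation rate; the two phases are simply summed.
    """
    prefill_s = input_tokens / prefill_tps   # time to process the prompt
    decode_s = output_tokens / decode_tps    # time to generate the response
    return prefill_s + decode_s


# Illustrative numbers only (assumptions, not benchmarks).
total = estimate_latency_s(
    input_tokens=8000, output_tokens=500,
    prefill_tps=2000.0, decode_tps=50.0,
)
print(f"{total:.1f} s")  # prints "14.0 s" (4.0 s pre-fill + 10.0 s generation)
```

With these made-up rates, dividing all 8500 tokens by the generation rate alone would give 170 s, which is why mixing the two up skews the estimate so badly.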