by guitarlimeo 634 days ago
Even if the costs were lower, the trend is toward more inference-time compute (e.g. o1), so these cost estimates might still hold in the future.

I'm not sure how comparable o1 is in total usage. Remember that people will either adjust the prompt or continue the conversation as needed. If o1 spends more time on each answer but gets there in fewer steps, it may be a net win for energy use. It may also skip the planning and self-reflection steps in agent usage completely. It's going to be hard to estimate the real change in usage.