HN Mirror



	by manmal 62 days ago
	Why is it not useful? Input token pricing is the same for 4.7. The same prompt costs roughly 30% more now, for input.

2 comments

dktp 62 days ago

The idea is that smarter models might use fewer turns to accomplish the same task - reducing the overall token usage

Though, from my limited testing, the new model is far more token hungry overall


manmal 62 days ago

Well, you’ll need the same prompt for input tokens?


httgbgg 62 days ago

Only the first one. Ideally now there is no second prompt.


manmal 62 days ago

Are you aware that every tool call produces output which also counts as input to the LLM?


squeaky-clean 62 days ago

Are you aware that a lot of model tool calls are useless and a smarter model could avoid those?

Are you aware that output tokens are priced 5x higher than input tokens?


manmal 62 days ago

> a lot of model tool calls are useless

That’s just wrong. File reads, searches, and compiler output are the top input token consumers in my workflow. None of them can be removed, and they are the majority of my input tokens. That’s also why labs are trying to make 1M input work, and why compaction is so important to get right.

Regarding output: yes, but that wasn’t the topic in this thread. It’s just easier to show with input tokens that the price has gone up. I have a hunch the price for output will go up similarly, but can’t prove it. The jury’s out IMO: https://news.ycombinator.com/item?id=47816960


httgbgg 62 days ago

This has no bearing on my comment. The point is that a better model avoids dozens of prompts and tool calls by making fewer CORRECT tool calls, with the user needing no more prompts.

I’m surprised this is even a question; obviously a better prompter has the same properties and it’s not in dispute?


kalkin 62 days ago

That's valid, but it's also worth knowing it's only one part of the puzzle. The submission title doesn't say "input".

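For anyone who wants to put numbers on the disagreement above, here is a rough sketch of the arithmetic in Python. It is a toy model, not anyone's actual billing: it assumes the whole context is re-sent as input on every turn with no prompt caching or compaction, that each turn adds one tool result to the context, and that prices are in relative units (input = 1, output = 5, taken from the "5x" figure in the thread). The `Session` fields, the token counts, and the `input_inflation` parameter (1.3 for the "roughly 30% more" claim) are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Session:
    """Hypothetical shape of one agentic session (all numbers are made up)."""
    turns: int                  # number of model calls
    prompt_tokens: int          # size of the initial prompt
    tool_output_tokens: int     # tokens each tool result adds to the context
    output_tokens_per_turn: int # tokens the model writes per turn


def session_cost(s: Session,
                 input_price: float = 1.0,
                 output_price: float = 5.0,
                 input_inflation: float = 1.0,
                 output_inflation: float = 1.0) -> float:
    """Relative cost of a session.

    Assumes the full context is billed as input on every turn
    (no caching, no compaction), which is a simplification.
    """
    total_input = 0
    context = s.prompt_tokens
    for _ in range(s.turns):
        total_input += context  # whole context is sent as input
        # The model's own output and the tool result join the context.
        context += s.output_tokens_per_turn + s.tool_output_tokens
    total_output = s.turns * s.output_tokens_per_turn
    return (total_input * input_price * input_inflation
            + total_output * output_price * output_inflation)


# Same task, three scenarios (token counts are illustrative only).
old = Session(turns=10, prompt_tokens=2000,
              tool_output_tokens=1500, output_tokens_per_turn=300)
new_fewer_turns = Session(turns=6, prompt_tokens=2000,
                          tool_output_tokens=1500, output_tokens_per_turn=300)

baseline = session_cost(old)
same_turns = session_cost(old, input_inflation=1.3)
fewer_turns = session_cost(new_fewer_turns, input_inflation=1.3)

print(f"baseline:            {baseline:,.0f}")
print(f"30% more, 10 turns:  {same_turns:,.0f}")
print(f"30% more, 6 turns:   {fewer_turns:,.0f}")
```

Under these assumptions both sides have a point: with the turn count unchanged the session gets more expensive, while dropping enough turns more than offsets the 30%, because input cost grows roughly quadratically with the number of turns when the context is re-sent each time. Which case applies in practice depends on whether the tool calls really can be avoided, which is exactly what the thread is arguing about.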