by vova_hn2
76 days ago
> random LLM-generated statistic-vomit text

I do not understand why this project in particular has set you off. Their README looks much better than many I've seen on HN:

- no annoying verbosity of the kind that is so prevalent in AI-generated text
- not too many buzzwords (they're not saying "agentic" every sentence)
- it is very clear what exactly the project is supposed to do and why it can be useful

Personally, I upvoted this because I have wanted to do something similar for a long time but never got around to it.
|
“Provider: OpenAI (gpt-4o / o1)”
Uh so is it 4o or o1? Very different models. When you read this, how did you interpret this?
- OK, I'll take your word for it

Run Variant             Token Delta (per call)   Step Savings (vs Baseline)   Task Success
Baseline (2026-03-13)   -18.62%                  —                            11/11
Hardened A              +8.07%                   —                            11/11
Enhanced (2026-03-27)   -6.73%                   +27.78%                      11/11

Key Takeaways:

- What useful information do you glean from this, vova_hn2? Perhaps I'm just ignorant.
So it actually takes MORE tokens but fewer “steps”? This could all use actual discussion from the creator. A blog post or detailed comment. Instead we get this.

What sets me off is projects like this that throw random numbers and technical jargon at you because the user simply asked their LLM to do so. It gives the veneer of “oh it must be legitimate, look at all the data” to people mentally stuck in 2024, who don't realize anyone can generate junk and pass it off in a way that is (or used to be) convincing.