|
|
|
|
|
by flail
37 days ago
|
|
A quick smoke test, then. Gemini 3, Thinking Mode.
The article: https://techtrenches.dev/p/the-human-cost-of-10x-how-ai-is-p...
The prompt: literally what you suggested. Gemini: The article focuses on the environmental and human labor costs of scaling Artificial Intelligence, specifically focusing on water usage, electricity, and "ghost work." Which is hilarious, since the article doesn't even mention the words "water" or "electricity." Gemini remains unfazed, reporting the links that are not in the article (some don't exist at all) to make the final ruling: "The Tech Trenches document is highly accurate in its citations." Now, I know. Had I used Claude Code with relevant skills, it would have done better. But would it be good? |
|
* https://gemini.google.com/share/6bd33176b27c
Right, so https://techtrenches.dev/p/the-human-cost-of-10x-how-ai-is-p... is actually a substack, gemini is blocked from accessing it, and is bouncing off and hallucinating instead. Ok, that's an actual bug, that should not lead to the model starting to hallucinate. Imo the correct response should have been to fail loudly; which would have been a verification signal of its own.
ps: See also: https://news.ycombinator.com/item?id=48087485 ... I'm starting to think of it as "english is a new scripting language". Clearly the downside is that certain "runtime environments" are not compatible. %-/