by _--__--__ 482 days ago
This is neat but watching a reasoning model that stops to consider "I have read half of a dialogue block, time to press A to get the rest of the text" gets old really quick. I think I'd rather watch a model try to play pokemon against human opponents on a simulator like pokemon showdown (which I understand is a bit further in an IP rights grey area than emulating a 30 year old game). In that case you would get to see how it handles unknown information and updates its reasoning based on the success/failure of its predictions.
4 comments

A model doesn't need to play on a visual simulator; it could just as well do that over IRC (like the good old days of RS/GSBots) to show how it fares against humans.

One of the biggest challenges this Claude version faces is reading the visual data accurately. It was stuck in Viridian Forest and Pokemarts for a while because overworld objects like trees and paths kept confusing it.

> play pokemon against human opponents on a simulator like pokemon showdown

That's precisely the pet project I'd take on if/when I bother to take the time to make a deep learning agent. There's a bot that plays one of the ladders already, but it's just a decision tree and the best players know how to predict its moves. It sits at around 1500 Elo on a ladder where the best players are 1800+. Still not bad, to be fair; it would probably beat me.
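
For a rough sense of what that rating gap means, here is a minimal sketch assuming the ladder behaves like textbook Elo on the usual 400-point scale (an assumption; Showdown's actual rating system may differ). The function name `elo_expected_score` is made up for illustration.

```python
def elo_expected_score(rating_a: float, rating_b: float) -> float:
    """Expected score of player A against player B under standard Elo.

    Assumes the textbook logistic formula with a 400-point scale;
    a real ladder may use a modified rating system.
    """
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


bot_rating, top_rating = 1500, 1800
print(f"{elo_expected_score(bot_rating, top_rating):.3f}")  # → 0.151
```

Under that assumption, the bot would be expected to score roughly 15% against an 1800-rated player, which fits "not bad, but predictable to the best players".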

The bot has a pre-selected team, which I believe always starts with the same mon. I'd be more interested in an agent that fully played the game, start-to-finish, including making a team based on play data and selecting a starter based on the current opponent's team.

Haha yeah, this is cool, but the days of watching Twitch Plays Pokemon or RNG Plays Pokemon or things like that were much more entertaining.

> which I understand is a bit further in an IP rights grey area than emulating a 30 year old game

But Nintendo will never take down anything that is related to Showdown because it would highlight their massive hypocrisy!

It would set a precedent. People would go: "wait, but why did they never take down Showdown itself? Could it be that it's because they actually benefit from its existence? Then why did they take down X/Y/Z? Oh! It's because copyright law only applies when you want it to! It's all arbitrary and made up! You just need to be friends with the right people in the VGC and your pet project will be immune from all legal backlash!"

Or something.

Seriously I hate it so fucking much that Nintendo does nothing about Showdown, which blatantly steals a ton of game assets, and then nukes some random guy's fan project that no one ever played.