Hacker News new | ask | show | jobs
by clbrmbr 349 days ago
Does anyone have experience getting agents to understand terminal applications? Like, in general an arbitrary ncurses application.

A more specific case I’ve struggled with is output from a long-running program like ping. You’ve got to know when to terminate.

2 comments

I wrote a terminal-based falling sand game in rust and incrementally fed the entire screen output to a multimodal LLM (for better generalization) and also got it to attempt to generate interesting initial conditions by spitting out raw characters.
Instead of telling the agent to wait for something like ping, have it write a script to do it and then have it run the script.