What sorts of apps are those? I tried testing various models with a test app as a benchmark, a local first app with CRDTs, and many, even frontier models, struggle heavily.
I actually was able to get the CRDT stuff working with the latest models, just had to set thinking to the maximum and also have it continually test itself in a loop via MCP without my intervention so after chugging along for an hour it seemed to have fixed everything itself.
https://decaboy.fit for tracking progress at they gym
https://megaparley.com sports betting platform
A horse betting platform not published yet, still looking for an API odds provider
A car mechanic AI assistant not published yet
I've learned that the more detailed the initial prompt the better result I get. I can share any prompt if you want