What sorts of apps are those? I tried testing various models with a test app as a benchmark, a local first app with CRDTs, and many, even frontier models, struggle heavily.
I actually was able to get the CRDT stuff working with the latest models, just had to set thinking to the maximum and also have it continually test itself in a loop via MCP without my intervention so after chugging along for an hour it seemed to have fixed everything itself.