Hacker News new | ask | show | jobs
by suchintan 245 days ago
I think they're complementary, and that's the direction we're headed.

We can ask the vision based models to output why they are doing what they are doing, and fallback to code-based approaches for subsequent runs