Hacker News new | ask | show | jobs
by aestrad7 91 days ago
That's exactly the intent, independent, reproducible and no vendor relationship.

The monetization angle is interesting. A continuously updated version with more models and frontier models, agentic scenarios, and multi-turn testing would be genuinely useful for teams making deployment decisions. That's the direction for v2.