by piqi 1197 days ago
> Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks.

Is the purpose to find out which of OpenAI's models is best suited to your workload or app? Could I use this to check whether a cheaper model is sufficient for a particular use case?