| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by piqi 1197 days ago
	> Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks. Is the purpose to know which of the models OpenAI offers is most suitable for your workload/app? Could I use this to know if the cheaper model is sufficient for a particular use-case?