Hacker News
by sebzim4500 1121 days ago
For anyone reading this, these are the actual prompts being used to assess the models.

https://github.com/kagisearch/pyllms/blob/ca9ad4d4bfdd9d58fe...