Hacker News
by calcsam 334 days ago
interesting idea; this benchmark maps fairly closely to the kinds of output I ask LLMs to generate for me day-to-day
1 comment

ayy great to hear!