| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by nirga 705 days ago
	I tend to find classic NLP metric more predictable and stable than "LLM as a judge" metrics so I'd try to see if you rely on them more. We've written a couple of blog posts about some of them: https://www.traceloop.com/blog

1 comments

swyx 705 days ago

for your blog can i offer a big downvote for the massive ai generated cover image thing? its a trend for normies but for developers its absolutely meaningless. give us info density pls

link

nirga 705 days ago

roger that! I like them though (am I a normie then?)

link