Hacker News new | ask | show | jobs
by m-dot-reviews 8 days ago
I looked for a forum like this a few months ago during my own model research, and didn't find one. So, here's the "catalog of clankers," a task-structured review site for LLMs. The idea is that if you're looking for an LLM to use for a specific task, there's a whole spectrum of models out there now, and benchmark data can be somewhat hard to apply to your own particular task.

For context, at the time I was looking for models that were good at generating novel DSL snippets (i.e. a language not in LLM training data) and then general SVG generation (not just pelicans).

Once there's something to publish, there will be a regular cadence of database dumps so that the actual content on the site (reviews, ratings, etc) is publicly available and not locked up by the site itself.

PTAL!