Hacker News new | ask | show | jobs
by hex4def6 52 days ago
The point of a benchmark is that it allows a relative comparison. The Pelican is one such benchmark.

Feel free to create a "how does it compare to Claude 3.5 Sonnet" benchmark. If people find it useful, it will be run against new LLMs to generate additional points of comparison.

I will also say; it's really easy to just skim past comments. I suspect your ROI time-wise in creating this account to complain will never be recouped compared with just skimming past pelican comment chains.

1 comments

Usually I read the top comments in posts, they usually have the best information. I don't think the pelican test deserve to be at top position. HN top posts should reflect the best of our community, not by karma but by the value and insight that they provide.