by epolanski 151 days ago
The author of this post should benchmark his own blog for accessibility metrics; the text contrast is dreadful.

On the other hand, this would be interesting for measuring agents in coding tasks, but there's quite a lot of context to provide here: both input and output would be massive.

2 comments

Pushed a fix. Could you check, please?

Any resources you can recommend to properly tackle this going forward?

Appreciate the feedback, will work on that.
Do you have any insights on the platform evaluation for coding tasks?
One more vote on fixing contrast from me.
Will fix, thanks :)
Tried Evalry, it's a really nice concept, thanks for sharing it!