Hacker News new | ask | show | jobs
user: zone411
created: 2010-08-13
karma: 4257

https://twitter.com/LechMazur

10 LLM benchmarks: https://github.com/lechmazur/

https://www.linkedin.com/in/lech-mazur-69b70493/

Advameg (City-data.com) founder and CEO. AI startup founder.

Author: AI melody songwriting assistant https://melodies.ai

Author: Accurate COVID-19 county-by-county neural net case prediction model based on most data.

submissions:

0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
LLM Position Bias Benchmark: Swapped-Order Pairwise Judging
1 points | 0 comments
0 points | 0 comments
Show HN: Buyout Game Benchmark: Multi-Agent Bargaining, Transfers, and Takeovers
6 points | 0 comments
0 points | 0 comments
0 points | 0 comments
LLM Persuasion Benchmark: Multi-Turn Persuasion Between Models
9 points | 0 comments
0 points | 0 comments
Show HN: LLM Debate Benchmark
9 points | 3 comments
0 points | 0 comments
Show HN: LLM Sycophancy Benchmark: Opposite-Narrator Contradictions
3 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments