User: ag8 | HN Mirror

Y	Hacker News new \| ask \| show \| jobs

user: ag8
created: 2019-07-03
karma: 2006

runrl.com

submissions:

0 points | 0 comments

0 points | 0 comments

0 points | 0 comments

0 points | 0 comments

Gourmand Syndrome

27 points | 9 comments

0 points | 0 comments

guys why does armenian completely break Claude

99 points | 65 comments

Sampling at negative temperature

203 points | 60 comments

Perfectly Replicating Coca Cola [video]

1 points | 1 comments

0 points | 0 comments

0 points | 0 comments

4 points | 2 comments

Scaling pretraining affects RL sample efficiency

1 points | 0 comments

Systematically generating tests that would have caught Anthropic's top‑K bug

2 points | 0 comments

0 points | 0 comments

4 points | 2 comments

Training Qwen to answer briefly yet intelligently using feedback control

4 points | 0 comments

0 points | 0 comments

0 points | 0 comments

0 points | 0 comments

0 points | 0 comments

0 points | 0 comments

0 points | 0 comments

0 points | 0 comments

Launch HN: RunRL (YC X25) – Reinforcement learning as a service

71 points | 22 comments

Generating the Funniest Joke with RL

1 points | 0 comments

0 points | 0 comments