Y
Hacker News
new
|
ask
|
show
|
jobs
user:
t55
created:
2023-08-18
karma:
892
ML researcher
submissions:
Target Policy Optimization
1 points
|
0 comments
Show HN: Kilroy – Knowledge base for teams using Claude Code
5 points
|
0 comments
Procedural Reasoning Datasets
1 points
|
0 comments
In Defence of Gary Marcus
3 points
|
0 comments
Reasoning Gym – Procedural RL reasoning datasets
1 points
|
0 comments
ChatGPT Agent [video]
3 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
ReasoningGym: Reasoning Environments for RL with Verifiable Rewards
105 points
|
28 comments
Show HN: Rehearsal.so, Duolingo for Public Speaking
3 points
|
1 comments
0 points
|
0 comments
End-to-End Vision Tokenizer Tuning
3 points
|
0 comments
YC Interview Mock Practice
2 points
|
0 comments
D1: Scaling Reasoning in Diffusion LLMs via Reinforcement Learning
4 points
|
0 comments
Are LLMs more than autocomplete? AI Debate
1 points
|
0 comments
Block Diffusion: Interpolating Autoregressive and Diffusion Language Models
72 points
|
16 comments
How to stay in flow while using Cursor or Windsurf
2 points
|
0 comments
0 points
|
0 comments
Generative Modelling in Latent Space
2 points
|
0 comments
0 points
|
0 comments