Hacker News
user: ginda307
created: 2025-02-13
karma: 3

submissions:

CAD: Disaggregating Core Attention for Efficient Long-Context LLM Training
6 points | 0 comments
Disaggregated Inference: 18 Months Later
1 point | 0 comments
Reasoning Without Hesitating: Efficient CoT Through Certainty Probing
20 points | 5 comments
0 points | 0 comments