Hacker News new | ask | show | jobs
user: m4r1k
created: 2022-09-21
karma: 444

submissions:

0 points | 0 comments
0 points | 0 comments
1M Tokens/s: Scaling Qwen 3.5 27B on 96 B200 GPUs with vLLM
3 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
Scaling Inference to Billions of Users and AI Agents
1 points | 0 comments
0 points | 0 comments
He Had Dangerous Delusions. ChatGPT Admitted It Made Them Worse
2 points | 2 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments