Y
Hacker News
new
|
ask
|
show
|
jobs
user:
m4r1k
created:
2022-09-21
karma:
444
submissions:
0 points
|
0 comments
0 points
|
0 comments
1M Tokens/s: Scaling Qwen 3.5 27B on 96 B200 GPUs with vLLM
3 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
Scaling Inference to Billions of Users and AI Agents
1 points
|
0 comments
0 points
|
0 comments
He Had Dangerous Delusions. ChatGPT Admitted It Made Them Worse
2 points
|
2 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments