user: philipkiely
created: 2018-08-20
karma: 1119
DevRel @ https://baseten.co
Email me: username at baseten.co
submissions:
We built the fastest API for GLM-5.2 (280 TPS)
6 points | 0 comments
The Math Behind TurboQuant
8 points | 3 comments
Show HN: Inference Engineering
2 points | 0 comments
How We Built the Fastest Kimi K2.5 on Artificial Analysis
3 points | 0 comments
Nvidia Invests $150M in AI Inference Startup Baseten
1 point | 1 comment
Baseten raises $150M Series D at $2.15B
2 points | 1 comment
Running GPT-OSS-120B at 500 tokens per second on Nvidia GPUs
247 points | 175 comments