Hacker News
user: philipkiely
created: 2018-08-20
karma: 1119

DevRel @ https://baseten.co

Email me: username at baseten.co

submissions:

We built the fastest API for GLM-5.2 (280 TPS)
6 points | 0 comments
The Math Behind TurboQuant
8 points | 3 comments
Show HN: Inference Engineering
2 points | 0 comments
How We Built the Fastest Kimi K2.5 on Artificial Analysis
3 points | 0 comments
Nvidia Invests $150M in AI Inference Startup Baseten
1 point | 1 comment
Baseten raises $150M Series D at $2.15B
2 points | 1 comment
Running GPT-OSS-120B at 500 tokens per second on Nvidia GPUs
247 points | 175 comments