Hacker News new | ask | show | jobs
user: mikejulietbravo
created: 2018-08-10
karma: 255

submissions:

0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
How to get GLM 5.2 to 280 tokens per second
3 points | 1 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
Show HN: Automatically Build Nvidia TRT-LLM Engines
2 points | 0 comments
Show HN: 60% higher tokens per second for 70B custom LLMs
1 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
Show HN: Baseten Chains – Framework and SDK for Multi-Model AI Products
9 points | 5 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
0 points | 0 comments
Open Source Inference Engine Baseten Raises $40M from IVP, Spark and Greylock
2 points | 1 comments
0 points | 0 comments