Hacker News
user: mikejulietbravo
created: 2018-08-10
karma: 255
submissions:
How to get GLM 5.2 to 280 tokens per second
3 points | 1 comment
Show HN: Automatically Build Nvidia TRT-LLM Engines
2 points | 0 comments
Show HN: 60% higher tokens per second for 70B custom LLMs
1 point | 0 comments
Show HN: Baseten Chains – Framework and SDK for Multi-Model AI Products
9 points | 5 comments
Open Source Inference Engine Baseten Raises $40M from IVP, Spark and Greylock
2 points | 1 comment