Y
Hacker News
new
|
ask
|
show
|
jobs
user:
logotype
created:
2012-03-06
karma:
377
intentionally left blank.
submissions:
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
Stateful Inference for Low-Latency Multi-Agent Tool Calling
2 points
|
1 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
Attention Once Is All You Need: Stateful Transformers
3 points
|
4 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
0 points
|
0 comments
New inference engine faster than vLLM, SGLang, TRT-LLM
2 points
|
3 comments
0 points
|
0 comments