Y
Hacker News
new
|
ask
|
show
|
jobs
by
gdiamos
217 days ago
Thanks I’ll check it out!
1 comments
gdiamos
217 days ago
Did I miss something?
https://github.com/NVlabs/Fast-dLLM/blob/main/llada/chat.py
That’s inference code, but where is the high perf web server?
link
That’s inference code, but where is the high perf web server?