Hacker News new | ask | show | jobs
DeepSpeed-FastGen: High-Throughput Text Generation for LLMs (github.com)
3 points by schrodeenger 957 days ago