Hacker News
DeepSpeed-FastGen: High-Throughput Text Generation for LLMs (twitter.com)
3 points by schrodeenger 960 days ago