Benchmarking Triton (TensorRT) Inference Server for Transformer Models (blog.einstein.ai)
3 points by julien_c 2256 days ago