Hacker News new | ask | show | jobs
by Grimm1 2076 days ago
Good work getting TensorRT running we had a real pain in the butt recently when working with it and just opted to go with ONNXRuntime, their graph optimizer and their TensorRT backend -- may not be as fast as straight TensorRT from comparisons I've seen but it got us to a competitive inference and latency so we're happy with it.
1 comments

Nice one! I've long been interested in the ONNX serving path.