by alzaeem 2162 days ago
If you're using deep learning models, consider distilled and/or quantized versions to reduce the compute and memory required for inference.
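
To make the quantization half of that concrete, here is a rough sketch of symmetric 8-bit quantization in plain Python. The weight values and the helper names (`quantize_int8`, `dequantize`) are made up for illustration. Real frameworks typically add calibration data and per-channel scales, so treat this as the basic idea rather than how any particular library does it.

```python
from array import array


def quantize_int8(weights):
    """Symmetric per-tensor quantization: map floats to integers in [-127, 127]."""
    max_abs = max(abs(w) for w in weights)
    # One shared scale for the whole tensor; fall back to 1.0 for all-zero input.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest integer step and clamp to the int8 range.
    q = array("b", (max(-127, min(127, round(w / scale))) for w in weights))
    return q, scale


def dequantize(q, scale):
    """Recover approximate float values from the stored integers."""
    return [v * scale for v in q]


# Hypothetical float32 weights, stored as 4 bytes each.
weights = array("f", [0.42, -1.30, 0.07, 0.99, -0.55, 1.27])

q, scale = quantize_int8(weights)
restored = dequantize(q, scale)

fp32_bytes = len(weights) * weights.itemsize  # storage before quantization
int8_bytes = len(q) * q.itemsize              # storage after quantization
max_error = max(abs(a - b) for a, b in zip(weights, restored))

print(f"float32: {fp32_bytes} bytes, int8: {int8_bytes} bytes")
print(f"scale: {scale:.6f}, max round-trip error: {max_error:.6f}")
```

The weights take a quarter of the storage, and each value is off by at most half a quantization step (`scale / 2`). Whether that error is acceptable depends on the model, so measure accuracy before and after.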