- dedicated hardware (https://cloud.google.com/tpu)
- optimized models (https://research.google/blog/turboquant-redefining-ai-effici...)