Hacker News new | ask | show | jobs
by QuadrupleA 635 days ago
How good are TPUs in comparison with state of the art Nvidia datacenter GPUs, or Groq's ASICs? Per watt, per chip, total cost, etc.? Is there any published data?
2 comments

MLPerf is a good place to start. The only problem is you don't have any verifiable information about TPU energy consumption. https://mlcommons.org/benchmarks/inference-datacenter/
I have some company notes from early 2024 which cannot be accurate but could help,

TPU v5e [1]: not available for purchase, only through GCP, storage=5B, LLM-Model=7B, efficiency=393TFLOP.

[1] https://cloud.google.com/tpu/docs/v5e