Your number is off by 64x.
It can do 125 petaflops at FP16:
https://www.tomshardware.com/tech-industry/artificial-intell...