Those TFLOPS numbers are quite useless as they are "marketing peak TFLOPS". There's usually a 10-100× difference between that and actual computational capabilities in meaningful general workloads.
It only makes sense to compare specific, well-calibrated benchmarks, such as Linpack, which is what I did.
edit: you are right, this source is wrong, but we are getting closer fast.
A19 seems to be getting 2.3 tflops (still only 10%, but still a whole floor of computers vs a smartphone is crazy!).