Nobody has ever tested all possible inputs to 64 bit multiplication either. You can sample from the space.
https://en.wikipedia.org/wiki/Pentium_FDIV_bug
However, sampling really is the way to go when you face a combinatorial explosion. (If you can't prove it correct, that is.)