Hacker News new | ask | show | jobs
by hfmuehleisen 2240 days ago
The crux here is the transfer of data into the very limited RAM of the GPU. If the data is already conveniently placed on the GPU, they are indeed thousands of time faster. But this is mostly a situation that is artificially created for benchmarks. If the transfer is taken into account, usually CPU-based systems win.

Database query processing (SQL) usually requires a very low amount of compute cycles per byte of data (e.g. filter, hash table builds, ...), which means that the GPU cannot really take advantage of its compute power.

For ML this is a very different situation, lots of compute on few bytes, which is why we see GPUs and TPUs seriously being used there.

1 comments

The point that ML requires lots of compute on few bytes makes senses as ML performs mostly repetitive matrix operations, but doesn't a series of SQL queries on database data with a "star schema" also require lots of compute on the same bytes?