by jorgemf 1078 days ago
As far as I know, you can preload the data onto the GPU before processing it. What is the difference or advantage of what you are proposing?

You wouldn't need the hardware to support arbitrary load/stores. In particular, you could get rid of (some of) the address lines...

I'm unsure if this would be much of a win.

Well, the data is way smaller than the model (at least on the current trend), and you probably still need random access for the model's weights. I am not sure it is a gain worth pursuing.