by nuand 3858 days ago
Seems like an interesting approach. However, the 21x speedup seems a little underwhelming considering the speed of the Zynq's programmable logic fabric and how parallelizable NNs are. They're quoting a 21x speedup over the ARM processor on the Zynq 7020, which is on par with what powers the Raspberry Pi 2. My guess is they didn't pipeline their design enough, or appropriately, causing one of the datapaths to significantly limit their throughput.
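
To illustrate the pipelining guess: in steady state a pipeline can only accept a new item as often as its slowest stage allows, so one unbalanced datapath caps the whole design. A rough sketch of that arithmetic follows. The stage cycle counts and the 100 MHz clock are made-up numbers for illustration, not figures from the design being discussed, and `pipeline_throughput` is a hypothetical helper.

```python
def pipeline_throughput(stage_cycles, clock_hz):
    """Steady-state items per second for a simple linear pipeline.

    stage_cycles[i] is how many clock cycles stage i needs per item.
    The slowest stage sets the rate at which new items can enter.
    """
    bottleneck = max(stage_cycles)
    return clock_hz / bottleneck


CLOCK_HZ = 100e6  # assumed fabric clock, purely illustrative

# Four stages that each finish in one cycle.
balanced = pipeline_throughput([1, 1, 1, 1], CLOCK_HZ)

# Same pipeline, but one datapath takes 8 cycles per item.
unbalanced = pipeline_throughput([1, 1, 8, 1], CLOCK_HZ)

print(balanced / unbalanced)  # 8.0
```

With these assumed numbers the single slow stage costs 8x in throughput even though the other three stages are unchanged, which is the kind of loss I'm speculating about above.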