Hacker News new | ask | show | jobs
by 37ef_ced3 1346 days ago
Convolutions are fused into convolutions, elementwise operations are fused into convolutions, everything is inlined except where function calls are needed for pthread work units (and those work units are all custom/arbitrary).