Hacker News new | ask | show | jobs
by bertr4nd 1347 days ago
By “fully fused” do you mean no function call boundaries? (“Fused” is such an overloaded term)
1 comments

Convolutions are fused into convolutions, elementwise operations are fused into convolutions, everything is inlined except where function calls are needed for pthread work units (and those work units are all custom/arbitrary).