| HN Mirror

tradeoff worth naming: you avoid the autodiff graph overhead (hence the speedup), but any architecture change means rewriting every gradient by hand. fine for a pedagogical project, but that's exactly why autodiff exists.

link

love2read 114 days ago

Can you share a link?

link

WithinReason 113 days ago

https://www.ideone.com/VAz4Nn

Doesn't run inside IDEone due to the external download link, but you can copy&paste the code over

link

freakynit 111 days ago

24x speedup (over 10x already) and similar loss profile (for c++ version, optimized by claude): https://gist.github.com/freakynit/3982eab8413a89941bd0018e63......

link

verma7 108 days ago

This is amazing! Thanks for optimizing the code using Claude!

link