Hacker News new | ask | show | jobs
by andy_ppp 272 days ago
In reality it’s probably not a RELU modern LLMs use GeLU or something more advanced.