Hacker News new | ask | show | jobs
WINA: Weight informed Neuron activation for accelerating LLM inference (arxiv.org)
2 points by Ratelman 382 days ago