Hacker News new | ask | show | jobs
by sheepscreek 714 days ago
Very loosely, isn’t this what is happening inside most LLMs that have a “multi-head” mechanism?