Hacker News new | ask | show | jobs
by pyinstallwoes 393 days ago
Where does it happen ?
1 comments

My personal theory is that it’s an emergent property of many attention heads working together. If each attention head is a bird, reasoning would be the movement of the flock.