by zzzzzzzza 1071 days ago
I am suggesting the two strategies might have similar trade-offs/benefits, though I am not familiar enough with attention mechanisms to say for sure.

It's a comparison/analogy?