Hacker News new | ask | show | jobs
by Gettingolderev 683 days ago
Anyone knows if re-ordering also helps here?

If i can sort the lines of the matrix, which is probably defined by how the token embedding is setup, i could potentially zero out weights which do not contribute at all and have areas of zeros i could mark and skip?