by Gettingolderev
683 days ago
Does anyone know if re-ordering also helps here? If I can sort the rows of the matrix, whose order is probably determined by how the token embedding is set up, could I zero out the weights that do not contribute at all and end up with contiguous regions of zeros that I can mark and skip?
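
To make the question concrete, here is a rough sketch of what I have in mind, in plain Python. The threshold, the toy matrix and the helper names (`prune`, `reorder_rows`, `matvec_skipping`) are all made up for illustration, and it only skips whole zero rows rather than arbitrary blocks. Whether real weight matrices actually contain enough removable rows is exactly what I don't know.

```python
def prune(matrix, threshold):
    """Zero out weights whose magnitude is below the (assumed) threshold."""
    return [[w if abs(w) >= threshold else 0.0 for w in row] for row in matrix]


def reorder_rows(matrix):
    """Move all-zero rows to the bottom so they form one contiguous region.

    Returns the permuted matrix, the permutation (new index -> old index)
    and the number of rows that still contain non-zero weights.
    """
    is_zero = [all(w == 0.0 for w in row) for row in matrix]
    # sorted() is stable and False < True, so non-zero rows keep their order.
    perm = sorted(range(len(matrix)), key=lambda i: is_zero[i])
    n_active = is_zero.count(False)
    return [matrix[i] for i in perm], perm, n_active


def matvec_skipping(reordered, perm, n_active, x):
    """Compute y = W x in the original row order, skipping the zero region."""
    y = [0.0] * len(reordered)
    for new_i in range(n_active):  # rows at index >= n_active are all zero
        y[perm[new_i]] = sum(w * xj for w, xj in zip(reordered[new_i], x))
    return y


# Toy example with hypothetical values.
W = [[0.5, 0.001], [0.002, -0.001], [0.0, 2.0]]
x = [1.0, 2.0]
reordered, perm, n_active = reorder_rows(prune(W, threshold=0.01))
y = matvec_skipping(reordered, perm, n_active, x)
```

Here the middle row is pruned to all zeros and moved to the end, so the multiply only touches two of the three rows, and the permutation puts the results back in the original order.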