by GaggiX
1075 days ago
I was going to search the internet for this, but then I realized you are the author (and I don't think there is anything online). I imagine the activations are left in FP16 and the weights are converted to FP16 during inference, right? Btw, very cool.