|
|
|
|
|
by pumanoir
1206 days ago
|
|
I think is feasible.
The description even says is designed to save on vram[1].
I don't get the other comments about needing more vram than a 3090. Also, Neuralmagic may run their sparsification on ARM cpu's in the future, so keep an eye. 1. ChatRWKV v2: with "stream" and "split" strategies. 3G VRAM is enough to run RWKV 14B :) |
|