Y
Hacker News
new
|
ask
|
show
|
jobs
by
BatteryMountain
1254 days ago
With this card you can also run Open AI's Whisper with the Large model (the multilingual one!), as it requires 10GB.
2 comments
crabbycarrot
1253 days ago
Highly recommend quantizing the model (
https://pytorch.org/tutorials/recipes/recipes/dynamic_quanti...
). I converted the large model to use int8, and I'm able to run it 5x real-time on CPU with pretty low RAM requirements with still very good quality.
link
Const-me
1253 days ago
My implementation of Whisper uses slightly over 4GB VRAM running their large multilingual model:
https://github.com/Const-me/Whisper
link