Hacker News new | ask | show | jobs
by calum-bird 1561 days ago
Please note: this does make use of a remote server, so use with caution. Working on locally-hosted version for anyone with ~48GB of VRAM (and change) to spare.
1 comments

Wouldn't that much VRAM just be needed for training the model? Not inference?
Haha I wish! That's just loading the weights for inference :)
I sense a humblebrag
Haha indeed, NLP is fun!