Hacker News
by
kaszanka
1197 days ago
Here is the magnet link for posterity: magnet:?xt=urn:btih:ZXXDAUWYLRUXXBHUYEMS6Q5CE5WA3LVA&dn=LLaMA
2 comments
psychphysic
1197 days ago
Thanks, but it's not working for me...
Not that I could run it if I downloaded it.
q1w2
1197 days ago
Great, now how do I run it? Do I need a GPU with over 65GB RAM?
version_five
1197 days ago
Try this; it's for running LLMs that won't fit on the GPU:
https://github.com/FMInference/FlexGen
gpm
1197 days ago
Currently that looks like it only supports Facebook's OPT and Galactica models, though they do appear to plan to add support for more.
rnosov
1197 days ago
Generally, you'll need to multiply the model size by two to get the required amount of video RAM. There are 4 sizes, so you might get away with an even smaller GPU for, say, the 13B model.
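As a rough sketch of that rule of thumb: the factor of two presumably comes from storing each parameter as a 16-bit float (2 bytes), which is an assumption here and not something stated above. The helper name `weights_vram_gb` is made up for illustration, and the list of sizes assumes the four released LLaMA variants are 7B, 13B, 33B and 65B. This counts weights only; activations and the attention cache need extra memory on top.

```python
# Back-of-the-envelope VRAM estimate for holding model weights only.
# Assumption: weights are stored in fp16, i.e. 2 bytes per parameter.
BYTES_PER_PARAM_FP16 = 2


def weights_vram_gb(params_billions: float,
                    bytes_per_param: int = BYTES_PER_PARAM_FP16) -> float:
    """Return approximate gigabytes (decimal GB) needed for the weights.

    1 billion parameters at N bytes each is N GB, so the billions cancel out.
    """
    return float(params_billions * bytes_per_param)


# Assumed LLaMA sizes, in billions of parameters (nominal, not exact counts).
LLAMA_SIZES_B = [7, 13, 33, 65]

if __name__ == "__main__":
    for size in LLAMA_SIZES_B:
        print(f"{size}B -> ~{weights_vram_gb(size):.0f} GB for weights in fp16")
```

By this estimate the 13B model needs about 26 GB for weights and the 65B model about 130 GB, so the largest one would not fit on a single consumer GPU without offloading or quantization.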
bioemerl
1197 days ago
Nope, more like 111 GB.