Y
Hacker News
new
|
ask
|
show
|
jobs
by
vessenes
244 days ago
This is the spec for a training node. The inference requires 80GB of VRAM, so significantly less compute.
1 comments
andai
243 days ago
The default model is ~0.5B params right?
link