Y
Hacker News
new
|
ask
|
show
|
jobs
by
wmf
1 day ago
That just sounds like a 3090.
1 comments
cyanydeez
1 day ago
not at the vram sizes that control how much context to load; also, GPUs arn't as effiecient as direct inference.
link
wmf
22 hours ago
OK, B70.
link