Hacker News
by markdeloura, 763 days ago
How much less energy does a query against an 8B model take, vs a 70B model? Can we get more clever about using smaller, more specialized models?