Hacker News
by elij 40 days ago
I'm using the 30B MoE model on the same spec with 65k tokens as a sub-agent with tooling, and it absolutely writes decent code. I agree the dense 9B wasn't great.