Hacker News new | ask | show | jobs
by utopman 4 days ago
you are absolutely right. my title is very bad. I'll update it to a very less absolute statement. sorry for that.

I am now trying to sweet spot things with 27b model + tubo8 so I guess should have plenty quality context left.

the error I made in my tests is to stop with a working configuration that maximised my hardware use, and missing real deep software tests. The one shot 3D app I generated with previous setup is exactly telling this : I did not try my setup on real software development cases.

So thank you for guidance. I am not new using agentic code, but when it comes to proper setup with deep understanding of real trades off on inferences engines, I need more deep undestanding to make better decisions.

The 27b Q6_K turbo 8 for ~150K context should give me a real improvement on this stack. It's test party time :D

Edit : oops, I also found I cannot edit anymore my bold wrong title :/