Hacker News new | ask | show | jobs
by yourMadness 632 days ago
14B with Q4_K_M quantization is about 9 GB.

Remarkable that it is at all comparable to Sonnet 3.5

1 comments

Comparable, I guess. But the result is a lot worse compared to Sonnet for sure. Parts of the example code doesn't make much sense. Meanwhile Sonnet seems to have the latest API of Bevy considered, and mostly makes sense.