|
|
|
|
|
by johnsmith1840
397 days ago
|
|
I seem to be alone in this but the only methods truly good at coding are slow heavy test time compute models. o1-pro and o1-preview are the only models I've ever used that can reliably update and work with 1000 LOC without error. I don't let o3 write any code unless it's very small. Any "cheap" model will hallucinate or fail massively when pushed. One good tip I've done lately. Remove all comments in your code before passing or using LLMs, don't let LLM generated comments persist under any circumstance. |
|
I wouldn't be shocked if huge, expensive-to-run models performed better and if all the "optimized" versions were actually labs trying to ram cheaper bullshit down everyone's throat. Basically chinesium for LLMs; you can afford them but it's not worth it. I remember someone saying o1 was, what, 200B dense? I might be misremembering.