|
|
|
|
|
by benjiro29
4 hours ago
|
|
GLM 5.2 Max = Opus 4.8 Max in thinking behavior. The thinking chain is so similar, and so is the amount of token usage on the output. If you want reasonable token usage, you need to run it GLM 5.2 at High. There is little drop in quality from Max to High (for most tasks). And it cuts token usage by 2 a 2.5x. GLM 5.2, Max is really something you only need for complex tasks. In essence, GLM 5.2 is Opus 4.8 its little brother, at a way, WAY cheaper price. There has been really no training on Opus models going on, really, none i tell you! /sarcasm |
|