|
|
|
|
|
by TobTobXX
66 days ago
|
|
> Muse Spark is a natively multimodal reasoning model with support for [...] visual chain of thought [...]. Do they mean "the chain of thought is visible to the user" (ie. not hidden like ChatGPT), or "the medium of the chain of thought is not text, but visuals" (ie. thinking in images). I'd guess the former, since it wouldn't be economical to generate transient images, just for thinking. But I'm not sure why they'd highight that in that case. If it were the second thing, that'd be extremely interesting. The first model not to think in text. |
|