|
|
|
|
|
by tomohelix
951 days ago
|
|
If so, they made a big bet. Vision LLMs were literally made this year. Before that, parsing images to get a coherent response is pretty resource intensive and not really reliable at all. Designing the entire device around image capturing for context seems like a very risky approach so I doubt that was their main reason. |
|