by parineum 211 days ago
> LLMs understand text on a human level (even if they have other limitations).

Limitations like understanding...

1 comment

The problem is the data. LLM data is self-supervised. Vision data is very sparsely annotated in the real world. Going a step further, robotics data is much sparser. So getting these models to improve on this long-tail distribution will take time.