| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by froobius 349 days ago
	Transformers can easily be trained / designed to handle grids, it's just that off the shelf standard LLMs haven't been particularly, (although they would have seen some)

1 comments

nine_k 349 days ago

Are there some well-known examples of success in it?

link

thethimble 349 days ago

Vision transformers effectively encode a grid of pixel patches. It’s ultimately a matter of ensuring the position encoding incorporates both X and Y and position.

For LLMs we only have one axis of position and - more importantly - the vast majority of training data only is oriented in this way.

link