| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by segalord 888 days ago
	How are you planning to train a model based on annotated actions done on a page? Wouldn’t it be more practical to have a model like PIX2ACT that understands the action to be performed and does it? Or are you planning to compose future actions from decomposing flows?

1 comments

I'm shocked that PIX2ACT isn't more common