sound and video generation are very very primitive compared to image gen. definitely something we want to beef up though!