|
|
|
|
|
by echelon
713 days ago
|
|
The music, film, and game industries are about to be completely disrupted. LLMs and AGI might be hogwash, but processing multimedia is where Gen AI and especially diffusion models shine. Furthermore text-to-{whatever} models might produce slop, but Gen AI "exoskeletons" (spatial domain, temporal domain editors) are Photoshop and Blender from next century. These turbocharge creatives. Hearing and vision are simple operations relative to reasoning. They're naturally occurring physical signals that the animal kingdom has evolved, on several different occasions, to process. This is likely why they're such a low hanging fruit to replicate with Gen AI. |
|