|
|
|
|
|
by throw149102
1361 days ago
|
|
To my ear, they all sound about okay for 4 seconds, until my brain recognizes that there's no tension being built or story being told. It's like every track is 4 seconds of music followed by 4 seconds of music followed by 4 seconds of music rather than a track with a real sense of progression. Many have said in this thread already that maybe we ought to expect that a ml approach in the next few months/years could be much better. I'm not so confident that it will happen so soon. Audio might end up being a much harder problem than visuals, for a variety of different reasons. Having the time domain built into the medium requires some concept of memory, and even modern neural nets seem to struggle remembering what they said before the most recent prompt. Once again though, its not impossible. Just requires the right techniques and enough people focused on it. |
|
What AI gives you is a mash up, a mix of people's intent, a mix of people's feelings. What I want is the result of a singular person expressing his singularity though his work, I don't want the "average of the best" music or the "average of the best picture". This is good for content creation, when you need to pump out the maximum amount of "content" for people to "consume" (see marvel, netflix&co), but not for art
Art that leave a mark is always weird/quirky/personal/deep/&c. the fact that a machine can replicate the result removes the most interesting part of the equation, the human part. It's like making your own bread vs buying supermarket bread, the later is cheaper and faster, it might even taste better if you fucked it up, but it's a complete different experience