Hacker News new | ask | show | jobs
by gnulinux 936 days ago
MIDI is absolutely horrible for ML. It lacks very necessary information such as articulation etc which are important to make sense of music. It's popular because it's simple but there is no way to understand music by just looking at MIDI.

I'm a hobbyist in this space (am a composer myself as well a software engineer) and currently all tools are very poor. MusicXML is better than MIDI. MEI [1] is better than MusicXML etc.

The problem is there is miniscule amount of effort and money spent into this field because music overall makes peanuts. It really doesn't justify training expensive ML algorithms which is unfortunate.

[1] https://music-encoding.org/about/

2 comments

> MIDI is absolutely horrible for ML. It lacks very necessary information such as articulation etc which are important to make sense of music.

This depends enormously on the instrument. Consider someone playing a piece live on a keyboard: we can keep a MIDI recording of that and we've captured everything about their performance that the audience hears.

> MIDI is absolutely horrible for ML.

It depends what you're trying to do. If you're trying to generate sheet music that's pretty to look at and easily understandable to a performer, then yes obviously it's not enough. If you want notes that will actually sound good when played back, it's hard to beat it.

> If you want notes that will actually sound good when played back, it's hard to beat it.

I strongly disagree with this. There is no good algorithmic music generator trained on MIDI. They all generate elevator music.

Are you aware of the system I linked above? D.D. Johnson has a blogpost https://www.danieldjohnson.com/2015/08/03/composing-music-wi... with plenty of examples of what an instance of his model can generate. It may not be all that "good" in an absolute sense, but it's at least musically interesting, the opposite of elevator music. (There's also a proprietary model/AI called AIVA about which very little is known, but it does seem to be bona-fide AI output - albeit released in versions that have been orchestrated by humans - based on what it sounds like.)
Yes I'm familiar and...

> Here's a taste of things to come

it sounds like randomly generated MIDI... Doesn't sound like anything to me at all.

Music is very subjective but I've so far seen no model that's convincing. If you like it that's cool I suppose. I personally use algorithmic composing plenty in my own compositions (I write music for piano) and these kind of models don't do it for me. They're definitely tools, you can use them like ChatGPT to get a sense of things but we're decades away from producing "music" this way imho.

Well yes, this model like others is quite far from giving you a finished piece. But if it's giving you "a sense" of possibly useful ideas, that's enough to make it more than "random". (Besides, I'm not sure that we would even want AI to produce music on its own with zero human input - what would be the point of that? So "just noodling around, giving you some ideas to get started with" is quite good as far as it goes.)
My point is that -- in my experience and musical taste -- deterministic algorithms (e.g. literal scripts you write yourself to generate MusicXML, MIDI, lilypond, PDF etc) are orders of magnitude more useful than these neural network ML models that give you monolithic chunks of music. You can still use NN models in your scripts (e.g. have a model to determine chord distance, tonality etc) but there is no universal musical model that has come remotely close to convincing me so far. Of course, when it comes to Western classical music, "counterpoint" is probably the closest you can get to a universal musical model that you'll attempt to find in all pieces of music, but even that comes nowhere near even remotely close to explaining great majority of musical statements (even in something as contrapuntal as Bach). Especially when we come to 20th and 21st century music when people actively react to this model.