Hacker News new | ask | show | jobs
by fagerhult 1095 days ago
Here is the MusicGen paper from Facebook research: https://arxiv.org/abs/2306.05284

MusicGen is an LLM on top of EnCodec tokens, instead of working directly with audio. EnCodec is neural audio compression algorithm that encodes audio as tokens from a codebook. It's a really clever trick!

1 comments

The samples are outstanding. Even if they are cherry picked (not saying they are but even if) the output seems incredible.

https://ai.honu.io/papers/musicgen/