MusicGen-Looper: Generate fixed-bpm loops from text prompts | HN Mirror

Y	Hacker News new \| ask \| show \| jobs

	MusicGen-Looper: Generate fixed-bpm loops from text prompts (replicate.com)
	115 points by fagerhult 1094 days ago

13 comments

verst 1093 days ago

It doesn't seem to like slow music. I jokingly gave it a prompt to create a wedding ceremony processional music for the entrance of the bride at ~60bpm (I'm getting married in 2.5 weeks). EDIT: Exact prompt "wedding procession bride entrance"

File "predict.py", line 211, in predict raise ValueError( ValueError: Failed to generate a loop in the requested 60.23 bpm. Please try again.

EDIT: At 52 bpm (exact) it seems to work. What it generated would not sound good if looped however. In terms of style.. it sounded a little music box like - celesta or so (think of the beginning of the Harry Potter soundtrack) with some sustained strings and pizzicato strings. That would be appropriate, except the rhythm and chords are fairly random and I wouldn't exactly call this musical :)

yoz 1093 days ago

Many thanks for this joyous thing. I tried a few different prompts and always got something weird and interesting, but never quite what I wanted:

Firstly, with temperature set to 2:

“Amen break with a bag of spanners” (140 bpm): If the amen break is in there, I can’t tell. There does seem to be a kind of harp/bell thing doing the melody, though.

“John Bonham with kettle drums” (90 bpm): Lots of guitar, subdued drums, but could definitely be late-period Zeppelin. Variation 2 is the exception: Zep at the start and end, long pause in the middle so John can drag his sticks along a LEGO oil tanker.

“John Bonham with kettle drums and angry cat” (90 bpm): We are now inside the oil tanker.

Now setting the temperature to 1:

“Hardfloor in Luton Primark” (90 bpm): The bpm setting was an accidental leftover from the previous experiment, and the result sounds much more Primark than Hardfloor.

“Portishead at cheezy funfair” (110 bpm): It’s a very folk-y funfair. Accordions? Organs? What the hell?

Hours of fun! Again, many thanks!

dylan604 1093 days ago

is it just me that finds it weird that a thing that generates music loops can't loop the results in the provided player?

Edit: am i mis-interpreting the term looper here? it just made an output with a fade out.

KRAKRISMOTT 1094 days ago

How is this possible? I thought deep learning models struggle with approximating periodic functions like sin.

fagerhult 1094 days ago

Here is the MusicGen paper from Facebook research: https://arxiv.org/abs/2306.05284

MusicGen is an LLM on top of EnCodec tokens, instead of working directly with audio. EnCodec is neural audio compression algorithm that encodes audio as tokens from a codebook. It's a really clever trick!

IanCal 1094 days ago

The samples are outstanding. Even if they are cherry picked (not saying they are but even if) the output seems incredible.

https://ai.honu.io/papers/musicgen/

bravura 1094 days ago

bigvgan paper (https://arxiv.org/pdf/2206.04658.pdf)

"We introduce a proper inductive bias of periodicity to the generator by applying a recently proposed periodic activation called Snake function (Liu et al., 2020), defined as fα(x) = x + 1 α sin2 (αx), where α is a trainable parameter that controls the frequency of the periodic component of the signal and larger α gives higher frequency. The use of sin2 (x) ensures monotonicity and renders it amenable to easy optimization. Liu et al. (2020) demonstrates this periodic activation exhibits an improved extrapolation capability for temperature and financial data prediction."

footlose_3815 1094 days ago

"You must sign in to run this model"

In this day and age, curiosity is not worth the risk.

fagerhult 1094 days ago

Oops sorry about that, fixed now!

Minor49er 1093 days ago

Prompting with "harsh noise wall" resulted in some cool industrial breakbeats instead of raw noise. Looks like AI will not be taking Merzbow's job any time soon

onion2k 1094 days ago

I think it broke for me because I selected 140.06bpm. Adding 'step=1' to the range slider would help if the floating part was the cause.

JodieBenitez 1094 days ago

That thing refused to make me a 160 bpm chicago acid house loop... what a severe lack of imagination :-P

dylan604 1094 days ago

> what a severe lack of imagination

I think you just described chicago acid house :-P

JodieBenitez 1093 days ago

Ahaha... granted. Upvoting this even though I still groove to old Phuture tracks.

dylan604 1093 days ago

I could see where it would be much more simple for an "AI" to generate a techno sound compared to something more melodic. I tried getting it to make things, and just had no luck with anything that sounded close to what was being requested. Some of it had a beat, but I don't think I could dance to it (can't tell since it's not actually loopable), so I gave it a 2

tessierashpool 1093 days ago

fuck you

dang 1093 days ago

Please don't respond to a bad comment by breaking the site guidelines yourself. That only makes things worse.

https://news.ycombinator.com/newsguidelines.html

jnurmine 1094 days ago

I got some unexpected melodic mellow Indian-sounding sitar-music with the input prompt "suomisaundi". Not at all what I expected, but quite nice nevertheless!

It worked better with "suomisaundi psychedelic trance spugedelic".

This is a neat idea in many ways.

itairall 1094 days ago

I would love to see something like track based MusicGen for single instruments/channels that can be mixed.

Just seems like a fundamentally different problem than a photograph or painting.

DropInIn 1093 days ago

Yep... Everything I tried didn't work.... Only altered the BPM and text prompt...

Perhaps it's getting hugged to death...

marban 1094 days ago

Long way to go before this is going to replace Splice et al.

nerdbert 1094 days ago

Required me to make an account on github? No thanks.