HN Mirror


by CrypticShift 1293 days ago

> predict the next millisecond of audio based on previous milliseconds of audio

Not milliseconds, but AudioLM [1] already does this given just a few seconds of audio, for speech (and piano). The results are already very convincing (to me).

[1] https://google-research.github.io/seanet/audiolm/examples/

1 comment

nilozd 1292 days ago

Yes, but I think she's talking about something closer to real time: generating new output as the input streams in (maybe like sliding windows, from a statistics perspective).
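To make the windowed idea concrete, here is a rough sketch of what "predict as the input streams in" could look like. Everything in it is hypothetical: `stream_predict` and `linear_extrapolation` are made-up names, and the extrapolation is only a stand-in for a real learned model. It is not how AudioLM works.

```python
from collections import deque
from typing import Callable, Deque, Iterable, Iterator, List


def stream_predict(
    samples: Iterable[float],
    window_size: int,
    predict_next: Callable[[List[float]], float],
) -> Iterator[float]:
    """Yield one prediction per incoming sample once the window is full."""
    # A bounded deque acts as the window over the most recent samples.
    window: Deque[float] = deque(maxlen=window_size)
    for sample in samples:
        window.append(sample)  # the oldest sample drops out automatically
        if len(window) == window_size:
            yield predict_next(list(window))


def linear_extrapolation(window: List[float]) -> float:
    """Placeholder predictor (assumption): extend the line through the last two samples."""
    return 2 * window[-1] - window[-2]


# Toy input: a ramp, so the extrapolation is easy to check by hand.
predictions = list(stream_predict([0.0, 1.0, 2.0, 3.0, 4.0], 3, linear_extrapolation))
```

Because `stream_predict` is a generator, it emits each prediction as soon as the corresponding sample arrives, instead of waiting for the whole clip. Swapping in a real model would only mean passing a different `predict_next`.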
