Hacker News
by superkuh 25 days ago
Like Scott Manley says, going from a frequency-domain image representation to a time-domain sound file is an extremely old technique that has not required AI at any point in the last 50 years. They just vibe coded the extremely old, extremely normal algorithmic solution. AI did not recreate the dead pilot's voice; it just made data preparation and coding a bit less work.

It's almost certain you've used software or seen/heard software output today that transformed between frequency domain and time domain. It's ubiquitous.
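
To make the frequency-domain/time-domain round trip concrete, here is a minimal sketch using only the Python standard library. It is not the code from the article; the test signal, its length, and the function names are all made up for illustration, and real software would use an FFT rather than this naive O(n^2) transform.

```python
import cmath
import math

def dft(samples):
    """Naive discrete Fourier transform: time domain -> frequency domain."""
    n = len(samples)
    return [
        sum(samples[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]

def idft(spectrum):
    """Inverse transform: frequency domain -> time domain."""
    n = len(spectrum)
    return [
        sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n
        for t in range(n)
    ]

# Hypothetical test signal: 5 cycles of a sine wave over 64 samples.
N = 64
signal = [math.sin(2 * math.pi * 5 * t / N) for t in range(N)]

spectrum = dft(signal)
# The strongest bin in the lower half of the spectrum is the tone's frequency.
peak_bin = max(range(N // 2), key=lambda k: abs(spectrum[k]))

# Going back to the time domain recovers the original samples (up to rounding).
recovered = [value.real for value in idft(spectrum)]
max_error = max(abs(a - b) for a, b in zip(signal, recovered))
```

One caveat: the round trip above is exact because the full complex spectrum is kept. A spectrogram image normally stores only magnitudes, so turning one back into sound also requires estimating the missing phase, for which long-established iterative methods such as Griffin-Lim exist.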

4 comments

FFTs are found in every nook and cranny of modern communications and computing.
It also works with time domain video files like audio visualizers: https://www.youtube.com/watch?v=E3gf88rSzqo

Nothing extremely surprising though.

It says in the article that the creator used OpenAI Codex, presumably because the spectrogram image wouldn’t have enough resolution by itself.
You are correct. I coded this in the late 1980s with digital sound domain experts.