Hacker News new | ask | show | jobs
by vatican_banker 1850 days ago
My honest and unscientific opinion is that the difference _is_ discernible but the listener needs to know what to hear for. Also, the reproduction quality is impacted by several factors like room, equipment, and recording quality (not just speaker quality).

[Anecdotal] One example of the difference between MP3 and lossless: the "image" [1] on 256kbps MP3s is worse compared to the the original uncompressed, lossless, versions (but the listening room must be appropriately prepared to reproduce a good image).

This is a highly subjective topic. IMO we'll never reach full agreement. Personally, I listen MP3 while on-the-go and lossless music at home.

Important to keep in mind the "size" of the experiment. Two interesting quotes from the article in c't magazine:

> twelve participants would be asked to come to Hanover.

> It's true that the data we collected does not support watertight conclusions, but they do provide interesting insights.

[1] https://en.wikipedia.org/wiki/Stereo_imaging

3 comments

> Important to keep in mind the "size" of the experiment. Two interesting quotes from the article in c't magazine:

>> twelve participants would be asked to come to Hanover.

It's a mistake to apply vanilla statistical thinking here. The 12 participants were not randomly drawn from the German population, they were extremely skewed towards enthusiasts/professionals: audio engineers, an owner of an actual Nautilus 801, someone who worked on MP3/AAC at Fraunhofer IIS, someone who works preparing masters for Deutsche Gramophon. If these are the people who have enormous difficulty distinguishing 256kbps MP3 from the CD original, I'm certainly not going to worry that I am going to miss out on anything with 256kbps MP3.

If 12 Grand Slam participants tell me they can't tell the difference between a standard $100 and a $1000 high end tennis racket, I'm not going to delude myself into thinking that it's going to make any difference for me.

> It's a mistake to apply vanilla statistical thinking here. The 12 participants [...] were extremely skewed towards enthusiasts/professionals

It is still undetermined if having 12 highly-skilled professionals in the experiment is enough to have a conclusive experiment.

Also, this subject is so difficult to get right that the authors of the article themselves hedged by saying that experiment "does not support watertight conclusions".

This magazine did a test with mp3 https://www.heise.de/ct/artikel/Kreuzverhoertest-287592.html

The only one who was significantly able to tell if something was mp3 encoded or not, was a guy with a hearing damage who loved punk music. In fact, mp3 was developed for persons with normal hearing. So it is well possible that he was able to tell differences where other people were unable to.

Maybe the punk music had more to do with it. Sounds like the guy was keying off of subtleties of sonority and emotive quality which are a lot more fragile to digital processing.

It's quite easy to overprocess a digital audio file and wind up with something that is pristine as far as frequency response, but flat and 'pod people' like as far as emotive cues and intensity. Aliasing and cumulative losses to word length issues have a lot to do with it.

It's VERY easy to make digital stuff accurately represent frequencies like 2 Hz or 35kHz that our ears don't hear. It's a lot harder to make the digital stuff perform in the midrange when our perception can go, inconsistently and irregularly, waaaay beyond what we're used to thinking of as the limits.

I did some personal experiments back in the day when hard disks were expensive and found that the compression artefacts show up first in distorted guitars and cymbals, then brass instruments and everything else survives much lower bit rates. So that could explain why the punk rock fan hears the compression problems first.

By the way, the lossy compression algorithms don't try to produce exact frequency response but to leave out stuff that humans wouldn't hear anyway and compress the rest.

That's the original German version of the article which was translated in my hydrogenaudio link.
A good way to determine the point of transparency of lossy encoding for yourself, is to ABX test on your own equipment, with files you've converted yourself. A good way to do this is with Foobar2000's ABX plugin, which lets you compare back and forth and on whole tracks or short snippets if you want.

In my experience, headphones always yield the best results, and surprisingly it doesn't matter if I use the stock earbuds from my phone or a nice set of AKG over-ear headphones. It's not a matter of absolute sound quality, just the fact that you cut out room interactions and get the sound straight to your ears makes a big difference.

MP3 has some built-in flaws that no encoder can completely cover up, short sharp sounds like castanets really expose the pre-echo, harpsichord shows similar issues. It also has a tendency to make cymbals sound "washy" or "underwater", which all lossy codecs do to some degree, but MP3 is especially bad.

Still, at 192kbps I have to really focus to hear it in normal listening, but it's more or less always there even at 320kbps in problem tracks, if I really focus in on short sections. It just sounds subtly "off". But I hope no one actually listens to music like that, in short repeated sub-1 second sections to narrow in on a specific castanet snap ;-)

As for more modern codecs like Opus and AAC, it's generally completely transparent for me at 128kbps, and that's with a bit of playing it safe, I'm pretty sure I could drop Opus down to 96kbps. Modern codecs are really impressive.

I keep my music library in FLAC, both because I know it's CD quality and because it's an archive. I want to be able to convert the tracks to any new codec that may come along, if I need to.

My library is 280GB currently, and storage is cheap :-)

> I keep my music library in FLAC, both because I know it's CD quality and because it's an archive. I want to be able to convert the tracks to any new codec that may come along, if I need to.

I understand the sentiment. But the reality is, if the re-encoding is not likely going to happen within the next 10 years, your hearing will probably have deteriorated so much that you probably won't hear the difference anymore anyway (assuming you can hear a difference today, which is a big assumption).

I got the start of my music collection from my parents (as .wav's, or rather, I helped rip the cds). I intend to do the same. So it's not just one but several decades we're talking about.
> It also has a tendency to make cymbals sound "washy" or "underwater", which all lossy codecs do to some degree, but MP3 is especially bad.

Thank you for confirming this! I record my analog synths that I play through headphones off an old mixing board, however when it comes through my ADC->iPad, stuff seems to get lost and I spend time adjusting the mix and ADSR for recording. Have been seriously mulling a reel to reel, but many others have had the same idea and the market prices are astronomical.

It's not inherent to straight uncompressed PCM audio, it's strictly an artifact of lossy compression. A reel-to-reel tape deck will be noisier and extremely cumbersome compared to proper digital recording.

Recording should be done at 96kHz 24-bit or higher, to not have to meticulously optimize recording levels and to allow room for mixing and effects, without raising the noise floor to noticable levels.

Convert to normal CD quality as the last step before distribution.

>My library is 280GB currently, and storage is cheap

That’s around ¾ the amount of music I carry on my iPhone :-)

I don't collect music just to collect it, I only keep artists and albums around that I really like, or if it's something special and hard to find. Everything else is on YouTube or whatever for the rare occasion I need to listen to Metallica or AC/DC or something.
Yeah, same for me! I only keep albums that I really like. And from those I only keep that few thousand on my phone that I deem “essential”.
Agree with everything you say, but I would also add that the interactions with compression and other lossy signal processing that is frequently performed is not well studied. For example, when using Bluetooth headphones, it is likely that the music will be equalised/normalised, resampled to 48Khz (for mixing) and then re-encoded to a bluetooth codec e.g. LDAC. It is much safer to start with FLAC, if you cannot avoid such a signal chain.