That's not really true. 2kHz is a pretty big range to build a filter in (and with digital you can trivially sample at 192khz FFT and delete the high frequency data)
And the studio can master the recording from a high sample rate recording using an enormous FIR filter, because latency and computing power are mostly irrelevant.
High sample rate equipment just allows you to listen to studio intermediate data without having to have it downsampled first.
But there are problems. For example, if there is any non-linearity in the equipment this can easily cause inaudible tones in the ultrasonic range to intermodulate with each other to produce tones that are audible.
And the studio can master the recording from a high sample rate recording using an enormous FIR filter, because latency and computing power are mostly irrelevant.
High sample rate equipment just allows you to listen to studio intermediate data without having to have it downsampled first.
But there are problems. For example, if there is any non-linearity in the equipment this can easily cause inaudible tones in the ultrasonic range to intermodulate with each other to produce tones that are audible.
Nothing is simple.