They do say that the audio is processed locally, but that does not preclude them from making an API call to find a signature match.
> The audio stream is processed locally on your device and never recorded.