Ah this is different. The Nytimes article is about identifying/classifying dolphins from audio. This new model is about communicating with dolphins from generated audio.
The difference between recognizing someone from hearing them, and actually talking to them!
The difference between recognizing someone from hearing them, and actually talking to them!