I tried a few different things and couldn't get it to work. I think the problem is that the audio data is being sent as a text WebSocket message rather than a binary one. I got a whole bunch of "error: stream did not contain valid UTF-8" and "No JSON object could be decoded" errors, both of which I think came from the remote end.
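To illustrate the distinction I mean, here is a minimal sketch based on the WebSocket framing rules in RFC 6455, not on the actual client or server code involved here. The helper names (`is_valid_utf8`, `build_frame`) and the sample PCM bytes are made up for illustration. It shows that raw audio bytes are generally not valid UTF-8, which would be consistent with the first error if the receiver tries to decode a text frame, and that the only difference on the wire between a text and a binary message is the opcode in the first byte.

```python
import struct

OPCODE_TEXT = 0x1    # RFC 6455: payload must be valid UTF-8
OPCODE_BINARY = 0x2  # RFC 6455: payload is arbitrary bytes


def is_valid_utf8(data: bytes) -> bool:
    """Return True if data would be acceptable as a text-frame payload."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


def build_frame(payload: bytes, opcode: int, mask_key: bytes) -> bytes:
    """Build a single, final, client-to-server (masked) WebSocket frame.

    Hypothetical helper for illustration only; a real client library
    does this for you.
    """
    header = bytes([0x80 | opcode])  # FIN bit set, plus the opcode
    n = len(payload)
    if n < 126:
        header += bytes([0x80 | n])  # mask bit set, 7-bit length
    elif n < 65536:
        header += bytes([0x80 | 126]) + struct.pack("!H", n)
    else:
        header += bytes([0x80 | 127]) + struct.pack("!Q", n)
    # Client frames are XOR-masked with a 4-byte key.
    masked = bytes(b ^ mask_key[i % 4] for i, b in enumerate(payload))
    return header + mask_key + masked


# Made-up 16-bit little-endian PCM samples standing in for real audio.
pcm = struct.pack("<4h", -1, -32768, 1000, 32767)

# An all-zero key keeps the payload readable here; real clients use a random key.
key = b"\x00\x00\x00\x00"

audio_is_text_safe = is_valid_utf8(pcm)
binary_frame = build_frame(pcm, OPCODE_BINARY, key)
text_frame = build_frame(b'{"action": "start"}', OPCODE_TEXT, key)
```

In most client libraries the frame type follows from how you call send (a string versus bytes, or a separate binary method), so that is where I would look first.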