Fantastic demo. Do you know what the difference is between your stack and the LiveKit demo [1]? It shows your voice as text, so you can see when you have to correct it.
Llama3 with ears just dropped (direct voice token input), which should be awesome with Cerebras [2].