by firewolf34
762 days ago
That's the best interaction I've seen - so many are just GPT piped into a TTS, but this seems to actually be identifying different speakers in the crowd? There's a point where a guy butts in to ask a question and the robot basically says "wait a sec, I'm talking to this person, are you done? Okay, now I'll answer your question, raise your hand." That's another layer of interaction that's impressive, and the speech recognition looks solid, with near-realtime latency. Very cool.