Hacker News
by gHA5 478 days ago
The underlying text generation model should be made aware that it can make sounds. When I asked, it told me it can't.

Also, for proper emotional dialogue, it needs to detect the emotion in the human's input. It seems to work from a transcript of the input instead.