Right now we're using GPT-J-6B for our conversational model. We had to remove the iframe chat feature from our website, because we went kind of viral on Twitter Germany and the chat continued to crash. You can still try it out on:
https://alb.teddychat.click/
For the conversational stack this is our flow:
STT -> GPT -> TTS (on Tacotron2)