Hacker News
by treprinum 856 days ago
> Show me a 30b+ parameter model doing RAG as part of a conversation with voice responses in less than a second, running on Nvidia

I built one, should be live soon ;-)

1 comment

Exciting! Looking forward to seeing it.