Y
Hacker News
new
|
ask
|
show
|
jobs
by
nrrbtrbbrb
141 days ago
> There are LLMs for image generation,
That part isn’t handled by an LLM
> voice generation,
That part isn’t handled by an LLM
> video generation
That part isn’t handled by an LLM
2 comments
famouswaffles
141 days ago
Yes it can be, and often is. Advanced voice mode in chatGPT and the voice mode in Gemini are LLMs. So is the image gen in both chatGPT and Gemini (Nano Banana).
link
notepad0x90
140 days ago
What is it handled by? I'm honestly curious, there are models specifically labeled as for those tasks.
link