HN Mirror


by heyitsdaad 202 days ago

Anything technically interesting you can share about any of these?

How do you decide on which parts to extract?

What models do you use for the features you listed?

1 comment

kokau 202 days ago

Sure, here’s what I can share: we use Veo 3, and Sora 2 is coming soon. For captions, we rely on FFmpeg combined with our own speech-to-text and dynamic subtitle API.
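For readers wondering what "FFmpeg combined with speech-to-text" might look like in practice, here is a minimal sketch. It is not the pipeline described in the comment above. It assumes the speech-to-text step yields `(start_seconds, end_seconds, text)` tuples, and the helper names `srt_timestamp`, `segments_to_srt`, and `burn_in_command` are hypothetical. The only FFmpeg feature it relies on is the `subtitles` video filter, which requires a build with libass.

```python
# Hypothetical sketch: turn transcript segments into an SRT file and
# build an FFmpeg command that burns the captions into the video.
# The segment format and all function names are assumptions for illustration.

def srt_timestamp(seconds: float) -> str:
    """Format a time offset in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    ms = round(seconds * 1000)
    h, ms = divmod(ms, 3_600_000)
    m, ms = divmod(ms, 60_000)
    s, ms = divmod(ms, 1000)
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"


def segments_to_srt(segments) -> str:
    """Render (start, end, text) tuples as numbered SRT cue blocks."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{srt_timestamp(start)} --> {srt_timestamp(end)}\n"
            f"{text.strip()}\n"
        )
    # A blank line separates consecutive cues.
    return "\n".join(blocks)


def burn_in_command(video_path: str, srt_path: str, output_path: str) -> list:
    """Build an argument list for FFmpeg's `subtitles` filter (needs libass).

    Paths containing characters such as ':' or '\\' need extra escaping
    inside the filter string; this sketch assumes simple file names.
    """
    return [
        "ffmpeg",
        "-i", video_path,
        "-vf", f"subtitles={srt_path}",
        "-c:a", "copy",  # leave the audio stream untouched
        output_path,
    ]
```

To use it, write the result of `segments_to_srt(...)` to a file such as `captions.srt`, then pass the list from `burn_in_command("in.mp4", "captions.srt", "out.mp4")` to `subprocess.run`. Animated or word-by-word subtitles would more likely need the ASS format instead of SRT, which this sketch does not cover.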
