Y
Hacker News
new
|
ask
|
show
|
jobs
by
heyitsdaad
202 days ago
Anything technically interesting you can share about any of these?
How do you decide on which parts to extract?
What models do you use for the features you listed?
1 comments
kokau
202 days ago
Sure, here’s what I can share: we use Veo 3, and Sora 2 is coming soon. For captions, we rely on FFmpeg combined with our own speech-to-text and dynamic subtitle API.
link