Hacker News
by transitivebs 1268 days ago
The biggest question I have after building something similar is: what's the best way to break up transcripts into segments? You want the segments to be long enough to extract useful semantic info, but you don't want them to be too long either.
1 comment

60-second segments with a 30-second overlap window seem to work quite well for me, but YMMV.
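For anyone who wants to try this, here is a rough sketch of that kind of sliding window in Python. It assumes the transcript is already a list of (start time in seconds, text) cues, which is my assumption and not something specified above. The function name and the defaults are only illustrative; a 60 second window with 30 seconds of overlap means a new window starts every 30 seconds.

```python
from typing import List, Tuple

# Hypothetical input shape: (start time in seconds, text) per transcript cue.
Cue = Tuple[float, str]
# Output shape: (window start, window end, joined text of cues in the window).
Segment = Tuple[float, float, str]


def segment_transcript(
    cues: List[Cue], window: float = 60.0, overlap: float = 30.0
) -> List[Segment]:
    """Group cues into fixed-length windows that overlap their neighbours."""
    if not 0 <= overlap < window:
        raise ValueError("overlap must be >= 0 and smaller than window")
    if not cues:
        return []

    step = window - overlap  # distance between consecutive window starts
    ordered = sorted(cues, key=lambda cue: cue[0])
    last_start = ordered[-1][0]

    segments: List[Segment] = []
    k = 0
    while k * step <= last_start:
        start = k * step
        end = start + window
        # A cue belongs to a window if it starts inside [start, end).
        text = " ".join(t for s, t in ordered if start <= s < end)
        if text:  # skip windows that contain no speech
            segments.append((start, end, text))
        k += 1
    return segments


if __name__ == "__main__":
    demo = [(0, "a"), (20, "b"), (40, "c"), (70, "d"), (95, "e")]
    for seg in segment_transcript(demo):
        print(seg)
```

With the defaults, a cue at 40 seconds lands in both the 0 to 60 window and the 30 to 90 window, so context near a boundary is not lost. The trade-off is roughly twice as many segments to embed and store, so it's worth tuning both numbers for your own data.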