Hacker News
by woodson, 54 days ago
It depends heavily on the kind of data you’re processing (phone calls, podcasts, multi-person meetings recorded on a single channel?). For NVIDIA/NeMo, check out their Sortformer diarization models (there's a streaming version too).