| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by yujonglee 320 days ago
	It can be solved with speaker segmentation/embedding models, although it is not perfect. One thing we do with Hyprnote is that we have a Descript-like transcript editor that allows you to easily edit/assign speakers. Once we integrate a speaker diarization model with that, I think we'll be in good shape. If you are interested, you can join our Discord and follow updates. :) https://hyprnote.com/discord

2 comments

mijoharas 320 days ago

Oh awesome, I was reading through to see about whether it had speaker diarization (why I got rid of my whisper script I use).

I'll look forward to the Linux version.

Is there any chance of a headless mode? (I.e. start, and write transcript to stdout with some light speaker diarization markup. e.g. "Speaker1: text")

link

yujonglee 319 days ago

> Is there any chance of a headless mode?

maybe. we might be able to add extension system that each extension can have that info and do whatever it want within the app.

> I'll look forward to the Linux version.

https://github.com/fastrepl/hyprnote/issues/67 We have open issue. You might want to subscribe to it!

link

apwell23 320 days ago

our conference rooms even have some sort of rotating camera contraption that automatically focus on the person speaking

link