|
|
|
|
|
by dmitrykan
680 days ago
|
|
I'm working on the tool, that includes AI. My original target is to test it on my https://www.youtube.com/c/VectorPodcast by offering something that Lex Fridman does for his episodes. Current features:
1. Download from YT
2. Transcribe using Vosk (output has time codes included)
3. Speaker diarization using pyannote - this isn't perfect and needs a bit more ironing out. What needs to be done:
4. Store the transcription in a search engine (can include vectors)
5. Implement a webapp If anyone here is interested to join forces, let me know. |
|