by 7ewis
1246 days ago
Hey Alex, this is really cool. I've used ChatGPT quite a bit, and Stable Diffusion in the past, but still feel like I've only scratched the surface of what's possible. Great to see lots of projects popping up using the tech in new, innovative ways!

Please could you give an overview of how this actually works? I have some ideas of where the tech could be useful, but I'm not sure how I'd actually go about implementing it.

Do you have a GPT model on a server and code to transcribe the video, then summarise the transcription? Or do you use one of the APIs from OpenAI?

If you use their APIs:

* How costly has it been to run your service? (If you don't mind answering)
* Is it customisable? If you wanted to run a chat bot, for example, would you be able to make it understand the request (I'd assume something similar to an 'intent' when developing Alexa skills) and give it data so it knows the answer?
> Please could you give an overview of how this actually works?
1. I download the YouTube subtitles. (It doesn't work for videos without YouTube subtitles yet; my analytics show that 15% of videos don't have them. I tried OpenAI Whisper, but it takes several minutes to transcribe a video, so I've put that task off for now.)
2. Then I break the transcript into parts.
3. Then I summarize each part with GPT → and then I summarize the summaries to get chapter names → and then I summarize again to get the title.
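The three steps above can be sketched roughly as follows. This is only an illustration under assumptions, not the actual code behind the service: the names `split_transcript`, `summarize_video`, and `fake_summarize`, the 500-word chunk size, and the prompt wording are all made up for the example. The `summarize` argument stands in for whatever function calls the GPT API.

```python
from typing import Callable, Dict, List, Union

# Hypothetical signature: (instruction, text) -> model output.
Summarizer = Callable[[str, str], str]


def split_transcript(text: str, max_words: int = 500) -> List[str]:
    """Break the transcript into parts of at most max_words words (assumed size)."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def summarize_video(
    transcript: str, summarize: Summarizer, max_words: int = 500
) -> Dict[str, Union[str, List[str]]]:
    """Summarize each part, then the summaries, then summarize again for a title."""
    parts = split_transcript(transcript, max_words)
    # Pass 1: one summary per part of the transcript.
    part_summaries = [
        summarize("Summarize this part of a video transcript:", part) for part in parts
    ]
    # Pass 2: summarize the summaries to get chapter names.
    chapters = summarize("Write chapter names for these summaries:", "\n".join(part_summaries))
    # Pass 3: summarize again to get the title.
    title = summarize("Write a short title for these chapters:", chapters)
    return {"part_summaries": part_summaries, "chapters": chapters, "title": title}


def fake_summarize(instruction: str, text: str) -> str:
    """Stand-in for a real model call: keeps the first five words of the text."""
    return " ".join(text.split()[:5])
```

To try it without an API key, call `summarize_video(transcript, fake_summarize)`; swapping `fake_summarize` for a function that sends the instruction and text to a model would give real summaries.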
Yes, I use the OpenAI GPT API. I pay their standard pricing for davinci-003, and the cost for one video is between $0.10 and $0.90 depending on the video length (actually, the transcript length). I have a hard limit to prevent abuse.
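Since cost scales with transcript length, a hard limit could be enforced with a rough estimate before any API call. The sketch below is an assumption about how that might look, not how the service does it: the function names are hypothetical, the price per 1,000 tokens is a parameter you would fill in from the current pricing page, and 4 characters per token is only a rule of thumb for English text. It also ignores output tokens and the extra summarization passes.

```python
def estimate_cost_usd(
    transcript: str, price_per_1k_tokens: float, chars_per_token: float = 4.0
) -> float:
    """Rough input-side cost estimate; chars_per_token is an assumed heuristic."""
    estimated_tokens = len(transcript) / chars_per_token
    return estimated_tokens / 1000 * price_per_1k_tokens


def within_hard_limit(
    transcript: str, price_per_1k_tokens: float, max_cost_usd: float
) -> bool:
    """Return True if the estimated cost stays at or under the chosen cap."""
    return estimate_cost_usd(transcript, price_per_1k_tokens) <= max_cost_usd
```

For example, `within_hard_limit(transcript, price_per_1k_tokens=0.02, max_cost_usd=0.90)` would reject transcripts whose estimate exceeds $0.90; both numbers here are placeholders.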
Yep, it's fully customizable. Yes, you can provide data to it. It would take 1 hour of coding to make a prototype of a chat bot. And then 500 hours to make it work well.