Hacker News
by nelsonenzo 1308 days ago
I wanted to do this exact project - record audio all day and then have AI process it - to identify the behavioral outbursts of my autistic toddler.

It's critical information for early diagnosis and treatment, but it's really hard to capture the data while also dealing with the actual situation. Being able to send the sounds he makes to his therapist could also be useful when they are trying to get him to mimic sounds and talk.

With that said, is the audio AI open sourced? The part that analyzes the audio stream?

Thanks for the links to the hardware, also a really important part!

3 comments

I would guess that they're using OpenAI's Whisper, which is open source: https://github.com/openai/whisper

It does speech-to-text, then you can use the full force of all the text analysis tools that are out there.
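
For what it's worth, here is a rough sketch of that pipeline in Python. It assumes the open-source `openai-whisper` package is installed; the file name `day.wav` and the `find_mentions` helper are made up for illustration, and nothing here is confirmed to be what the project actually does.

```python
def transcribe(path, model_name="base"):
    """Run Whisper on an audio file and return its timestamped segments."""
    # Imported lazily so find_mentions below can be used without Whisper installed.
    import whisper  # assumption: installed via `pip install openai-whisper`

    model = whisper.load_model(model_name)
    result = model.transcribe(path)
    # Each segment is a dict that includes "start", "end" (seconds) and "text".
    return result["segments"]


def find_mentions(segments, keywords):
    """Return (start_seconds, text) for segments containing any keyword.

    Hypothetical helper: plain case-insensitive substring matching.
    """
    wanted = [k.lower() for k in keywords]
    hits = []
    for seg in segments:
        text = seg["text"].strip()
        if any(k in text.lower() for k in wanted):
            hits.append((seg["start"], text))
    return hits


# Example usage (needs Whisper and a real recording):
#   segments = transcribe("day.wav")
#   for start, text in find_mentions(segments, ["stop", "hurts"]):
#       print(f"{start:8.1f}s  {text}")
```

Because the matching is by substring, a short keyword like "no" would also hit "know", so a real version would probably want word-level matching.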

I've thought about this a lot.

My 8-year-old is autistic, and when he was little I struggled to capture evidence to give to Speech and Language Therapy. I wanted a way to record all the time and an easy way to pull out the key points.

Now I would love to correlate background noise (level and context) with meltdowns. We know crying babies set him off, which is obvious, but I would love to analyse further to spot other trends.
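
A minimal sketch of the "level" half of that idea, using only the Python standard library. It assumes a 16-bit mono WAV recording and a hand-logged list of meltdown times in seconds; all function names are hypothetical. It says nothing about the "context" (what kind of noise it was), which would need a sound classifier.

```python
import math
import sys
import wave
from array import array


def read_mono_16bit(path):
    """Load samples and sample rate from a 16-bit mono WAV file."""
    with wave.open(path, "rb") as wf:
        if wf.getsampwidth() != 2 or wf.getnchannels() != 1:
            raise ValueError("expected 16-bit mono audio")
        samples = array("h")
        samples.frombytes(wf.readframes(wf.getnframes()))
        rate = wf.getframerate()
    if sys.byteorder == "big":
        samples.byteswap()  # WAV data is little-endian
    return samples, rate


def window_levels_dbfs(samples, sample_rate, window_seconds=1.0):
    """Return the RMS level in dBFS for each full window of samples."""
    size = int(sample_rate * window_seconds)
    levels = []
    for i in range(0, len(samples) - size + 1, size):
        chunk = samples[i:i + size]
        rms = math.sqrt(sum(s * s for s in chunk) / size)
        # 32768 is full scale for 16-bit audio; pure silence maps to -inf.
        levels.append(20 * math.log10(rms / 32768) if rms > 0 else float("-inf"))
    return levels


def peak_before(levels, event_second, lookback=60):
    """Loudest level in the `lookback` seconds before a logged event.

    Assumes `levels` holds one value per second (window_seconds=1.0).
    """
    window = levels[max(0, event_second - lookback):event_second]
    return max(window) if window else None
```

Comparing `peak_before` at meltdown times against the same measure at random times of day would be one crude way to see whether loudness alone is a trigger.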

This one will suit you well. It even has a magnetic back so you can attach it to something: https://www.aliexpress.us/item/1005003535825295.html
That's a fantastic use case! The easiest way (and the one I'm currently using) is to upload the audio manually to:

https://replicate.com/openai/whisper