by btown 2738 days ago
(FYI your https://deepgram.com/v2/docs links are giving "error": "Not Found" JSON responses.)

I love progress in this space. Something I also think is necessary, though, is innovation in the discoverability interfaces around speech data. Can you search over potential transcriptions weighted by their likelihood, rather than just doing full-text search on the most-likely transcriptions? Can you visualize multiple potential transcriptions inline without overloading someone's visual cortex with information? Can you one-click-to-listen to any specific line? Can you enable people to switch conversations on the fly to an "off-the-record" mode, with such confidence that the default can be that every conversation is highlighted? Can you do all of this from Slack? Can you make setup a one-click process with Twilio OAuth? Can you do all of this from a web app that requires no coding?
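To make the first question concrete, here is a rough sketch of likelihood-weighted search over N-best hypotheses. Everything in it is an assumption for illustration: the `utterances` structure, the `search` function, and the probabilities are made up, and this is not how Deepgram (or any particular API) exposes alternatives.

```python
# Hypothetical N-best output: each utterance ID maps to a list of
# (candidate transcription, probability) pairs from a recognizer.
utterances = {
    "call-1": [("please recognize speech", 0.6),
               ("please wreck a nice beach", 0.4)],
    "call-2": [("we went to the beach", 0.9),
               ("we went to the peach", 0.1)],
}


def search(utterances, query):
    """Rank utterances by the probability mass of hypotheses containing every query word."""
    terms = set(query.lower().split())
    results = []
    for utt_id, hypotheses in utterances.items():
        total = sum(p for _, p in hypotheses)
        if total <= 0:
            continue  # no usable probabilities for this utterance
        # Sum the normalized probability of each hypothesis that matches all terms.
        matched = sum(p for text, p in hypotheses
                      if terms <= set(text.lower().split()))
        score = matched / total
        if score > 0:
            results.append((utt_id, score))
    # Most likely matches first.
    return sorted(results, key=lambda r: r[1], reverse=True)


for utt_id, score in search(utterances, "beach"):
    print(utt_id, round(score, 2))
```

A plain full-text search over the top transcription would only find "call-2" for the query "beach"; the weighted version also surfaces "call-1" with a lower score, which is the kind of result a UI could show as "possible match, click to listen."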

All this, I'm sure, is part of an ecosystem that will be built on tools like yours, and that ecosystem fundamentally depends on the quality of the data - so it makes sense for you all to focus there first. But to the extent you want to capture the entire "stack," there's a tremendous space for someone to take the level of "passion" for data quality and apply that same instinct to quality-of-experience.

1 comment

This is a seriously fertile area where you get to "define the new interface".

It's a big problem though, since few buyers know they want those things. Around 95% of customers come into it with "give me the transcripts" and discover over time that they want these other things too (some graphical, some technical). They just didn't know those things were available.

New GUIs and data representations are a big part of it. Getting accuracy and scale in place is a big part. Building awareness and distribution of what's possible now is another big part.

Re: the JSON error: we fixed the doc link you saw (it was pointing to the wrong place since we _just_ updated it).

The real docs link is: https://brain.deepgram.com/docs

Tableau (and the general business analytics space) has done a good job of reframing the problem as: "don't think about what you want as a leader at a company; instead, democratize data access so your team can decide what it wants, and pay for democratization, not for your own features." See for instance: https://www.forbes.com/sites/briansolomon/2016/05/04/how-tab...

Arguably Elastic is a success story about bridging the worlds of an API-first technical stack with a democratized non-technical analytics framework. And they started by just powering excellent search, and building value-add layers over time. But they built into a then-vacuum of API offerings, whereas there are many other (potentially inferior, but well-funded) speech-to-text APIs. I'll be avidly following you guys as you navigate the space, and hopefully you're able to find some good "hooks" or uniquely-easy-to-roll-out integration stories that strike a balance between focus on technical excellence and driving awareness in a super-linear way.