Hacker News new | ask | show | jobs
by Kwpolska 834 days ago
From the post title, I expected some material improvement to how meetings are conducted. Automatically transcribing and summarising the meeting does not seem like any serious improvement or anything novel.

Also, it seems to be focused on what is said in the meeting. But what if the meeting is heavy on the visuals (a PowerPoint presentation, a demo) and the spoken content doesn’t make much sense without seeing the shared content? I imagine the output will not be very useful in that case.

5 comments

Visual understanding is key to anything that isn’t just talking heads in a discussion and what almost all of these tools are missing, I think because it’s a lot harder then using whisper and OpenAI (or equivalent) and a lot more expensive/complex to ground a conversation in the visually shared data, speaking from experience of building a tool that does understanding of what you’re screen sharing. We focus heavily on this for recorded info, pairing it with data on what’s actually happening on screen (think mouse tracking and action ontology) and find it significantly changes the understood summary and transcript of any more white collar type presentation or share like a developer doing a brownbag (we’re not meeting software, more Loom-like record and share)

Circleback folks, cool product, if you want to add visual/screen shared data as a dimension of understanding for you hit me up (diamond@augmend.com). We’re setting up a service for a few meeting/recording understanding products and would love to help you out here too.

Echoing same thoughts here too and was even going to lay some kudos for any tool looking to provide some improvements to the more mundane day to day aspects that we still have to deal with because of a lack of much innovation.

Meetings, emails, and other mundane stuff like documentation generation and such would be really prime things I'd love to see tackled. Sadly, its not sexy enough to expend resources on unless you throw in some AI or other trendy hook.

Email in particular is the bain of my existence. Our tools around it suck (Outlook), provide no real management features (gmail) or seem to be stuck in the early 2000s (Thunderbird).

Particularly I am desperate for more developer friendly ways to script, automate and manage email correspondence and chains because I feel 80% of my day is wasted just reviewing wtf is in my inbox.

In many cases, if you don't have to worry about jotting down notes and action items during a meeting, the meeting can be conducted more efficiently. While not novel, a well-written meeting summary also makes it easier to quickly get an understanding of what was discussed and needs to be done as a result of the meeting, whether you attended it a few hours/days/months ago or didn't attend at all. Being able to automatically act on the outcomes is the really exciting part.

I agree that there's more that can be done for meetings that heavily rely on visuals. Right now, we save what was screen shared (if recordings are enabled) but the visual component isn't used in generating outcomes. Lots of interesting things that can be done with multimodal models here.

Most if not all of my meetings are around discussing what is shared on a screen. This is rarely a PowerPoint and often a variety of different data sources from ever changing presenters. Looking at 3d models in a review and making notes / Todo would be awesome. So far transcripts of such meetings are rather useless.
Meetings are hard because prople are in different states at all times (clients may not know what they want, teams may not know enough about hiw ti solve certain problems, joe may be lazy and hungry one day...). Human things make meetings annoying so I agree with your sentiment that this may not solve our problems.

However. "meetings rely on visuals sometimes instead of just spoken words" doesn't feel like a fundamental issue that breaks this thing. It just sounds like a feature request.