Hacker News new | ask | show | jobs
by lkozloff 942 days ago
Interesting - this was actually a feature that got me to convert. I have some slight annoyances with how it handles natural pauses as I collect my thoughts and speak, but overall it's been great.

I've had some interesting discussions and it's helped me structure some thoughts by asking questions and follow-ups, then summarizing our conversation... all while I'm on a walk.

One fun thing I did with my family was have it do an interactive adventure story starring us. We had an adventure, and then used the built in DALL-E to generate images of scenes from our adventure.

5 comments

While walking the dog today, it talked me through some trade-offs between DBSCAN and isolation forests. Walking + verbalizing the problem is a very different and positive experience for me.

I've also used it several times on ~15-20min drives to memorize something I wanted to have available for immediate recall. I had it chunk & quiz me, and by the end of the drive I had it down pat. Fun use of drive time.

A word of caution - I've asked ChatGPT 3.5 to generate quizzes based on books before and while most answers were right, a few were technically wrong, and some were outright fabrications (presented very confidently!)
LLMs fabricating things? Well I never!
The memorization use case is brilliant. Put your talk track for a presentation in and say “help me memorize this by quizzing me”. Thanks!
For a second there, I genuinely thought you had a very smart dog!
> I had it chunk & quiz me

Can you go a bit into how this works? How do you prompt it? This is the first use of ChatGPT I've heard of that would directly benefit me.

On a computer, I'd paste in a corpus of some sort as a regular ChatGPT message - just because it's easier to accumulate a big string. Note with GPT4 Turbo and the recent UI upgrades, the context window is so large now that you can paste a sizable body of knowledge, possibly even as an attached file.

I'd then switch to the phone and retrieve the chat from History.

Here's an example prompt I just used to help my son prepare for a DMV written test:

``` I'm going to paste a large list of questions and answers and then switch to voice mode. Once I indicate that I'm ready, begin quizzing me on these questions. Feel free to rephrase slightly. My goal is to achieve complete retention of all of these questions through quizzing and spaced repetition. The questions are California DMV questions. I am preparing to take the written test. 1. *Q:* You may drive off of the paved roadway to pass another vehicle. *A:* Under no circumstances.

2. *Q:* You are approaching a railroad crossing with no warning devices and are unable to see 400 feet down the tracks in one direction. The speed limit is... *A:* 15 mph. ... ```

> I have some slight annoyances with how it handles natural pauses as I collect my thoughts and speak

I hate this about all voice assistants. They force you to speak in an unnatural way.

It seems like there’s nothing intelligent about when they decide to respond, it’s like they just wait for X milliseconds of silence instead of using the context of what’s being said like a human would.

Sometimes it’s the opposite problem. You finish asking your question but you gotta wait for the assistant to pick it up whereas a human would understand that you’re finished talking based on what you said.

It might seem small and unimportant but I really think it’s one of the main reasons why voice assistants feel so… artificial.

That and long ass responses.

>They force you to speak in an unnatural way.

I think the same thing can happen when speaking to anyone with different societal/cultural factors than yours. We just become more accustomed to it over time and it's less noticed. I think if this GPT had a big green alien face then we would find speaking to it less strange, somehow.

Yeah, but I don't want to be code-switching with a computer.
One of the very few things I like about Alexa is that it has an option for being more forgiving about stammers and pauses. It actually works pretty well.

Maybe the same will make it into ChatGPT at some point.

> I hate this about all voice assistants. They force you to speak in an unnatural way.

It would be great to have an option to have ChatGPT stop when saying "over" - it does not feel natural to have to blurt the questions all at once.

> That and long ass responses.

you can ask it to be terse and shorten answers in the custom instructions section.

Yeah, I wish there was an option to verbally cue that you were finished talking... like an "over and out" thing. Do you know about the feature where you can press the circle to force it to listen and then release for it to answer?
It's interesting how in my frustration I intuitively tried that push-to-talk sorta-feature and it worked. Integrating something like Whisper and streaming live text could be neat, especially considering how different cultures handle conversational pauses and turn-taking. Wondering why there's no move towards full-duplex conversations in such tools.
Because it’s hard. I’m not aware of any full duplex speech assistant existing yet.
I wondered if this was possible, so I tried to build it, and I ended up making an app: Call Annie (https://apps.apple.com/us/app/call-annie/id6447928709). Feel free to try it out and let me know what you think
Tell the AI to respond with "..." until you end your prompt with a certain phrase.
I use it a lot while driving, and that's hard to do in that scenario, but it's cool otherwise.
You could probably instruct it to behave that way.
Oh, that's sounds like it could be a lot of fun. Thanks for sharing. I'll remember that for when my little one is a bit older :)
Is there a keyword to talk to it, an Alexa equivalent?