How to turn on advanced voice mode in ChatGPT – a guide to new AI feature

Among its many recent updates, OpenAI, the company behind ChatGPT, has announced the rollout of its new advanced voice mode feature, which allows users to have natural spoken conversations with its chatbot.

The company said the feature is not yet available in the European Union, the United Kingdom, Switzerland, Iceland, Norway, or Liechtenstein.

OpenAI’s co-founder and CEO, Sam Altman, wrote in a post on X: “Hope you think it was worth the wait.”

advanced voice mode rollout starts today! (will be completed over the course of the week)

hope you think it was worth the wait https://t.co/rEWZzNFERQ

— Sam Altman (@sama) September 24, 2024

Here’s what you need to know about it and how to turn advanced voice mode on in ChatGPT.

What is advanced voice mode on ChatGPT?

Voice conversations allow users to speak with ChatGPT, making interactions feel more natural. You can ask questions or have discussions through voice input, and ChatGPT will give a spoken response.

There are currently two types of voice conversations – standard and advanced.

Advanced Voice is rolling out to all Plus and Team users in the ChatGPT app over the course of the week.

While you’ve been patiently waiting, we’ve added Custom Instructions, Memory, five new voices, and improved accents.

It can also say “Sorry I’m late” in over 50 languages. pic.twitter.com/APOqqhXtDg

— OpenAI (@OpenAI) September 24, 2024

ReadWrite reported on OpenAI launching its new standard voice mode last month. Standard voice relies on several large language models (LLMs) to generate a response: it first transcribes what you say into text, then sends that text to OpenAI’s models for a reply. While standard voice is not natively multimodal like advanced voice, standard voice conversations still use GPT-4o alongside GPT-4o mini, and each prompt counts towards your message limits.

Where the advanced mode differs is that it uses GPT-4o’s native audio capabilities. As a result, OpenAI hopes to produce more natural, real-time conversations that pick up on non-verbal cues, such as how fast the user is speaking, and that can respond with emotion.
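OpenAI hasn’t published the ChatGPT app’s internals, but the multi-step pipeline behind standard voice (speech in, text reply, speech back) can be sketched with its public Python API. This is a minimal illustration only; the model names, voice, and file paths below are assumptions for the example, not the app’s actual implementation.

```python
# Illustrative sketch of a standard-voice-style pipeline:
# transcribe speech -> generate a text reply -> synthesize speech.
# Not how the ChatGPT app itself is built; model names and paths are assumptions.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# 1. Transcribe the user's spoken question into text.
with open("question.wav", "rb") as audio_file:
    transcript = client.audio.transcriptions.create(
        model="whisper-1",
        file=audio_file,
    )

# 2. Send the transcript to a text model for a reply.
reply = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": transcript.text}],
)
answer = reply.choices[0].message.content

# 3. Convert the text reply back into spoken audio.
speech = client.audio.speech.create(
    model="tts-1",
    voice="alloy",
    input=answer,
)
speech.write_to_file("answer.mp3")
```

Because each of these three steps is a separate request, a pipeline like this adds latency and loses non-verbal cues along the way, which is exactly the gap advanced voice’s native audio handling is meant to close.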

However, usage of advanced voice by Plus and Team users is limited daily.

How do you activate voice mode in ChatGPT?

In July, OpenAI introduced an audio-only Advanced Voice Mode to a small group of ChatGPT Plus users, with plans to expand it to all subscribers this fall.

While screen and video sharing were part of the initial demo, they are currently not available in this alpha release, and OpenAI has not provided a timeline for their inclusion.

Plus subscribers will receive an email notification when the feature is available to them. Once activated, users can toggle between Standard and Advanced Voice Modes at the top of the app when using ChatGPT’s voice feature.

To start a voice conversation, tap the Voice icon in the bottom-right corner of your screen.

If you’re using advanced voice, you’ll see a blue orb in the center of the screen when the conversation begins. For standard voice, the orb will be black instead.

OpenAI’s new advanced voice mode for ChatGPT shows a blue orb. Credit: OpenAI

During the conversation, you can mute or unmute yourself by tapping the microphone icon at the bottom left. And when you’re ready to end the chat, just hit the exit icon in the bottom right.

If it’s your first time starting a voice chat, or the first time using advanced voice, you’ll be asked to pick a voice. Just a heads-up, the volume in the selector might be a bit different from what you hear in the conversation.

The advanced voice mode feature is being rolled out to some Plus users. Credit: Suswati Basu for ReadWrite

You can always change your voice in the settings later, and advanced voice users can even adjust their voice directly from the conversation screen using the customization menu in the top-right.

Make sure you’ve given the ChatGPT app permission to use your microphone so everything works smoothly.

And if this feature isn’t available to you yet, you’ll see a headphones icon instead of the mute/unmute buttons. With both versions, you can interrupt the conversation and steer it in whatever direction feels right for you.

Is ChatGPT voice available?

If you’re signed in to ChatGPT through the iOS, macOS, or Android apps, you already have access to the standard voice feature. However, advanced voice is currently only available to Plus and Team users.

There’s a daily limit for using advanced voice, which might change over time, but you’ll get a heads-up when you’re close to the limit—starting with a 15-minute warning. Once you hit the limit, your conversation will switch to standard voice automatically.

Advanced voice doesn’t support things like images yet. An advanced voice conversation can be continued with text or standard voice, but not the other way around: conversations started in standard voice can always be resumed using standard voice or text, but not advanced voice. Advanced voice isn’t available with GPTs either, so you’ll have to switch to standard voice for those.

OpenAI hasn’t introduced certain accessibility features yet either. Subtitles aren’t available during voice conversations, but the transcription will appear in your text chat afterward. Also, you can only have one voice chat at a time.

Advanced voice can create and access memories and follow custom instructions, just like standard voice.

Is ChatGPT voice chat safe?

In August, OpenAI revealed there were some safety flaws with ChatGPT’s voice mode, but reassured users that it was on top of them. The company published a report on GPT-4o’s safety measures, addressing known issues that occur when using the model.

The “safety challenges” with ChatGPT’s voice mode include typical concerns such as generating inappropriate responses, including erotic or violent content, and making biased assumptions. OpenAI has trained the model to block such outputs, but the report highlights that nonverbal sounds, like erotic moans, violent screams, and gunshots, aren’t fully filtered. This means prompts involving these sensitive sounds might still trigger responses.

Another challenge is communicating with the model vocally. Testers found that GPT-4o could be tricked into copying someone’s voice or accidentally sounding like the user. To prevent this, OpenAI only allows pre-approved voices – not including a Scarlett Johansson-like voice, which the company has already removed. In addition, while GPT-4o can recognize voices, it has been trained to decline requests to identify speakers for privacy reasons, unless it’s identifying a famous quote.

Red-teamers also flagged that GPT-4o could be manipulated to speak persuasively, which poses a bigger risk when spreading misinformation or conspiracy theories, given the impact of spoken words. The model has been trained to refuse requests for copyrighted content and has extra filters to block music. And as a fun fact, it’s programmed not to sing at all. However, in this example from a user on X, the voice helps him tune his guitar by humming the notes.

Advanced Voice in ChatGPT tunes my guitar. pic.twitter.com/1H6mYZTCq7

— Pietro Schirano (@skirano) September 24, 2024

How can I stop sharing audio?

You can stop sharing your audio anytime by going to the data controls page in your ChatGPT settings. Just toggle off the “Improve voice for everyone” setting.

If you don’t see “Improve voice for everyone” in your Data Controls settings, that means you haven’t shared your audio with OpenAI, and it’s not being used to train models.

If you choose to stop sharing, audio from future voice chats won’t be used for model training. However, audio clips that were previously disassociated from your account may still be used to train OpenAI’s models.

OpenAI also mentioned that even if you stop sharing audio, it “may still use transcriptions of those chats to train our model” if the “Improve the model for everyone” setting is still on. To fully opt out, disable “Improve the model for everyone.”

Audio clips from your advanced voice chats will be stored as long as the chat remains in your chat history. If you delete the chat, the audio clips will also be deleted within 30 days, unless they’re needed for security or legal reasons. If you’ve shared your audio clips with OpenAI to help train models, those clips may still be used, but only after they’ve been disassociated from your account.

Featured image: Ideogram / Canva

