How to use OpenAI's advanced voice mode for ChatGPT

OpenAI has introduced an advanced voice mode for audio chats with ChatGPT.

Jordan Novet, CNBC

ChatGPT is ready for more natural audio chats.

OpenAI announced Tuesday that its popular chatbot now has an enhanced voice feature for users who pay for the premium service. The tool enables more fluid conversations.

The rollout will continue throughout the week. The company said the feature is not yet available in the EU, Iceland, Liechtenstein, Norway, Switzerland or the UK.

OpenAI announced the new feature in May. The launch received a lot of attention because a voice named Sky resembled that of Scarlett Johansson in the 2013 film “Her.” Legal counsel on behalf of Johansson sent OpenAI letters claiming the company did not have the right to use the nearly identical voice, and OpenAI suspended the use of that voice in its products, CNBC reported.

In the months that followed, users were able to have ChatGPT speak to them in other voices through the free tier. The advanced version is more responsive, and it will stop talking and listen when you interrupt. There are now nine voices to choose from, and you can enter instructions for voice chats in the Customizations section of the app's settings.

“I hope you think it was worth the wait,” Sam Altman, co-founder and CEO of OpenAI, wrote in a post on X on Tuesday.

It is an increasingly competitive space for OpenAI, which is backed by Microsoft.

In recent weeks, Google released its own Gemini Live voice feature in English on Android devices. And on Monday, Reuters reported that Meta will introduce celebrity voices this week, accessible via Facebook, Instagram and WhatsApp.

OpenAI got a head start on the generative AI chatbot market when it launched ChatGPT in late 2022. In August, OpenAI told the media that ChatGPT had over 200 million weekly active users.

Advanced mode is only available to subscribers of OpenAI's Plus, Team, or Enterprise plans. The cheapest option is the Plus tier at $20 per month.

How to get started

Once you pay, you can easily get started, provided OpenAI has granted access to your device.

First, make sure you have the latest version of the app on your phone.

Open the ChatGPT app.

OpenAI says you'll receive a notification in the app once access to the new feature is enabled. Click the “Next” button to get started.

Create a new chat by swiping right or tapping the two-line icon in the top left corner and selecting ChatGPT at the top. To the right of the message text box and microphone icon, you should see a sound wave icon. Tap it and make sure your sound is on.

After a few seconds, you'll hear a slight “popping” sound and the circle in the middle of the screen will turn into a flowing, sky-like blue and white animation. Start speaking. You should get a response quickly. Don't be surprised if the audio is a little choppy.

OpenAI said it has improved accents in some foreign languages and increased the speed of conversation. But if you don't like what you hear, you can ask ChatGPT to speak differently. For example, you can tell it to speak faster or to use a Southern accent.

With the advanced voice mode, you can have ChatGPT tell you a bedtime story, help you prepare for a job interview, or even help you practice a foreign language.

But even if you pay, you don't get unlimited access to the advanced voice mode. After using it for about half an hour on Tuesday, I saw “15 minutes left” at the bottom of the screen.

OpenAI did not immediately respond to a request for details on the time limit.

