ChatGPT’s highly anticipated Advanced Voice could arrive ‘next week’

OpenAI CEO and co-founder Sam Altman revealed on X (formerly Twitter) Thursday that the company’s Advanced Voice feature will begin rolling out “next week,” though only to a select few ChatGPT Plus subscribers.

The company plans to “start the alpha with a small group of users to gather feedback and expand based on what we learn.”

alpha rollout starts to plus subscribers next week!

— Sam Altman (@sama) July 25, 2024

Advanced Voice, which does away with the text prompt and lets users converse directly with the AI as they would with another human, was initially announced in May alongside the release of GPT-4o during the company’s Spring Update event. Unlike existing digital assistants such as Siri and Google Assistant, which only provide canned answers to user queries, ChatGPT’s Advanced Voice provides human-like responses, nearly latency-free, and in multiple languages.

The GPT-4o model is able to respond to audio inputs in 320 milliseconds on average, which is on par with how quickly humans respond in normal conversation. As you can see in the demo video below, the model can converse with multiple users simultaneously, improvise talking points and questions in both English and Portuguese, and convey them with human-ish emotions, including “laughter.”

Learning a new language with ChatGPT Advanced Voice Mode

There’s no word yet on how the company will choose participants for the alpha trial, aside from their being $20-per-month ChatGPT Plus subscribers. The alpha release was originally scheduled for June, though that date was pushed back “to reach our bar to launch,” improve the model’s ability to detect and reject prohibited forms of content, and buttress the company’s IT infrastructure to accommodate the anticipated increase in user load.

As the company announced in June, the feature’s full rollout won’t happen until at least this fall, and its exact timing will, again, depend on it “meeting our high safety and reliability bar.”

Giving ChatGPT the ability to converse naturally with its users is a huge advancement. Eliminating the need for a text prompt could reduce user hardware requirements and expand the potential integrations and use cases for AI (such as increasing access for users with body mobility or dexterity limitations).

It can also help speed the technology’s adoption by the public by lowering the barrier to entry for less tech-savvy users who are comfortable interacting with their computers via “hey Siri” but blanch at the prospect of prompt engineering.

Andrew Tarantola
Andrew has spent more than a decade reporting on emerging technologies ranging from robotics and machine learning to space…