ChatGPT Voice Mode Delayed Due to Safety Concerns

In May, OpenAI showcased an impressively realistic, near real-time “advanced voice mode” for its ChatGPT platform, announcing it would be available to paying users within weeks.

However, months later, OpenAI indicates that more time is needed.

On its official Discord server, OpenAI revealed that the rollout of the advanced Voice Mode, initially scheduled in alpha for a select group of ChatGPT Plus users in late June, has been delayed due to persistent issues, pushing the launch to sometime in July.

OpenAI explained, “For example, we’re improving the model’s ability to detect and refuse certain content. We’re also working on improving the user experience and preparing our infrastructure to scale to millions while maintaining real-time responses. As part of our iterative deployment strategy, we’ll start the alpha with a small group of users to gather feedback and expand based on what we learn.”

OpenAI has announced a potential delay for the launch of Advanced Voice Mode to all ChatGPT Plus customers, possibly extending into the fall. This hinges on meeting internal safety and reliability standards. However, the rollout of new video and screen sharing capabilities, showcased separately during OpenAI’s spring press event, remains unaffected.

These features allow ChatGPT to solve math problems from images and explain device settings menus. They are designed for seamless use across smartphones and desktop clients, including the macOS app, which is now available to all ChatGPT users as of today.

OpenAI writes, “ChatGPT’s advanced Voice Mode can understand and respond with emotions and nonverbal cues, moving us closer to real-time, natural conversations with AI. Our mission is to bring these new experiences to you thoughtfully.”

During the launch event, OpenAI showcased ChatGPT’s impressive responsiveness, demonstrating its ability to instantly solve math problems visible on paper through a researcher’s smartphone camera.

The debut of OpenAI’s advanced Voice Mode stirred controversy due to its default “Sky” voice, which bore a strong resemblance to actress Scarlett Johansson’s voice. Johansson responded by stating she had consulted legal counsel to investigate the voice’s development and had declined multiple requests from OpenAI to license her voice for ChatGPT.

OpenAI clarified that it had not used Johansson’s voice without permission or created a soundalike, subsequently removing the voice in question.