OpenAI, the developer of ChatGPT, announced on Tuesday that it would postpone the release of its “Voice Mode” feature until July due to technical difficulties
The company had initially intended to introduce the authentic voice conversation experience to a limited number of ChatGPT Plus users in late June.
However, it has announced that it will postpone the rollout to ensure it meets the launch standard.
“For instance, we are enhancing the model’s capacity to identify and reject specific content.”
In a post on social media platform X, OpenAI stated that they are “working on improving the user experience and preparing our infrastructure to scale to millions while maintaining real-time responses.”
The company stated that the feature would initially be released to a limited number of users to collect feedback. It will be made available to all Plus users in the autumn, subject to safety and reliability checks.
Additionally, OpenAI is developing novel video and screen-sharing capabilities.
It recently announced the release of a new AI model, GPT-4o, in May.
This model can engage in realistic voice conversations and interact with text and images, making it the latest move in the company’s efforts to remain at the forefront of the burgeoning AI technology race.
The new audio capabilities will allow users to speak to ChatGPT and receive real-time responses without latency and interrupt ChatGPT while it is talking, both are essential components of realistic conversations that AI voice assistants have found difficult to replicate.