- By Ashish Singh
- Wed, 31 Jul 2024 12:02 PM (IST)
- Source:JND
OpenAI, a Microsoft-backed artificial intelligence startup, announced on X on Tuesday that it is beginning to roll out an enhanced speech mode to a select set of ChatGPT Plus users. The company announced that it required more time to meet its launch requirement, therefore it postponed the release of the realistic voice conversation experience from late June to July.
The new audio features will enable users to interrupt ChatGPT while it is speaking and speak with it in real-time, both of which are essential components of realistic interactions that have proven difficult for AI assistants to achieve.
In June, OpenAI announced that it was striving to enhance the user experience and set up its infrastructure for future model scalability, all the while enhancing the model's capacity to identify and reject specific content.
The company has been attempting to launch new generative AI solutions in an effort to stay ahead of the rapidly adopting businesses in the burgeoning AI race.
By adding filters, OpenAI has been working to prevent copyright infringement issues in the meantime. It prevents the most recent GPT-4o model from producing audio that is copyrighted, such as voices, music, and more.
For those who don't know, OpenAI shocked users by responding in a tone similar to a person and generating a voice like Scarlett Johansson. After the actor filed a lawsuit against the business, OpenAI removed the voice from the demo.
OpenAI Working On More Advanced Reasoning Tech Code Named 'Strawberry', Says Report
The company has also introduced SearchGPT, an AI-powered search tool which makes OpenAI a direct competitor to Google. However, it is marked as a temporary prototype but can return answers from credible sources. It also has conversation capabilities with real-time responses.