ChatGPT Plus Members To Benefit From GPT-4o Enhanced Voice Mode
OpenAI will begin rolling out Voice Mode for its GPT-4o model to ChatGPT Plus members starting next week. OpenAI CEO Sam Altman confirmed on X (formerly Twitter) that the feature will arrive as a limited "alpha" release for Plus subscribers.
When OpenAI launched its new flagship AI model, GPT-4o, in May, it highlighted significant enhancements to ChatGPT's voice conversation feature. Voice Mode is already available in both the free and paid versions of ChatGPT, but the existing version is quite limited.
The current version of Voice Mode in ChatGPT has an average latency of 2.8 seconds with GPT-3.5 and 5.4 seconds with GPT-4. The delay stems from a pipeline of three separate models: one transcribes audio to text, GPT-3.5 or GPT-4 takes that text and produces a text reply, and a third converts the reply back to audio. According to OpenAI, this design means that the main source of intelligence, GPT-4, loses a lot of information, because it only ever sees the transcribed text rather than the original audio.
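The three-stage pipeline described above can be sketched in a few lines of Python. This is only an illustrative toy, not OpenAI's implementation: the `AudioClip` type, the function names, and the per-stage latencies are all hypothetical placeholders. It shows two things: the stage delays add up, and anything not captured in the transcript (here, a `tone` field) never reaches the language model.

```python
from dataclasses import dataclass


@dataclass
class AudioClip:
    words: str  # what was said
    tone: str   # how it was said (hypothetical field, for illustration only)


@dataclass
class StageResult:
    output: object
    seconds: float  # placeholder latency, NOT a measured value


def transcribe(clip: AudioClip) -> StageResult:
    # Stage 1 (speech-to-text): only the words survive; the tone is dropped.
    return StageResult(clip.words, 1.0)


def generate_reply(text: str) -> StageResult:
    # Stage 2: stand-in for a text-only language model.
    return StageResult(f"You said: {text}", 2.0)


def synthesize(text: str) -> StageResult:
    # Stage 3 (text-to-speech): it has no access to the user's original tone.
    return StageResult(AudioClip(words=text, tone="neutral"), 1.0)


def cascaded_voice_turn(clip: AudioClip) -> tuple[AudioClip, float]:
    """Run one voice turn through the three stages in sequence."""
    stt = transcribe(clip)
    llm = generate_reply(stt.output)
    tts = synthesize(llm.output)
    # The stages run one after another, so their latencies add up.
    total_seconds = stt.seconds + llm.seconds + tts.seconds
    return tts.output, total_seconds
```

Calling `cascaded_voice_turn(AudioClip("hello", "excited"))` returns a reply clip whose tone is always "neutral", whatever tone the input had, along with the summed placeholder latency of the three stages.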
GPT-4o, by contrast, is trained end-to-end across text, vision, and audio, so all inputs and outputs are processed by the same neural network. This integration reduces latency for a more natural conversational experience, and it improves results because no information is lost between separate models.
Additionally, OpenAI stated that GPT-4o is better equipped to handle interruptions and manage group conversations effectively. It also filters out background noise and adapts to different tones more efficiently.
Together, these changes are intended to make conversations in ChatGPT's Voice Mode smoother and more responsive.
The upcoming alpha release of Voice Mode for GPT-4o will allow Plus members to test these improvements firsthand. As OpenAI continues to refine its AI models, users can expect further enhancements in future updates.
This development marks another step forward in OpenAI's efforts to improve conversational AI technology. The company remains committed to pushing the boundaries of what AI can achieve in natural language processing and user interaction.
