By Anindita Nayak
Bhubaneswar, May 15, 2024
OpenAI recently unveiled GPT-4o, its latest flagship AI model, during a livestream event led by Chief Technology Officer Mira Murati. Described as significantly faster than its predecessor, with enhanced capabilities across text, vision, and audio, GPT-4o marks a milestone in AI advancement. Notably, OpenAI aims to democratize access to this technology by making it freely available to all users. GPT-4o is being rolled out across OpenAI's products, from ChatGPT to the API, opening the door to new possibilities in human-machine communication and beyond.
Here are some of its key features:
GPT-4o is Omni-Modal:
The “o” in GPT-4o stands for “omni”, signifying the model’s ability to accept inputs in text, audio, and image formats and to generate outputs in the same formats. This allows for more intuitive interactions, since the model can understand text, speech, and visual inputs together. OpenAI reports substantial improvements over the predecessor model, including faster response times and enhanced capabilities across various tasks. Notably, it will support 50 languages, including Indian languages, broadening its accessibility and impact on a global scale.
Revolutionizing Multi-modal Interactions:
OpenAI’s CEO, Sam Altman, heralds GPT-4o as a groundbreaking advancement in AI, describing it as natively multimodal. The updated model can understand and produce content in voice, text, or image formats, and its human-like verbal responses make real-time interaction possible. According to OpenAI, GPT-4o can respond to audio inputs in as little as 232 milliseconds, with an average of 320 milliseconds. Moreover, as of May 13, 2024, ChatGPT Plus users can send up to 80 messages every three hours with GPT-4o and up to 40 messages every three hours with GPT-4. It’s worth noting that unused messages do not carry over to the next cycle. Additionally, within ChatGPT Team workspaces, message caps for both GPT-4 and GPT-4o are higher than those of ChatGPT Plus accounts.
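For readers curious how such a cap behaves in practice, the three-hour cycle can be pictured as a simple counter that resets when a new cycle begins. The Python sketch below is purely illustrative: it assumes a fixed window, and the names `MessageCap`, `PLUS_CAPS`, and `try_send` are hypothetical. It is not a description of how OpenAI actually enforces its limits.

```python
from dataclasses import dataclass

WINDOW_SECONDS = 3 * 60 * 60  # three-hour cycle, as reported in the article

# Per-model caps for ChatGPT Plus, as reported in the article.
PLUS_CAPS = {"gpt-4o": 80, "gpt-4": 40}


@dataclass
class MessageCap:
    """Hypothetical fixed-window counter; not OpenAI's implementation."""

    limit: int
    window_start: float = 0.0
    used: int = 0

    def try_send(self, now: float) -> bool:
        # When a new cycle starts, the count resets to zero, so any
        # unused messages from the previous cycle are simply lost.
        if now - self.window_start >= WINDOW_SECONDS:
            self.window_start = now
            self.used = 0
        if self.used >= self.limit:
            return False  # cap reached for the current cycle
        self.used += 1
        return True

    def remaining(self) -> int:
        # Messages left in the cycle that was active at the last send.
        return self.limit - self.used
```

Under these assumptions, a Plus user on GPT-4 who has sent 40 messages would be refused until three hours after the cycle began, at which point the full allowance of 40 is available again regardless of how many messages went unused earlier.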
Introduces Free Access with Enhanced Features:
OpenAI is offering GPT-4o free of charge to all users, marking a significant shift towards democratizing access to advanced AI capabilities. The update brings features previously exclusive to paid subscribers, including web searches, multi-voice interactions, and data storage, to everyone. Plus users will enjoy message limits up to five times higher than those of free users, and the model’s rollout will occur gradually over the following weeks. Although free users face limits on their interactions, OpenAI’s move represents a major step towards fostering widespread AI utilization and innovation.
Conversational Mode:
GPT-4o impresses with its ability to engage in lifelike conversations, making interactions with the AI more human-like and immersive. OpenAI showcased its real-time conversational prowess at the event, demonstrating capabilities such as interrupting the AI mid-speech, requesting tone changes, and eliciting responses to user emotions. While the Talkback feature in Voice Mode is already present in both free and paid tiers of ChatGPT, OpenAI emphasizes significant enhancements with the new GPT-4o model. Because GPT-4o is trained end-to-end across text, vision, and audio, all inputs and outputs are processed by the same neural network, which reduces latency and improves conversational quality.
Enhances Conversational Depth with Memory Function:
With GPT-4o, ChatGPT offers a memory function that retains past interactions within a session, enabling more contextual and personalized responses. For instance, if a user mentions working on a specific topic, ChatGPT can tailor its suggestions or information retrieval accordingly throughout the conversation. The feature is available to free users, who can also ask ChatGPT to remember information for future conversations. Additionally, free-tier users gain access to the GPT Store, where they can browse and use custom bots, although they cannot create and share custom bots themselves. This expansion of access reflects OpenAI’s commitment to democratizing advanced AI capabilities and fostering deeper, more personalized interactions for all users.
Voice Mode Exclusive to Paid Subscribers:
The new Voice Mode with GPT-4o will remain exclusive to paid-tier subscribers. Rollout to ChatGPT Plus subscribers is underway, with availability for Team and enterprise users planned for the near future. Paid subscribers, including ChatGPT Plus and Team users, are also getting GPT-4o itself with fewer limitations, and enterprise availability is forthcoming. Plus users will enjoy message limits up to five times greater than those of free users, while Team and enterprise users will benefit from even higher limits, as per the company’s announcement.
Market Response:
OpenAI’s product launch came just ahead of Google’s annual I/O developer conference, where AI, including the Gemini chatbot and Search Generative Experience, was anticipated to be a central theme. Observers within the tech industry speculate that GPT-4o could affect various sectors, including translation, tutoring, customer support, image generation, and interview preparation.