xAI, founded by Elon Musk, has announced a significant update for its Grok chatbot, enhancing it with real-time vision and multilingual voice capabilities.
This development positions Grok among leading AI platforms such as Google’s Gemini and OpenAI’s ChatGPT in the rapidly changing landscape of AI assistants.
Update Highlights
The newly introduced “Vision” feature enables iPhone users to simply point their camera at objects and receive immediate, context-sensitive feedback. Whether it’s recognizing a product or translating text, Grok utilizes the camera input to deliver comprehensive answers.
This is a tweet with a video. April 23, 2025
— ebbyamir (@ebbyamir) April 23, 2025
Moreover, Grok now facilitates real-time voice interactions in over 145 languages, including English, Spanish, French, Japanese, Chinese, Turkish, and Hindi. This multilingual functionality empowers users to engage with the AI in a more natural manner, eliminating language obstacles and enhancing overall accessibility.
Limitations
At the moment, these features are only accessible to iOS users. Android users can unlock them by subscribing to the SuperGrok plan, priced at $30 per month.
The advancements in Grok are indicative of a larger trend towards “agentic AI”—systems designed to perceive their surroundings, set goals, and make decisions with minimal human intervention. This evolution highlights a significant shift from reactive assistants to proactive collaborators in everyday tasks.
What’s New in Grok
- Memory: Grok can now retain information from past interactions, allowing for more tailored and cohesive conversations.
- App Builder: Users can create applications or documents directly within Grok, streamlining their workflows and enhancing productivity.
With these latest features, Grok is swiftly closing the gap and, in certain areas, matching its competitors like Gemini and ChatGPT.
As AI assistants increasingly become a part of our everyday lives, innovations such as real-time vision and multilingual support are establishing new benchmarks for user engagement and accessibility.