Elon Musk’s AI firm, xAI, has added a new image understanding function to its AI model, Grok on Monday.
An xAI staffer and the official @grok handle confirmed the development on X former twitter..
The in-house AI chatbot now has picture-understanding capabilities, enabling it to process and analyze visual content. Users can now upload images and ask the AI questions based on them.
Image understanding, also known as computer vision, enables an AI system to see and comprehend visual data contained within an image or video. Currently, this feature is only accessible for static photos.
In a different post, Musk stated that Grok can use the new visual understanding feature to convey the meaning of a joke. He said that the functionality is in its early phases, implying that it will “rapidly improve”.
However, computer vision is not a novel function for AI systems; practically every major AI model, including Gemini, ChatGPT, Copilot, Claude, and others, supports it.
An X user brought this up, expressing worry that Grok still lacks many basic functionality.
xAI’s latest addition builds on the August Grok-2 update, which included image production capabilities powered by Black Forest Labs’ FLUX.1 model.
Founded in March 2023, xAI intends to compete directly with existing competitors such as OpenAI and Anthropic.
The initial Grok model was released in November 2023, and it outperformed GPT-3.5 but fell short of GPT-4’s performance.
Grok may soon understand papers, according to Musk’s response to a user who faulted the model for being unable to handle certain file formats (such as PDFs).
“Not for long,” Musk said, saying that “we are getting done in months what everyone else took years.”
To make the product more appealing, the social network has been working to add more capabilities to both the AI chatbot and the paid user tiers on X.
We earlier reported that bilingual tutors, especially those who speak Hindi well, are being sought after by Elon Musk’s artificial intelligence startup, xAI, to assist in the creation of AI systems that will help humanity in its quest for knowledge and comprehend the cosmos.