Thursday, 1 May 2025

Elon Musk’s AI company, xAI, enhances Grok with multimodal inputs

xAI, Elon Musk's AI company, adds image capabilities to Grok, offering enhanced features for users and closing the gap with competitors.

Public developer documents show that xAI, Elon Musk’s artificial intelligence (AI) company, is working on adding multimodal inputs to its Grok chatbot. If the feature ships, users will be able to upload images to Grok and receive text-based responses.

A recent xAI blog post teased that the upcoming Grok-1.5V version will introduce “multimodal models across various domains.” The latest updates to the developer documents suggest that work on the new model is progressing.

The developer documents include a sample Python script showing how developers can use the xAI software development kit (SDK) library to generate responses from both text and images. The script reads an image file, sets up a text prompt, and passes both to the xAI SDK to generate a response.
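The article does not reproduce the sample script, so the following is only a rough sketch of the general pattern it describes: read an image file, pair it with a text prompt, and hand both to a model. The names build_multimodal_request and send_to_model, and the payload field names, are hypothetical illustrations and are not taken from the xAI SDK; send_to_model is a local stand-in that contacts no service.

```python
import base64
from pathlib import Path


def build_multimodal_request(image_path: str, prompt: str) -> dict:
    """Bundle a text prompt and an image into one JSON-serialisable payload.

    Hypothetical helper: the field names are assumptions, not the xAI SDK's.
    """
    image_bytes = Path(image_path).read_bytes()
    return {
        "prompt": prompt,
        # Base64 keeps the binary image data safe inside a text-based body.
        "image_base64": base64.b64encode(image_bytes).decode("ascii"),
    }


def send_to_model(payload: dict) -> str:
    """Hypothetical stand-in for an SDK call; it does not contact any service."""
    size = len(base64.b64decode(payload["image_base64"]))
    return f"[placeholder reply to {payload['prompt']!r} with a {size}-byte image]"
```

In a real integration, the stand-in function would be replaced by whatever call the published SDK documentation specifies.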

Enhancements for Grok users

Grok, which xAI first launched in November 2023, is available to subscribers of the X Premium Plus service. The most recent update, Grok-1.5, arrived in March 2024 and introduced enhanced reasoning capabilities.

The model was trained on a variety of text data from publicly available sources up to Q3 2023, along with datasets reviewed by human evaluators. While Grok-1 was not trained on xAI data, it has real-time knowledge of the world, including information from X posts.

Founded by Elon Musk in March 2023, xAI is a newcomer to the AI industry and trails competitors such as OpenAI, the maker of ChatGPT. However, xAI’s blog post reports that its Grok-1.5 model is narrowing the gap with GPT-4 on several benchmarks that cover academic problems ranging from grade school to high school level.

Challenges in benchmarking large language models

Benchmarking large language models can be contentious. A model may score well on a benchmark because the test data was part of its training set, which is akin to memorising answers rather than understanding the content. Despite these challenges, xAI is making significant strides with Grok’s development.
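One simple way to illustrate the contamination concern is to measure how many word sequences from a benchmark question also appear verbatim in a body of training text. The sketch below is a generic n-gram overlap check, not a method attributed to xAI or to any specific benchmark; the function names and the default window of five words are assumptions chosen for illustration.

```python
def ngrams(text: str, n: int) -> set:
    """Return the set of n-word sequences in the text, ignoring case."""
    words = text.lower().split()
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def overlap_ratio(benchmark_item: str, training_text: str, n: int = 5) -> float:
    """Fraction of the benchmark item's n-grams that also occur in the training text."""
    item_grams = ngrams(benchmark_item, n)
    if not item_grams:
        # The item is shorter than n words, so there is nothing to compare.
        return 0.0
    return len(item_grams & ngrams(training_text, n)) / len(item_grams)
```

A ratio close to 1.0 suggests the question may have been seen during training, while a ratio near 0.0 indicates no verbatim overlap at that window size. A check like this only catches exact copies and would miss paraphrased or translated versions of the same question.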

AI is moving towards multimodal conversational chatbots, with notable advancements such as those announced at Google I/O and OpenAI’s release of GPT-4o. Grok’s multimodal capabilities are a step towards keeping pace with these industry trends and improving the user experience.
