Active Stocks
Fri May 24 2024 15:59:27
  1. Tata Steel share price
  2. 174.80 -0.37%
  1. NTPC share price
  2. 374.85 0.68%
  1. State Bank Of India share price
  2. 828.60 -0.45%
  1. ITC share price
  2. 436.10 -1.16%
  1. Power Grid Corporation Of India share price
  2. 318.50 -0.39%
Business News/ Ai / xAI introduces Grok 1.5 Vision: Know how it competes with GPT-4 and Gemini 1.5 Pro
BackBack

xAI introduces Grok 1.5 Vision: Know how it competes with GPT-4 and Gemini 1.5 Pro

Elon Musk's xAI unveils Grok 1.5 Vision with computer vision abilities, enhancing its interaction with real-world objects. Benchmark tests show its superior performance in RealWorldQA but lower in other evaluations compared to OpenAI's GPT-4 with Vision and Google's Gemini 1.5 Pro.

Elon Musk’s xAI to launches improved version of Grok chatbot.Premium
Elon Musk’s xAI to launches improved version of Grok chatbot.

xAI, an artificial intelligence venture by Elon Musk, has unveiled an improved version of its Grok 1.5 model called Grok 1.5 Vision. This upgraded model now includes computer vision abilities, enabling it to understand and answer questions about images. This update comes shortly after OpenAI introduced its GPT-4 model, which also features computer vision capabilities.

The announcement of this enhancement was made through xAI's official X account (formerly Twitter), where they shared details about the model's new features in a blog post. While the fundamental features of Grok 1.5 remain unchanged in this updated version, the addition of vision capabilities is expected to expand its capabilities in interacting with the real world.

xAI conducted benchmark tests to evaluate Grok 1.5 Vision's performance across various metrics, including their proprietary RealWorldQA benchmark, which assesses the model's understanding of real-world spatial concepts. Additionally, the model underwent assessments in other tests like MMMU and ChartQA. Notably, in the RealWorldQA test, Grok outperformed OpenAI's GPT-4 with Vision and Google's Gemini 1.5 Pro, although it showed lower performance in other evaluations.

Computer vision is an exciting area of computer science that focuses on enabling computers, including AI models, to identify and understand real-world objects through images and videos. Its goal is to give machines vision capabilities similar to humans.

Major tech companies are heavily investing in developing AI models with vision capabilities. Google's Gemini 1.5 Pro and OpenAI's GPT-4 with Vision are prominent competitors in this field.

The potential applications of computer vision are extensive and transformative. For example, Healthify, an Indian platform for calorie tracking and nutrition, recently introduced a feature called 'Snap'. Users can take photos of food items, and the AI suggests healthier recipe adjustments and exercise plans to balance calorie intake. Computer vision also holds promise for medical diagnosis, autonomous vehicles, and more.

 

 

You are on Mint! India's #1 news destination (Source: Press Gazette). To learn more about our business coverage and market insights Click Here!

Catch all the Business News, Market News, Breaking News Events and Latest News Updates on Live Mint. Download The Mint News App to get Daily Market Updates.
More Less
Published: 15 Apr 2024, 08:11 PM IST
Next Story footLogo
Recommended For You