Google has added Agentic Vision functionality to its Gemini 3 Flash model. The feature combines visual reasoning with code execution to ground answers in visual evidence, which Google says fundamentally changes the way AI models process images.
Introduced on January 27, Agentic Vision is available through the Gemini API in the Google AI Studio and Vertex AI developer tools, as well as in the Gemini app.
Gemini Flash’s Agentic Vision transforms image understanding from a static act into an agentic process, Google says. Combining visual reasoning with code execution, the model devises a plan to incrementally zoom into, inspect, and manipulate an image. Until now, multimodal models typically processed the world through a single, static line of sight: if a small detail such as a serial number or a distant sign was unreadable, the model had to guess. Agentic Vision instead turns image understanding into active exploration, bringing the agent’s “think-act-observe” loop to image understanding tasks, the company said.
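
To make the loop concrete, here is a minimal sketch of calling the model through the Gemini API using the google-genai Python SDK. The model identifier, image file, and prompt are illustrative assumptions, not values from Google’s announcement; the key point is that enabling the code-execution tool is what allows the model to write and run code against the image rather than answer from a single static pass.

```python
# Minimal sketch, assuming the google-genai Python SDK.
# The model name "gemini-3-flash-preview" is an assumption; substitute
# the identifier Google publishes for Gemini 3 Flash.
from google import genai
from google.genai import types

client = genai.Client(api_key="YOUR_API_KEY")

# Hypothetical input image containing a small, hard-to-read detail.
with open("warehouse_shelf.jpg", "rb") as f:
    image_bytes = f.read()

# Enabling the code-execution tool lets the model crop and zoom into
# the image programmatically as part of its think-act-observe loop.
response = client.models.generate_content(
    model="gemini-3-flash-preview",  # assumed model identifier
    contents=[
        types.Part.from_bytes(data=image_bytes, mime_type="image/jpeg"),
        "Read the serial number printed on the smallest label.",
    ],
    config=types.GenerateContentConfig(
        tools=[types.Tool(code_execution=types.ToolCodeExecution())],
    ),
)

print(response.text)
```

In this setup, the model can respond not from one fixed view of the photo but by iteratively generating and executing code to enlarge the relevant region, observing the result, and refining its answer.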
