Google DeepMind shows how AI can think deeper

Google DeepMind, a UK-based research company, recently published new research exploring techniques to better leverage inference time calculations.

Google DeepMind’s technology, Mind Evolution, uses language models to generate a variety of candidate solutions, which are then recombined and refined based on feedback from evaluators.

Unlike sequential inference approaches such as self-refinement and tree search, which require evaluation of individual inference steps, the authors claim that Mind Evolution performs a global refinement of a complete solution.

The TravelPlanner benchmark evaluates a model’s ability to organize travel plans “for users who express their preferences and constraints.” Across various levels of travel planning difficulty, the Mind Evolution technique outperformed other techniques.

The meeting planning task evaluates the model for its ability to schedule meetings based on constraints such as number of people in the meeting, availability, location, and travel time.

The authors also suggested that this task is different from that of TravelPlanner because not all meetings can be scheduled due to conflicting constraints such as availability and location.

The presented results show that Mind Evolution performs better than the baseline strategy, achieving a success rate of 85.0% on the validation set and 83.8% on the test set. In particular, our two-step approach using Gemini 1.5 Pro achieved success rates of 98.4% and 98.2% in validation and testing, respectively.

That said, the authors also note that Mind Evolution’s “key limitations” are that it primarily focuses on natural language planning problems, and that “proposed solutions can be evaluated and critiqued programmatically.” I also admit that.

Calculating inference time is a widely used concept in large language models, especially OpenAI’s o1 inference model. This technique is considered an effective way to solve scaling problems in large language models.

A few days ago, Google DeepMind also published a study introducing inference time scaling for diffusion models.

The study, titled “Inference Time Scaling of Diffusion Models Beyond Scaling of the Denoising Step,” investigated the impact of providing additional computing resources to image generation models as they produce results. I’m doing it.

Last December, Google announced the Gemini 2.0 Flash Thinking model. This model provides advanced reasoning capabilities and illustrates its thinking. Logan Kilpatrick, Google’s senior product manager, said the model “unleashes more powerful reasoning power to illustrate that thinking.”

Source link

What's Hot

Express Investigation: One nation, a few parivars | Express Investigations News

Security challenges and diplomatic efforts

Elon Musk says AI will take over all jobs and humans will be free to grow vegetables

Google DeepMind shows how AI can think deeper

Elon Musk says AI will take over all jobs and humans will be free to grow vegetables

New study finds AI assistants make widespread errors when it comes to news

ChatGPT is becoming erotic, but can OpenAI really stay adults-only?

20 Most Anticipated Sex Movies of 2025

President Trump’s SEC nominee Paul Atkins marries multi-billion dollar roof fortune

How to tell the difference between fake and genuine Adidas Sambas

Alice Munro’s Passive Voice | New Yorker

Elon Musk says AI will take over all jobs and humans will be free to grow vegetables

New study finds AI assistants make widespread errors when it comes to news

ChatGPT is becoming erotic, but can OpenAI really stay adults-only?

Meta AI’s app downloads and daily users skyrocket after launch of ‘Vibes’ AI video feed

Our Picks

Express Investigation: One nation, a few parivars | Express Investigations News

Security challenges and diplomatic efforts

Elon Musk says AI will take over all jobs and humans will be free to grow vegetables

Most Popular

10 things you should never say to an AI chatbot

Character.AI faces lawsuit over child safety concerns

Analyst warns Salesforce investors about AI agent optimism

Subscribe to Updates

What's Hot

Google DeepMind shows how AI can think deeper

Related Posts