Yao Shunyu’s Google Debut: New Gemini Model Shatters SOTA Records, Only 7 Humans Defending Carbon

Yao Shunyu’s Google Debut: New Gemini Model Shatters SOTA Records, Only 7 Humans Defending Carbon

Yao Shunyu’s Google Debut: New Gemini Model Shatters SOTA Records, Only 7 Humans Defending Carbon

https://eu.36kr.com/en/p/3681358416129668

Publish Date: 2026-02-13 03:34:00

Source Domain: eu.36kr.com

Facing the fierce attacks of Claude Opus 4.6 and GPT Codex 5.3, Google countered with a major upgrade of Gemini 3 Deep Think.

On Codeforces, a benchmark testing platform with various competitive programming challenges, it achieved an astonishing 3455 Elo score, equivalent to the 8th place in the world.

Now, only 7 people in the world have a higher programming level than it. The previous highest score was 2727 Elo, achieved by o3 a year ago.

The capabilities of Gemini 3 Deep Think go beyond that. It also set a record of 84.6% on ARC – AGI – 2, a leading benchmark recognized for testing AI reasoning ability.

It’s worth noting that the scores of previous top – performing models hovered between 60% and 70%, and Claude Opus 4.6 only scored 68.8%.

On the Humanity’s Last Exam (HLE), Gemini 3 Deep Think also refreshed the state – of – the – art (SOTA) and achieved a score of 48.4%.

Google officials said that the new version of Deep Think is a reasoning mode specially developed by Google to push the frontiers of intelligence and address modern challenges in science, research, and engineering.

Another “legend” – Shunyu Yao, a legendary winner of the special scholarship from the Department of Physics at Tsinghua University, joined Google DeepMind in September last year and is also involved in the development of this new Deep Think model.

The new version of DeepThink has entered the laboratory

How powerful is the upgraded Gemini 3 Deep Think?

Its ambition is not just to win benchmark tests, but to enter the fields of scientific research and engineering and help engineers handle complex tasks.

The new version of Deep Think can analyze sketches, model complex shapes, and directly generate solid files for 3D printing. Here is a laptop stand it printed:

Google VP Josh Woodward posted the printed result on X, and it looks quite true to the sketch:

Lisa Carbone, a mathematician at Rutgers University, used Gemini 3 Deep Think to review a highly specialized mathematical…

Source