After weeks of teasing its new frontier model, Google finally launched Gemini 3 on Tuesday with claims of being the new state of the art tool in the AI world. Google’s Gemini 2.5 Pro model had earlier been widely believed to be the top of the line AI model for most workflows but Elon Musk’s Grok AI briefly overtook the tool in some benchmarks but Gemini seems to be once again reaching the top of the AI food chain.
How does Gemini 3 compare against other top models?
As per the benchmarks shared by Google, the new Gemini 3 Pro model not only overtakes Gemini 2.5 Pro but also widens the gap with other rivals like ChatGPT and Claude.
On the popular LMArena leaderboard, Gemini 3 Pro is the new top model with a score of 1501 for text related tasks, surpassing the Grok 4.1-Thinking and Grok 4.1 model. Gemini 3 Pro also dethroned GPT-5 in WebDev leaderboard. LMArena says that Gemini 3 Pro is now the number 1 model in Coding, Math, Creative Writing, Long Queries in nearly all of its leaderboards.
On Humanity’s Last Exam, a benchmark specifically designed to test academic reasoning, Gemini 3 Pro achieved a score of 37.5%, placing it well ahead of GPT-5.1 which sat at the number 2 spot with 26.5%, and Claude Sonnet 4.5 which trailed at 13.7%
Gemini 3 Pro also showed remarkable performance on MathArena Apex, a benchmark consisting of challenging math contest problems. While Gemini 2.5 Pro, Claude Sonnet 4.5, and GPT-5.1 all scored in the low single digits (between 0.5% and 1.6%, Gemini 3 Pro secured the number 1 spot with a score of 23.4%.
The new model also showed improvements in screen understanding and agentic workflows. On ScreenSpot-Pro, a benchmark designed to evaluate a model’s ability to understand computer screens, Gemini 3 Pro achieved a score of 72.7%, showing complete dominance against Claude Sonnett 4.5 and GPT-5.1 which had scores of 36.2% and 3.5% respectively.
Gemini 3 Pro still failed to take the lead in coding related tasks in some benchmarks. For instance on SWE-Bench Verified, Claude Sonnet 4.5 managed to hold the number 1 spot with 77.2%, while Gemini 3 Pro came on the third spot scored 76.2%, and GPT-5.1 took the second spot with 76.3% score.
With AI companies releasing new models at even shorter intervals, its unlikely that Gemini 3 Pro will remain the category leader for long but for now the new model does have the lead in most of the benchmarks. However, do note that benchmarks may not always reveal the full picture about the AI model since many companies could be gaming these benchmarks to get their model listed on higher spots and the actual test of the model can only be through user experience.
gemini ai, gemini, google gemini, gemini 3 pro, gemini pro, gemini pro 3, gemini 3 google, google 3, gemini ai 3, please unblock challenges.cloudflare.com to proceed., gemini 3 price, gemini antigravity, antigravity, gemini 3 antigravity, gemini 3 release
#Googles #Gemini #Pro #fare #ChatGPT #Grok #Claude

