OpenAI Launches ChatGPT-5.2: Check Latest Tools, Capabilities, Performance And Upgrades

By Samira Vishwas On Dec 12, 2025

OpenAI has introduced ChatGPT-5.2 with major improvements across several key areas. The new version performs better at creating spreadsheets, building presentations, writing code, analyzing images, and managing long or complex tasks. It can also handle multi-step projects more efficiently with advanced tool-use abilities.

According to OpenAI, GPT-5.2 has recorded strong performance across several benchmark tests. On the GDPval assessment, the model scored higher than human professionals in well-defined knowledge-based tasks covering 44 occupations.

The GPT-5.2 Thinking model achieved a new state-of-the-art score on GDPval, which measures performance on well-defined knowledge work tasks across 44 occupations. According to OpenAI, the model beats or matches top industry professionals in 70.9% of comparisons.

These tasks include creating presentations, building spreadsheets, and producing other work-related outputs. The model also completed tasks more than 11 times faster and at less than 1% of the cost of human experts, based on historical data.

Coding Upgrades

GPT-5.2 also shows strong improvements in software development. On SWE-Bench Pro, a challenging evaluation covering four programming languages, GPT-5.2 Thinking achieved 55.6%, setting a new state-of-the-art result. The benchmark is considered more robust than earlier tests, making the achievement notable for real-world coding scenarios.

In feedback from companies such as Cognition, Warp, Charlie Labs, JetBrains, and Augment Code, GPT-5.2 demonstrated advanced agentic coding ability, with improvements in interactive coding, code reviews, and bug detection.

Better Accuracy and Fewer Hallucinations

OpenAI claims that GPT-5.2 Thinking hallucinates significantly less than its predecessor. When tested on de-identified ChatGPT queries, incorrect responses were 30% less common, marking an improvement in factual reliability.

Long-Context Understanding

The model also sets new performance records in long-context reasoning. It leads scores on OpenAI MRCRv2, which measures how well a model can understand and connect information placed across long documents.

Companies such as Zoom, Databricks, Hex, and Triple Whale also observed that GPT-5.2 handles long-horizon reasoning and tool-calling tasks with state-of-the-art capability.