OpenAI's GPT-5.2: Technical Gains at the Expense of Language Skills

Sam Altman, OpenAI’s CEO, recently admitted that the company “screwed up” the language capabilities of its latest ChatGPT iteration, GPT-5.2, highlighting a potential plateau in large language model development and raising questions about whether AI models can excel across all domains simultaneously.

Altman Acknowledges GPT-5.2’s Writing Shortcomings

Speaking at a developer town hall, Altman candidly admitted that OpenAI prioritized technical capabilities over writing proficiency in GPT-5.2. “We did decide, and I think for good reason, to put most of our effort in 5.2 into making it super good at intelligence, reasoning, coding, engineering, that kind of thing,” he explained, adding that the company has “limited bandwidth” that sometimes leads them to “focus on one thing and neglect another.”

Altman promised that future versions would improve writing capabilities, stating, “We will make future versions of GPT 5.x hopefully much better at writing than 4.5 was.”

Signs of Regression in GPT-5.2

Data scientist Mehul Gupta identified several concerning regressions in GPT-5.2’s performance:

Flatter tone in responses
Decreased translation capabilities
Inconsistent behavior across different tasks
Major regression in “instant mode” performance
Difficulties handling real-world documents like contracts and PDFs

Gupta noted that while GPT-5.2 performs well on clean benchmarks, it “struggles with the noise of reality,” often forgetting details, contradicting itself, misreading references, and hallucinating information when dealing with complex real-world documents.

The Broader Implications

This situation raises a critical question for the AI industry: Can frontier AI models continue to excel at all tasks simultaneously, or will improvements in one area necessarily come at the expense of capabilities in others?

The release of GPT-5.2 heavily emphasized technical tasks like coding and spreadsheet formatting, with minimal mention of writing or creative capabilities. This shift has left many non-technical users feeling that ChatGPT’s development is hitting a plateau or even regressing in certain areas.

The Future Direction of LLMs

Altman’s admission suggests that OpenAI is aware of these tradeoffs and is working to address them in future iterations. However, it remains to be seen whether AI developers can overcome these apparent limitations and create models that truly advance across all domains simultaneously.

As LLMs continue to evolve, finding the right balance between technical prowess and language capabilities will likely remain a significant challenge for OpenAI and other AI developers.