"OpenAI says GPT-5.2 Thinking beats or ties 'human professionals' on 70.9 percent of tasks in the GDPval benchmark (compared to 53.3 percent for Gemini 3 Pro). The company also claims the model ...