Gemini 2.5 Deep Think scores competitive coding gold in ‘profound leap’ for abstract problem-solving
After a mathematics win in July, Gemini 2.5 Deep Think has now achieved gold-medal-level performance in competitive coding.
DeepMind’s Gemini 2.5 Deep Think has achieved gold-medal level at the ICPC, solving 10 problems including one no human could ...
It is reported that DeepSeek-R1 is also the first mainstream large language model in the world to undergo peer review. Nature ...
OpenAI’s GPT-5 and DeepMind’s Gemini rival top student programmers at ICPC finals, solving unprecedented algorithmic ...
In this competition, an advanced version of “Gemini 2.5 Deep Think” participated remotely and solved 10 out of 12 ...
As artificial intelligence agents are given more power inside organisations, Exabeam’s chief AI officer, Steve Wilson, argues ...
On September 17, 2025, this research was published in the journal Nature under the title DeepSeek-R1 incentivizes reasoning ...
Wang, S. (2025) A Review of Agent Data Evaluation: Status, Challenges, and Future Prospects as of 2025. Journal of Software ...