Tongyi DeepResearch
Tongyi DeepResearch is the first fully open-source Web Agent to achieve performance.
Tongyi DeepResearch is the first fully open-source Web Agent to achieve performance on par with OpenAI’s DeepResearch across a comprehensive suite of benchmarks. It is designed for researchers and developers looking to create autonomous agents that can perform complex information-seeking tasks. Tongyi DeepResearch demonstrates state-of-the-art results, scoring 32.9 on the academic reasoning task Humanity’s Last Exam (HLE), 43.4 on BrowseComp, and 46.7 on BrowseComp-ZH.
Tongyi DeepResearch works by utilizing a novel data synthesis solution applied across the entire training pipeline, from Agentic Continual Pre-training (CPT) and Supervised Fine-Tuning (SFT) for cold-starting, to the final Reinforcement Learning (RL) stage. The model is trained using a customized on-policy Group Relative Policy Optimization (GRPO) algorithm, which ensures that the learning signal is always relevant to the model’s current capabilities. The training objective is optimized using a token-level policy gradient loss, and the model is evaluated on a range of benchmarks, including Humanity’s Last Exam (HLE), BrowseComp, and xbench-DeepSearch.
Tongyi DeepResearch is most valuable for researchers and developers who need to create autonomous agents that can perform complex information-seeking tasks. The model’s ability to synthesize web-based QA data through a novel pipeline and its capacity to systematically outperform existing proprietary and open-source Deep Research agents make it an ideal choice for those looking to create advanced agents. The model’s open-source nature also makes it an attractive option for those who want to customize and extend the model to suit their specific needs.
| Tool | Pricing | Upvotes | Rating |
|---|---|---|---|
Read AI |
Freemium | ▲ 112 | ★ 3.7 |
BigIdeasDB |
Freemium | ▲ 315 | ★ 3.5 |
Juice AI |
Freemium | ▲ 280 | ★ 4.1 |
Read AI
BigIdeasDB
Juice AI