Tau2 Telecom
text
+
+
+
+
About
TAU2-telecom is the telecommunications sector component of the τ²-Bench framework, testing conversational agents in telecom customer service scenarios. This specialized benchmark evaluates AI agents' ability to handle telecommunications-specific tasks including service plans, technical support, billing inquiries, and network issues while maintaining accuracy in tool usage and following telecom industry protocols and policies.
+
+
+
+
Evaluation Stats
Total Models9
Organizations4
Verified Results0
Self-Reported9
+
+
+
+
Benchmark Details
Max Score1
Language
en
+
+
+
+
Performance Overview
Score distribution and top performers
Score Distribution
9 models
Top Score
96.7%
Average Score
55.1%
High Performers (80%+)
2Top Organizations
#1Anthropic
1 model
83.0%
#2Moonshot AI
2 models
65.8%
#3OpenAI
3 models
59.5%
#4Alibaba Cloud / Qwen Team
3 models
34.2%
+
+
+
+
Leaderboard
9 models ranked by performance on Tau2 Telecom
| License | Links | ||||
|---|---|---|---|---|---|
| Aug 7, 2025 | Proprietary | 96.7% | |||
| Oct 15, 2025 | Proprietary | 83.0% | |||
| Jul 11, 2025 | MIT | 65.8% | |||
| Sep 5, 2025 | MIT | 65.8% | |||
| Apr 16, 2025 | Proprietary | 58.2% | |||
| Jul 25, 2025 | Apache 2.0 | 45.6% | |||
| Sep 10, 2025 | Apache 2.0 | 43.9% | |||
| Aug 6, 2024 | Proprietary | 23.5% | |||
| Sep 10, 2025 | Apache 2.0 | 13.2% |