Tau2 Telecom

About

TAU2-telecom is the telecommunications domain of the τ²-Bench framework, which tests conversational agents in telecom customer service scenarios. The benchmark evaluates how well AI agents handle telecom-specific tasks, including service plans, technical support, billing inquiries, and network issues, while using tools accurately and following telecom industry protocols and policies.

Evaluation Stats
Total Models: 9
Organizations: 4
Verified Results: 0
Self-Reported: 9
Benchmark Details
Max Score: 1
Language: en
Performance Overview
Score distribution and top performers

Score Distribution

9 models
Top Score: 96.7%
Average Score: 55.1%
High Performers (80%+): 2
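
The summary figures above can be reproduced from the nine leaderboard scores listed on this page. The following is a minimal sketch, assuming that "Average Score" is the arithmetic mean rounded to one decimal and that "80%+" includes scores of exactly 80; the function name `summarize` is hypothetical and not part of any τ²-Bench tooling.

```python
# Scores (in percent) copied from the leaderboard rows on this page.
scores = [96.7, 83.0, 65.8, 65.8, 58.2, 45.6, 43.9, 23.5, 13.2]


def summarize(scores, threshold=80.0):
    """Summarize a list of percentage scores.

    Assumptions (not confirmed by the source): the average is the
    arithmetic mean rounded to one decimal place, and the high-performer
    cutoff is inclusive.
    """
    return {
        "models": len(scores),
        "top": max(scores),
        "average": round(sum(scores) / len(scores), 1),
        "high_performers": sum(1 for s in scores if s >= threshold),
    }


summary = summarize(scores)
print(summary)
```

Under these assumptions the result matches the page: 9 models, a top score of 96.7, an average of 55.1, and 2 high performers.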

Top Organizations

#1 Anthropic | 1 model | 83.0%
#2 Moonshot AI | 2 models | 65.8%
#3 OpenAI | 3 models | 59.5%
#4 Alibaba Cloud / Qwen Team | 3 models | 34.2%
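
The organization percentages appear to be per-organization mean scores. The sketch below checks that reading. Note the assumption: the page does not show which leaderboard row belongs to which organization, so the mapping in `rows` is inferred from the license column and the model counts, and should be treated as hypothetical. The function name `rank_organizations` is likewise illustrative.

```python
from collections import defaultdict

# ASSUMPTION: the organization for each score is inferred, not stated on
# the page. The leaderboard rows list only date, license, and score.
rows = [
    ("OpenAI", 96.7),
    ("Anthropic", 83.0),
    ("Moonshot AI", 65.8),
    ("Moonshot AI", 65.8),
    ("OpenAI", 58.2),
    ("Alibaba Cloud / Qwen Team", 45.6),
    ("Alibaba Cloud / Qwen Team", 43.9),
    ("OpenAI", 23.5),
    ("Alibaba Cloud / Qwen Team", 13.2),
]


def rank_organizations(rows):
    """Group scores by organization and rank by mean score, descending."""
    by_org = defaultdict(list)
    for org, score in rows:
        by_org[org].append(score)
    ranked = [
        (org, len(vals), round(sum(vals) / len(vals), 1))
        for org, vals in by_org.items()
    ]
    ranked.sort(key=lambda r: r[2], reverse=True)
    return ranked


for position, (org, count, mean) in enumerate(rank_organizations(rows), start=1):
    print(f"#{position} {org} | {count} model(s) | {mean}%")
```

With the inferred mapping, the means come out to 83.0, 65.8, 59.5, and 34.2, which agrees with the ranking shown above.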
Leaderboard
9 models ranked by performance on Tau2 Telecom
Date | License | Score
Aug 7, 2025 | Proprietary | 96.7%
Oct 15, 2025 | Proprietary | 83.0%
Jul 11, 2025 | MIT | 65.8%
Sep 5, 2025 | MIT | 65.8%
Apr 16, 2025 | Proprietary | 58.2%
Jul 25, 2025 | Apache 2.0 | 45.6%
Sep 10, 2025 | Apache 2.0 | 43.9%
Aug 6, 2024 | Proprietary | 23.5%
Sep 10, 2025 | Apache 2.0 | 13.2%
Resources