- Home
- /
- Benchmarks
- /
- Finance Agent
Finance Agent
Finance
+
+
+
+
About
Finance Agent benchmarks LLMs on realistic financial analysis tasks including reading SEC filings, earnings analysis, financial modeling, and quantitative research queries.
+
+
+
+
Evaluation Stats
Total Models11
Organizations5
Verified Results0
Self-Reported0
+
+
+
+
Benchmark Details
Max Score100
+
+
+
+
Performance Overview
Score distribution and top performers
Score Distribution
11 models
Top Score
63.3%
Average Score
56.1%
High Performers (80%+)
0Top Organizations
#1Anthropic
4 models
59.2%
#2OpenAI
3 models
55.5%
#3Google DeepMind
1 model
55.2%
#4Zhipu AI
1 model
53.2%
#5xAI
2 models
53.0%
+
+
+
+
Leaderboard
11 models ranked by performance on Finance Agent
| License | Links | ||||
|---|---|---|---|---|---|
| Feb 17, 2026 | Proprietary | 63.3% | |||
| Feb 5, 2026 | Proprietary | 60.1% | |||
| Dec 11, 2025 | Proprietary | 59.0% | |||
| Nov 24, 2025 | Proprietary | 58.8% | |||
| Nov 12, 2025 | Proprietary | 55.3% | |||
| Nov 18, 2025 | Proprietary | 55.2% | |||
| Sep 29, 2025 | Proprietary | 54.5% | |||
| Jul 9, 2025 | Proprietary | 53.5% | |||
#09GLM 5 New | Feb 11, 2026 | MIT | 53.2% | ||
| Nov 17, 2025 | Proprietary | 52.5% |
Showing 1 to 10 of 11 models
+
+
+
+
Additional Metrics
Extended metrics for top models on Finance Agent
| Model | Score | Cost/Test | Latency |
|---|---|---|---|
| Claude Opus 4.6 | 60.1 | $1.11 | 289.73s |
| GPT-5.2 | 59.0 | $0.98 | 587.16s |
| Claude Opus 4.5 | 58.8 | $1.5 | 181.87s |
| GPT-5.1 | 55.3 | $0.47 | 578.06s |
| Gemini 3 Pro | 55.2 | $0.56 | 183.62s |
| Claude Sonnet 4.5 | 54.5 | $1.1 | 202.07s |
| Grok 4 | 53.5 | $1.07 | 321.04s |
| GLM 5 | 53.2 | $0.5 | 564.09s |
| Grok 4.1 Fast | 52.5 | $0.06 | 90.58s |
| GPT-5 | 52.1 | $0.59 | 926.61s |