Comprehensive side-by-side LLM comparison
GPT-5.1 leads with a 19.9% higher average benchmark score, measured on the one benchmark the two models have in common. Overall, GPT-5.1 is the stronger choice for coding tasks.
OpenAI
GPT-5.1, released by OpenAI in November 2025, is a large language model from the GPT-5 family that delivers incremental improvements in reasoning, instruction following, and multimodal understanding over GPT-5. It features a 400K token context window and targets general-purpose development, long-context analysis, and agentic workflows.
Amazon
Amazon Nova 2 Lite, released by Amazon Web Services on December 2, 2025, is a fast, cost-efficient reasoning model available on Amazon Bedrock. Its 1M token context window enables extended document, video, and image analysis. It features three extended thinking intensity levels, a built-in code interpreter, web grounding tools, and native support for text, image, video, and document input. Nova 2 Lite targets cost-sensitive agentic applications, document analysis pipelines, and real-time workloads requiring multimodal reasoning.
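As a rough illustration of what the two context windows mean in practice, the sketch below checks which model could hold a request of a given size. It assumes "400K" and "1M" are round decimal token counts, and that the prompt and any reserved output share the same window; both are assumptions, not documented behavior. The function name `models_that_fit` and the `reserved_output_tokens` parameter are hypothetical.

```python
# Context window sizes taken from the model descriptions above.
# Assumption: "400K" and "1M" are read as round decimal token counts.
CONTEXT_WINDOW_TOKENS = {
    "GPT-5.1": 400_000,
    "Nova 2 Lite": 1_000_000,
}


def models_that_fit(prompt_tokens: int, reserved_output_tokens: int = 0) -> list[str]:
    """Return the models whose context window can hold the whole request.

    Assumption: prompt and reserved output tokens count against one shared window.
    """
    if prompt_tokens < 0 or reserved_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    needed = prompt_tokens + reserved_output_tokens
    return [name for name, limit in CONTEXT_WINDOW_TOKENS.items() if needed <= limit]


if __name__ == "__main__":
    # A 600,000-token document exceeds the smaller window but fits the larger one.
    print(models_that_fit(600_000))
```

Actual limits on input versus output tokens vary by provider, so treat the result as a first filter rather than a guarantee.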
Release dates
GPT-5.1 (OpenAI): 2025-11
Nova 2 Lite (Amazon): 2025-12-02, about 1 month newer
[Chart: Context window and performance specifications, GPT-5.1 vs. Nova 2 Lite]
[Chart: Average performance across 1 common benchmark, GPT-5.1 vs. Nova 2 Lite]
[Chart: Performance comparison across key benchmark categories, GPT-5.1 vs. Nova 2 Lite]
[Chart: Available providers and their performance metrics; AWS Bedrock is listed for Nova 2 Lite]