#01 | GPT-5 | GPT-5 is our flagship model for coding, reasoning, and agentic tasks across domains, with higher reasoning capabilities and medium speed. | Aug 7, 2025 | | 74.9% | 88.0% | 93.4% | - | - | |
#02 | GPT-5 Codex | GPT-5 Codex has been trained specifically for conducting code reviews and finding critical flaws. When reviewing, it navigates your codebase and analyzes code patterns to identify potential security vulnerabilities, performance issues, and bugs. | Sep 15, 2025 | | 74.5% | - | - | - | - | |
#03 | o3 | OpenAI's most powerful reasoning model. o3 is a well-rounded and powerful model across domains. It sets a new standard for math, science, coding, and visual reasoning tasks. It also excels at technical writing and instruction-following. Use it to think through multi-step problems that involve analysis across text, code, and images. | Apr 16, 2025 | | 69.1% | 81.3% | - | - | - | |
#04 | o4-mini | o4-mini is OpenAI's latest small o-series model, optimized for fast, effective reasoning with exceptionally efficient performance in coding and visual tasks. It is faster and more affordable than o3. | Apr 16, 2025 | | 68.1% | 68.9% | - | - | - | |
#05 | GPT-4.1 | GPT-4.1 is OpenAI's latest and most advanced flagship model, significantly improving upon GPT-4 Turbo in performance across benchmarks, speed, and cost-effectiveness. | Apr 14, 2025 | | 54.6% | 51.6% | - | - | - | |
#06 | o3-mini | A smaller variant of o3, expected to offer enhanced multimodal capabilities, improved reasoning, and more efficient resource utilization than previous models while maintaining strong performance on core tasks. | Jan 30, 2025 | | 49.3% | 66.7% | - | - | - | |
#07 | o1-preview | A research preview model focused on mathematical and logical reasoning capabilities, demonstrating improved performance on tasks requiring step-by-step reasoning, mathematical problem-solving, and code generation. The model shows enhanced capabilities in formal reasoning while maintaining strong general capabilities. | Sep 12, 2024 | | 41.3% | - | - | - | - | |
#08 | o1 | A research preview model focused on mathematical and logical reasoning capabilities, demonstrating improved performance on tasks requiring step-by-step reasoning, mathematical problem-solving, and code generation. The model shows enhanced capabilities in formal reasoning while maintaining strong general capabilities. | Dec 17, 2024 | | 41.0% | - | 88.1% | - | - | |
#09 | GPT-4.5 | GPT-4.5 is OpenAI's most advanced model, offering improved reasoning, coding, and creative capabilities with faster performance and longer context handling than GPT-4. It features enhanced instruction following, reduced hallucinations, and better factual accuracy. | Feb 27, 2025 | | 38.0% | - | 88.0% | - | - | |
#10 | GPT-4o | GPT-4o ('o' for 'omni') is a multimodal AI model that accepts text, audio, image, and video inputs, and generates text, audio, and image outputs. It matches GPT-4 Turbo performance on text and code, with improvements in non-English languages, vision, and audio understanding. | Aug 6, 2024 | | 33.2% | 30.7% | - | - | - | |