Comprehensive side-by-side LLM comparison
GPT-4 leads with a 17.5% higher average benchmark score and is available on 2 providers. Overall, GPT-4 is the stronger choice for coding tasks.
Gemma 3n E4B was developed as a slightly larger edge-optimized variant, designed to provide enhanced capabilities while still fitting within edge computing constraints. Built for applications that can accommodate more parameters in exchange for improved quality, it expands the range of edge deployment options.
OpenAI
GPT-4 was created as a large multimodal model capable of accepting image and text inputs while producing text outputs. Developed to exhibit human-level performance on various professional and academic benchmarks, it marked a significant advancement in reliability, creativity, and handling of nuanced instructions compared to its predecessors.
GPT-4 (OpenAI): 2023-06-13
Gemma 3n E4B: 2025-06-26 (2 years newer)
Context window and performance specifications
[Chart: average performance across 3 common benchmarks, Gemma 3n E4B vs. GPT-4]
GPT-4: 2022-12-31
Gemma 3n E4B: 2024-06-01
Available providers and their performance metrics
GPT-4: Azure, OpenAI