Comprehensive side-by-side LLM comparison
GPT-4 leads with 23.6% higher average benchmark score. GPT-4 is available on 2 providers. Overall, GPT-4 is the stronger choice for coding tasks.
Gemma 3N E2B was created as a specialized efficient variant, designed for deployment in edge and mobile environments with strict resource constraints. Built with optimizations for edge computing, it brings Gemma capabilities to devices and applications where traditional models would be impractical.
OpenAI
GPT-4 was created as a large multimodal model capable of accepting image and text inputs while producing text outputs. Developed to exhibit human-level performance on various professional and academic benchmarks, it marked a significant advancement in reliability, creativity, and handling of nuanced instructions compared to its predecessors.
2 years newer

GPT-4
OpenAI
2023-06-13

Gemma 3n E2B
2025-06-26
Context window and performance specifications
Average performance across 3 common benchmarks

Gemma 3n E2B

GPT-4
GPT-4
2022-12-31
Gemma 3n E2B
2024-06-01
Available providers and their performance metrics

Gemma 3n E2B

GPT-4
Azure

Gemma 3n E2B

GPT-4

Gemma 3n E2B

GPT-4
OpenAI