LLM Compare

Honest, task-based comparisons of the top language models. No marketing — just results.

Model Overview

Model	Org	Context	Strengths
GPT-4o	OpenAI	128k	CodingReasoningVision
Claude 3.7 Sonnet	Anthropic	200k	Long contextWritingInstruction following
Gemini 1.5 Pro	Google	1M	MultimodalLong contextCode
Mistral Large	Mistral AI	32k	Open weightsFastEuropean data
Llama 3.1 405B	Meta	128k	Open sourceSelf-hostableReasoning