Last updated:
Text (LLM), image, video, and multimodal—alongside the combined Model board.
Model leaderboard (all)
Desempenho entre tarefas (multimodal/visão/linguagem; dados de exemplo).
Public ranking policy: rows are sorted by composite score (desc). Composite score is a weighted sum of normalized sub-metrics; ties are broken by higher recent activity.
| Posição | Modelo | Provedor / equipe | Tipo | Pontuação | Notas |
|---|---|---|---|---|---|
| 1 | xAI: Grok 4.1 Fast | x-ai | Multimodal | 98.7 | 2.0M ctx · $0.35/1M avg |
| 2 | OpenAI: GPT-5.4 | openai | Multimodal | 62 | 1.1M ctx · $8.75/1M avg |
| 3 | Google: Lyria 3 Pro Preview | Multimodal | 62 | 1.0M ctx · $0.00/1M avg | |
| 4 | Meta: Llama 4 Maverick | meta-llama | Multimodal | 72.4 | 1.0M ctx · $0.38/1M avg |
| 5 | MiniMax: MiniMax-01 | minimax | Multimodal | 69.7 | 1.0M ctx · $0.65/1M avg |
| 6 | Qwen: Qwen3.5-Flash | qwen | Multimodal | 72.2 | 1.0M ctx · $0.16/1M avg |
| 7 | Amazon: Nova 2 Lite | amazon | Multimodal | 65.9 | 1.0M ctx · $1.40/1M avg |
| 8 | Anthropic: Claude Sonnet 4.6 | anthropic | Multimodal | 62 | 1.0M ctx · $9.00/1M avg |
| 9 | Mistral: Ministral 3 8B 2512 ≈5 GB VRAM (low-end 4-bit inference estimate) | mistralai | Multimodal | 62 | 262k ctx · $0.15/1M avg |
| 10 | ByteDance Seed: Seed 1.6 Flash | bytedance-seed | Multimodal | 62 | 262k ctx · $0.19/1M avg |
| 11 | MoonshotAI: Kimi K2.5 | moonshotai | Multimodal | 62 | 262k ctx · $1.05/1M avg |
| 12 | Xiaomi: MiMo-V2-Omni | xiaomi | Multimodal | 62 | 262k ctx · $1.20/1M avg |
| 13 | Z.ai: GLM 5V Turbo | z-ai | Multimodal | 62 | 203k ctx · $2.60/1M avg |
| 14 | Free Models Router | openrouter | Multimodal | 62 | 200k ctx · $0.00/1M avg |
| 15 | Perplexity: Sonar Pro Search | perplexity | Multimodal | 62 | 200k ctx · $9.00/1M avg |
| 16 | Arcee AI: Spotlight | arcee-ai | Multimodal | 62 | 131k ctx · $0.18/1M avg |
| 17 | NVIDIA: Nemotron Nano 12B 2 VL ≈7 GB VRAM (low-end 4-bit inference estimate) | nvidia | Multimodal | 62 | 131k ctx · $0.40/1M avg |
| 18 | ByteDance: UI-TARS 7B ≈4 GB VRAM (low-end 4-bit inference estimate) | bytedance | Multimodal | 62 | 128k ctx · $0.15/1M avg |
| 19 | Baidu: ERNIE 4.5 VL 424B A47B ≈234 GB VRAM (low-end 4-bit inference estimate) | baidu | Multimodal | 62 | 123k ctx · $0.83/1M avg |
| 20 | Reka Edge | rekaai | Multimodal | 62 | 16k ctx · $0.10/1M avg |