AI Models Leaderboard

Independent analysis of AI. Compare models by intelligence, speed, and price.

/
Filters:
SelectModel ↕Creator ↕Context Window ↕Intelligence ↓ⓘPrice ($/M) ↕ⓘSpeed (Tokens/s) ↕Latency (First Chunk s) ↕End-to-End Response (s) ↕
Claude Opus 4.8 (max)
claude-opus-4-8-max
Anthropic
1M61.00$4.10/M58 tok/s17.47 s26.05 s
GPT-5.5 (xhigh)
gpt-5-5-xhigh
OpenAI
922k60.00$4.35/M57 tok/s73.96 s82.77 s
GPT-5.5 (high)
gpt-5-5-high
OpenAI
922k59.00$4.35/M51 tok/s17.22 s26.94 s
Claude Opus 4.7 (max)
claude-opus-4-7-max
Anthropic
1M57.00$4.10/M45 tok/s10.79 s21.91 s
Gemini 3.1 Pro Preview
gemini-3-1-pro-preview
Google
1M57.00$1.74/M133 tok/s19.85 s23.60 s
GPT-5.5 (medium)
gpt-5-5-medium
OpenAI
922k57.00$4.35/M50 tok/s6.91 s16.91 s
Qwen3.7 Max
qwen3-7-max
Alibaba
1M57.00$1.43/M192 tok/s2.58 s17.71 s
Gemini 3.5 Flash
gemini-3-5-flash
Google
1M55.00$1.31/M183 tok/s18.79 s21.52 s
Gemini 3.5 Flash (medium)
gemini-3-5-flash-medium
Google
1M55.00$1.31/M174 tok/s13.85 s16.72 s
Kimi K2.6
kimi-k2-6
Kimi
256k54.00$0.70/M44 tok/s2.31 s114.04 s
MiMo-V2.5-Pro
mimo-v2-5-pro
Xiaomi
1M54.00$0.18/M50 tok/s3.32 s53.27 s
GPT-5.3 Codex (xhigh)
gpt-5-3-codex-xhigh
OpenAI
400k54.00$1.87/M80 tok/s95.05 s101.33 s
Grok 4.3 (high)
grok-4-3-high
xAI
1M53.00$0.64/M145 tok/s10.94 s14.39 s
Muse Spark
muse-spark
Meta
262k52.00————
Claude Opus 4.7 (Non-reasoning, high)
claude-opus-4-7-non-reasoning-high
Anthropic
1M52.00$4.10/M42 tok/s1.17 s13.01 s
Claude Sonnet 4.6 (max)
claude-sonnet-4-6-max
Anthropic
1M52.00$2.46/M46 tok/s106.64 s117.50 s
DeepSeek V4 Pro (Max)
deepseek-v4-pro-max
DeepSeek
1M52.00$0.18/M48 tok/s1.78 s104.11 s
GLM-5.1
glm-5-1
Z AI
200k51.00$0.90/M62 tok/s1.51 s70.81 s
GPT-5.5 (low)
gpt-5-5-low
OpenAI
922k51.00$4.35/M53 tok/s1.67 s11.12 s
Qwen3.6 Plus
qwen3-6-plus
Alibaba
1M50.00$0.43/M53 tok/s2.93 s116.85 s
DeepSeek V4 Pro (High)
deepseek-v4-pro-high
DeepSeek
1M50.00$0.18/M46 tok/s1.78 s55.68 s
MiniMax-M2.7
minimax-m2-7
MiniMax
205k50.00$0.22/M108 tok/s2.65 s30.19 s
MiMo-V2.5
mimo-v2-5
Xiaomi
1M49.00$0.06/M93 tok/s2.88 s29.82 s
GPT-5.4 mini (xhigh)
gpt-5-4-mini-xhigh
OpenAI
400k49.00$0.65/M164 tok/s5.09 s8.13 s
Grok 4.3 (medium)
grok-4-3-medium
xAI
1M49.00$0.64/M146 tok/s7.28 s10.70 s
GLM-5-Turbo
glm-5-turbo
Z AI
200k47.00————
DeepSeek V4 Flash (Max)
deepseek-v4-flash-max
DeepSeek
1M47.00$0.06/M111 tok/s1.22 s56.09 s
DeepSeek V4 Flash (High)
deepseek-v4-flash-high
DeepSeek
1M46.00$0.08/M———
Qwen3.6 27B
qwen3-6-27b
Alibaba
262k46.00$0.90/M57 tok/s3.86 s112.65 s
Qwen3.5 397B A17B
qwen3-5-397b-a17b
Alibaba
262k45.00$0.90/M53 tok/s2.54 s72.50 s
MiMo-V2-Omni-0327
mimo-v2-omni-0327
Xiaomi
256k45.00$0.34/M93 tok/s2.83 s29.70 s
Claude Sonnet 4.6 (Non-reasoning)
claude-sonnet-4-6-non-reasoning
Anthropic
1M44.00$2.46/M42 tok/s1.17 s13.10 s
GPT-5.4 nano (xhigh)
gpt-5-4-nano-xhigh
OpenAI
400k44.00$0.18/M152 tok/s4.25 s7.54 s
Grok 4.3 (low)
grok-4-3-low
xAI
1M44.00$0.64/M113 tok/s4.27 s8.71 s
GLM-5.1
glm-5-1
Z AI
200k44.00$0.90/M49 tok/s1.75 s11.93 s
Qwen3.6 35B A3B
qwen3-6-35b-a3b
Alibaba
262k43.00$0.37/M174 tok/s2.41 s16.77 s
MiMo-V2-Omni
mimo-v2-omni
Xiaomi
256k43.00$0.00/M91 tok/s3.71 s31.26 s
Gemini 3.5 Flash (minimal)
gemini-3-5-flash-minimal
Google
1M43.00$1.31/M169 tok/s0.90 s3.86 s
Kimi K2.6
kimi-k2-6
Kimi
256k43.00$0.70/M45 tok/s2.34 s13.50 s
GLM 5V Turbo
glm-5v-turbo
Z AI
200k43.00————
Claude Sonnet 4.6 (Non-reasoning, Low Effort)
claude-sonnet-4-6-non-reasoning-low-effort
Anthropic
1M43.00$2.46/M42 tok/s1.26 s13.07 s
Hy3-preview
hy3-preview
Tencent
256k42.00$0.10/M98 tok/s3.90 s29.34 s
GPT-5.5 Instant (May 2026)
gpt-5-5-instant-may-2026
OpenAI
400k42.00$4.35/M———
Qwen3.5 122B A10B
qwen3-5-122b-a10b
Alibaba
262k42.00$0.68/M140 tok/s2.55 s20.43 s
MiMo-V2-Flash (Feb 2026)
mimo-v2-flash-feb-2026
Xiaomi
256k41.00$0.06/M131 tok/s1.98 s21.04 s
GPT-5.5 (Non-reasoning)
gpt-5-5-non-reasoning
OpenAI
922k41.00$4.35/M51 tok/s0.96 s10.68 s
Qwen3.5 397B A17B
qwen3-5-397b-a17b
Alibaba
262k40.00$0.90/M53 tok/s2.50 s11.86 s
DeepSeek V4 Pro
deepseek-v4-pro
DeepSeek
1M39.00$0.18/M52 tok/s1.93 s11.54 s
Mistral Medium 3.5
mistral-medium-3-5
Mistral
256k39.00$2.10/M148 tok/s1.83 s18.74 s
Gemma 4 31B
gemma-4-31b
Google
256k39.00$0.00/M35 tok/s1.07 s64.15 s
Qwen3.5 Omni Plus
qwen3-5-omni-plus
Alibaba
256k39.00$0.84/M55 tok/s2.45 s11.48 s
Step 3.5 Flash 2603
step-3-5-flash-2603
StepFun
256k38.00$0.00/M163 tok/s1.20 s16.56 s
Ring-2.6-1T
ring-2-6-1t
InclusionAI
262k38.00$0.52/M120 tok/s3.22 s24.09 s
o3
o3
OpenAI
200k38.00$1.55/M131 tok/s5.98 s9.79 s
GPT-5.4 nano
gpt-5-4-nano
OpenAI
400k38.00$0.18/M148 tok/s3.01 s6.38 s
GPT-5.4 mini (medium)
gpt-5-4-mini-medium
OpenAI
400k38.00$0.65/M158 tok/s4.11 s7.26 s
Command A+
command-a
Cohere
192k37.00$0.00/M222 tok/s0.26 s11.54 s
Qwen3.6 27B
qwen3-6-27b
Alibaba
262k37.00$0.90/M58 tok/s3.86 s12.56 s
Claude 4.5 Haiku
claude-4-5-haiku
Anthropic
200k37.00$0.82/M92 tok/s20.98 s26.43 s
DeepSeek V4 Flash
deepseek-v4-flash
DeepSeek
1M36.00$0.06/M114 tok/s1.38 s5.78 s
JT-35B-Flash
jt-35b-flash
China Mobile
256k36.00————
NVIDIA Nemotron 3 Super
nvidia-nemotron-3-super
NVIDIA
1M36.00$0.28/M187 tok/s1.82 s15.19 s
Qwen3.5 122B A10B
qwen3-5-122b-a10b
Alibaba
262k36.00$0.68/M162 tok/s2.51 s5.59 s
Nova 2.0 Pro Preview (medium)
nova-2-0-pro-preview-medium
Amazon
256k36.00$1.47/M114 tok/s12.99 s34.89 s
MiMo-V2.5-Pro
mimo-v2-5-pro
Xiaomi
1M36.00$0.58/M50 tok/s3.20 s13.28 s
Gemini 2.5 Pro
gemini-2-5-pro
Google
1M35.00$1.34/M133 tok/s22.23 s25.99 s
Nova 2.0 Lite (high)
nova-2-0-lite-high
Amazon
1M35.00$0.52/M153 tok/s14.33 s30.68 s
Hy3-preview
hy3-preview
Tencent
256k34.00$0.10/M89 tok/s3.98 s9.62 s
Ling-2.6-1T
ling-2-6-1t
InclusionAI
262k34.00$0.52/M———
Doubao Seed Code
doubao-seed-code
ByteDance Seed
256k34.00————
Gemini 3.1 Flash-Lite
gemini-3-1-flash-lite
Google
1M34.00$0.22/M270 tok/s5.57 s7.43 s
gpt-oss-120b (high)
gpt-oss-120b-high
OpenAI
131k33.00$0.20/M329 tok/s0.86 s8.46 s
Mercury 2
mercury-2
Inception
128k33.00$0.14/M785 tok/s3.11 s3.75 s
Qwen3.5 9B
qwen3-5-9b
Alibaba
262k32.00$0.11/M68 tok/s2.28 s38.84 s
Gemma 4 31B
gemma-4-31b
Google
256k32.00$0.17/M18 tok/s1.39 s29.89 s
K-EXAONE
k-exaone
LG AI Research
256k32.00————
Nova 2.0 Pro Preview (low)
nova-2-0-pro-preview-low
Amazon
256k32.00$2.13/M117 tok/s11.16 s32.50 s
Trinity Large Thinking
trinity-large-thinking
Arcee AI
512k32.00$0.24/M157 tok/s1.15 s17.03 s
Qwen3.6 35B A3B
qwen3-6-35b-a3b
Alibaba
262k32.00$0.56/M178 tok/s2.52 s5.32 s
Gemma 4 26B A4B
gemma-4-26b-a4b
Google
256k31.00$0.14/M———
Claude 4.5 Haiku
claude-4-5-haiku
Anthropic
200k31.00$0.82/M90 tok/s0.77 s6.34 s
Grok 4.3
grok-4-3
xAI
1M31.00$0.64/M110 tok/s0.63 s5.17 s
Qwen3.5 35B A3B
qwen3-5-35b-a3b
Alibaba
262k31.00$0.42/M154 tok/s2.13 s5.38 s
MiMo-V2-Flash
mimo-v2-flash
Xiaomi
256k30.00$0.12/M130 tok/s2.07 s5.92 s
EXAONE 4.5 33B
exaone-4-5-33b
LG AI Research
262k30.00————
Nova 2.0 Lite (medium)
nova-2-0-lite-medium
Amazon
1M30.00$0.52/M146 tok/s21.21 s38.33 s
ERNIE 5.0 Thinking Preview
ernie-5-0-thinking-preview
Baidu
128k29.00————
Nemotron Cascade 2 30B A3B
nemotron-cascade-2-30b-a3b
NVIDIA
1M28.00————
Qwen3 Coder Next
qwen3-coder-next
Alibaba
256k28.00$0.43/M107 tok/s1.62 s6.27 s
Nova 2.0 Omni (medium)
nova-2-0-omni-medium
Amazon
1M28.00$0.52/M———
Mistral Small 4
mistral-small-4
Mistral
256k28.00$0.20/M181 tok/s0.70 s14.49 s
Qwen3.5 9B
qwen3-5-9b
Alibaba
262k27.00————
Magistral Medium 1.2
magistral-medium-1-2
Mistral
128k27.00$2.30/M39 tok/s1.90 s66.75 s
Gemma 4 26B A4B
gemma-4-26b-a4b
Google
256k27.00$0.16/M83 tok/s1.59 s7.58 s
Qwen3.5 4B
qwen3-5-4b
Alibaba
262k27.00$0.04/M194 tok/s0.42 s13.31 s
Qwen3 Next 80B A3B
qwen3-next-80b-a3b
Alibaba
262k27.00$1.05/M137 tok/s2.31 s20.54 s
Ling 2.6 Flash
ling-2-6-flash
InclusionAI
262k26.00$0.06/M———
Solar Pro 3
solar-pro-3
Upstage
128k26.00————
Qwen3.5 Omni Flash
qwen3-5-omni-flash
Alibaba
256k26.00$0.17/M241 tok/s1.87 s3.95 s
JT-MINI
jt-mini
China Mobile
128k25.00————
Nova 2.0 Lite (low)
nova-2-0-lite-low
Amazon
1M25.00$0.52/M152 tok/s9.97 s26.43 s
gpt-oss-20B (high)
gpt-oss-20b-high
OpenAI
131k24.00$0.07/M235 tok/s0.74 s11.38 s
gpt-oss-120b (low)
gpt-oss-120b-low
OpenAI
131k24.00$0.20/M348 tok/s0.87 s8.04 s
GPT-5.4 nano
gpt-5-4-nano
OpenAI
400k24.00$0.18/M148 tok/s0.63 s3.99 s
NVIDIA Nemotron 3 Nano
nvidia-nemotron-3-nano
NVIDIA
1M24.00$0.07/M132 tok/s2.10 s21.02 s
LongCat Flash Lite
longcat-flash-lite
LongCat
256k24.00$0.00/M81 tok/s6.33 s12.48 s
K-EXAONE
k-exaone
LG AI Research
256k23.00————
GPT-5.4 mini
gpt-5-4-mini
OpenAI
400k23.00$0.65/M144 tok/s0.64 s4.10 s
Nova 2.0 Omni (low)
nova-2-0-omni-low
Amazon
1M23.00$0.52/M———
Nova 2.0 Pro Preview
nova-2-0-pro-preview
Amazon
256k23.00$2.13/M123 tok/s1.08 s5.14 s
Mi:dm K 2.5 Pro
mi-dm-k-2-5-pro
Korea Telecom
128k23.00————
Mistral Large 3
mistral-large-3
Mistral
256k23.00$0.60/M52 tok/s1.09 s10.75 s
Qwen3.5 4B
qwen3-5-4b
Alibaba
262k23.00$0.04/M200 tok/s0.45 s2.95 s
INTELLECT-3
intellect-3
Prime Intellect
131k22.00————
Devstral 2
devstral-2
Mistral
256k22.00$0.00/M66 tok/s1.21 s8.74 s
Solar Open 100B
solar-open-100b
Upstage
128k22.00————
Nemotron 3 Nano Omni 30B A3B Reasoning
nemotron-3-nano-omni-30b-a3b-reasoning
NVIDIA
256k21.00$0.10/M299 tok/s1.03 s9.40 s
gpt-oss-20B (low)
gpt-oss-20b-low
OpenAI
131k21.00$0.07/M242 tok/s0.78 s11.11 s
Qwen3 Next 80B A3B
qwen3-next-80b-a3b
Alibaba
262k20.00$0.65/M146 tok/s2.28 s5.70 s
Devstral Small 2
devstral-small-2
Mistral
256k19.00$0.00/M68 tok/s1.13 s8.50 s
Motif-2-12.7B
motif-2-12-7b
Motif Technologies
128k19.00————
Nova Premier
nova-premier
Amazon
1M19.00$2.18/M35 tok/s2.92 s17.28 s
Gemma 4 E4B
gemma-4-e4b
Google
128k19.00————
Llama Nemotron Super 49B v1.5
llama-nemotron-super-49b-v1-5
NVIDIA
128k19.00$0.13/M47 tok/s1.34 s54.61 s
Mistral Small 4
mistral-small-4
Mistral
256k19.00$0.20/M157 tok/s0.69 s3.86 s
Llama 4 Maverick
llama-4-maverick
Meta
1M18.00$0.34/M109 tok/s1.01 s5.61 s
Magistral Small 1.2
magistral-small-1-2
Mistral
128k18.00$0.60/M108 tok/s0.81 s24.00 s
Sarvam 105B (high)
sarvam-105b-high
Sarvam
128k18.00$0.04/M90 tok/s2.09 s29.99 s
Nova 2.0 Lite
nova-2-0-lite
Amazon
1M18.00$0.52/M141 tok/s1.34 s4.89 s
MiniCPM5-1B
minicpm5-1b
OpenBMB
128k18.00————
Llama 3.1 405B
llama-3-1-405b
Meta
128k17.00$3.13/M35 tok/s2.36 s16.67 s
EXAONE 4.0 32B
exaone-4-0-32b
LG AI Research
131k17.00————
Nova 2.0 Omni
nova-2-0-omni
Amazon
1M17.00$0.52/M———
Qwen3.5 2B
qwen3-5-2b
Alibaba
262k16.00$0.03/M———
Nanbeige4.1-3B
nanbeige4-1-3b
Nanbeige
256k16.00————
Ministral 3 14B
ministral-3-14b
Mistral
256k16.00$0.20/M77 tok/s0.80 s7.30 s
Falcon-H1R-7B
falcon-h1r-7b
TII UAE
256k16.00————
Qwen3 Omni 30B A3B
qwen3-omni-30b-a3b
Alibaba
66k16.00$0.32/M91 tok/s1.96 s29.41 s
Step3 VL 10B
step3-vl-10b
StepFun
66k15.00————
Gemma 4 E2B
gemma-4-e2b
Google
128k15.00————
Llama Nemotron Ultra
llama-nemotron-ultra
NVIDIA
128k15.00$0.72/M52 tok/s2.43 s50.77 s
ERNIE 4.5 300B A47B
ernie-4-5-300b-a47b
Baidu
131k15.00$0.36/M25 tok/s3.57 s23.42 s
Solar Pro 2
solar-pro-2
Upstage
66k15.00————
NVIDIA Nemotron Nano 12B v2 VL
nvidia-nemotron-nano-12b-v2-vl
NVIDIA
128k15.00$0.24/M———
Ministral 3 8B
ministral-3-8b
Mistral
256k15.00$0.15/M97 tok/s0.64 s5.79 s
Gemma 4 E4B
gemma-4-e4b
Google
128k15.00————
NVIDIA Nemotron Nano 9B V2
nvidia-nemotron-nano-9b-v2
NVIDIA
131k15.00$0.05/M122 tok/s0.70 s21.19 s
Granite 4.1 30B
granite-4-1-30b
IBM
131k15.00————
NVIDIA Nemotron 3 Nano 4B
nvidia-nemotron-3-nano-4b
NVIDIA
262k15.00————
Qwen3.5 2B
qwen3-5-2b
Alibaba
262k15.00$0.03/M247 tok/s0.42 s2.45 s
Llama Nemotron Super 49B v1.5
llama-nemotron-super-49b-v1-5
NVIDIA
128k15.00$0.13/M48 tok/s1.30 s11.67 s
Llama 3.3 70B
llama-3-3-70b
Meta
128k14.00$0.60/M81 tok/s1.61 s7.79 s
Kimi Linear 48B A3B Instruct
kimi-linear-48b-a3b-instruct
Kimi
1M14.00————
Ring-flash-2.0
ring-flash-2-0
InclusionAI
128k14.00$0.18/M———
Solar Pro 2
solar-pro-2
Upstage
66k14.00————
Llama 4 Scout
llama-4-scout
Meta
10M14.00$0.22/M106 tok/s0.86 s5.58 s
Command A
command-a
Cohere
256k13.00$3.25/M54 tok/s1.80 s11.12 s
Llama 3.1 Nemotron 70B
llama-3-1-nemotron-70b
NVIDIA
128k13.00$1.20/M292 tok/s0.50 s2.21 s
NVIDIA Nemotron 3 Nano
nvidia-nemotron-3-nano
NVIDIA
1M13.00$0.07/M87 tok/s0.42 s6.17 s
NVIDIA Nemotron Nano 9B V2
nvidia-nemotron-nano-9b-v2
NVIDIA
131k13.00$0.06/M142 tok/s1.03 s4.54 s
MiniCPM-V 4.6 1.3B
minicpm-v-4-6-1-3b
OpenBMB
262k13.00————
Granite 4.1 8B
granite-4-1-8b
IBM
131k12.00$0.06/M108 tok/s0.79 s5.42 s
Sarvam 30B (high)
sarvam-30b-high
Sarvam
66k12.00$0.03/M163 tok/s1.94 s17.29 s
Gemma 4 E2B
gemma-4-e2b
Google
128k12.00————
R1 1776
r1-1776
Perplexity
128k12.00————
Llama 3.2 90B (Vision)
llama-3-2-90b-vision
Meta
128k12.00$1.38/M58 tok/s1.24 s9.83 s
EXAONE 4.0 32B
exaone-4-0-32b
LG AI Research
131k12.00————
Ministral 3 3B
ministral-3-3b
Mistral
256k11.00$0.10/M192 tok/s0.51 s3.12 s
Jamba 1.7 Large
jamba-1-7-large
AI21 Labs
256k11.00$2.60/M62 tok/s1.59 s9.60 s
Granite 4.0 H Small
granite-4-0-h-small
IBM
128k11.00$0.08/M350 tok/s10.36 s11.79 s
Qwen3 Omni 30B A3B
qwen3-omni-30b-a3b
Alibaba
66k11.00$0.32/M96 tok/s2.06 s7.26 s
Qwen3.5 0.8B
qwen3-5-0-8b
Alibaba
262k11.00$0.01/M———
LFM2 24B A2B
lfm2-24b-a2b
Liquid AI
33k10.00$0.04/M120 tok/s0.58 s4.76 s
Phi-4
phi-4
Microsoft
16k10.00$0.16/M38 tok/s2.02 s15.10 s
Nova Micro
nova-micro
Amazon
130k10.00$0.03/M290 tok/s0.92 s2.64 s
NVIDIA Nemotron Nano 12B v2 VL
nvidia-nemotron-nano-12b-v2-vl
NVIDIA
128k10.00$0.24/M227 tok/s1.06 s3.26 s
Phi-4 Multimodal
phi-4-multimodal
Microsoft
128k10.00$0.00/M17 tok/s1.06 s31.30 s
Qwen3.5 0.8B
qwen3-5-0-8b
Alibaba
262k10.00$0.01/M69 tok/s0.44 s7.64 s
Jamba Reasoning 3B
jamba-reasoning-3b
AI21 Labs
262k10.00————
Reka Flash 3
reka-flash-3
Reka AI
128k10.00$0.26/M———
Ling-mini-2.0
ling-mini-2-0
InclusionAI
131k9.00————
Llama 3.2 11B (Vision)
llama-3-2-11b-vision
Meta
128k9.00$0.25/M53 tok/s0.70 s10.18 s
Granite 4.1 3B
granite-4-1-3b
IBM
131k9.00————
Phi-4 Mini
phi-4-mini
Microsoft
128k8.00$0.00/M———
Exaone 4.0 1.2B
exaone-4-0-1-2b
LG AI Research
64k8.00————
Exaone 4.0 1.2B
exaone-4-0-1-2b
LG AI Research
64k8.00————
LFM2.5-1.2B-Thinking
lfm2-5-1-2b-thinking
Liquid AI
32k8.00————
Jamba 1.7 Mini
jamba-1-7-mini
AI21 Labs
258k8.00————
LFM2 2.6B
lfm2-2-6b
Liquid AI
33k8.00$0.00/M———
LFM2.5-1.2B-Instruct
lfm2-5-1-2b-instruct
Liquid AI
32k8.00$0.00/M———
Granite 4.0 H 1B
granite-4-0-h-1b
IBM
128k8.00————
Gemma 3 270M
gemma-3-270m
Google
32k8.00————
Apertus 70B Instruct
apertus-70b-instruct
Swiss AI Initiative
66k8.00$1.03/M———
Granite 4.0 Micro
granite-4-0-micro
IBM
128k8.00————
Granite 4.0 1B
granite-4-0-1b
IBM
128k7.00————
LFM2 8B A1B
lfm2-8b-a1b
Liquid AI
33k7.00$0.00/M———
LFM2.5-VL-1.6B
lfm2-5-vl-1-6b
Liquid AI
32k6.00$0.00/M———
Granite 4.0 350M
granite-4-0-350m
IBM
33k6.00————
Apertus 8B Instruct
apertus-8b-instruct
Swiss AI Initiative
66k6.00$0.11/M———
Granite 4.0 H 350M
granite-4-0-h-350m
IBM
33k5.00————
Tiny Aya Global
tiny-aya-global
Cohere
8k5.00$0.00/M———
EXAONE 4.5 33B
exaone-4-5-33b
LG AI Research
262k—————
Gemini 3 Deep Think
gemini-3-deep-think
Google
128k—————
Mi:dm K 2.5 Pro Preview
mi-dm-k-2-5-pro-preview
Korea Telecom
128k—————
GPT-5.5 Pro (xhigh)
gpt-5-5-pro-xhigh
OpenAI
922k—————