LLM TOKEN COST CALC
Compare token pricing across 314 LLM models from 52 AI providers
Total Models: 314
Providers: 52
Last Updated: 27 minutes ago
Model Pricing by Provider

Provider: GPT, Models: 60

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
OpenAI: gpt-oss-20b openai/gpt-oss-20b | 131K | $0.030 | $0.140 | LOW |
OpenAI: gpt-oss-120b openai/gpt-oss-120b | 131K | $0.039 | $0.190 | LOW |
OpenAI: gpt-oss-safeguard-20b openai/gpt-oss-safeguard-20b | 131K | $0.075 | $0.300 | LOW |
OpenAI: GPT-5 Nano openai/gpt-5-nano | 400K | $0.050 | $0.400 | LOW |
OpenAI: GPT-4.1 Nano openai/gpt-4.1-nano | 1.0M | $0.100 | $0.400 | LOW |
OpenAI: GPT-4o-mini Search Preview openai/gpt-4o-mini-search-preview | 128K | $0.150 | $0.600 | LOW |
OpenAI: GPT-4o-mini openai/gpt-4o-mini | 128K | $0.150 | $0.600 | LOW |
OpenAI: GPT-4o-mini (2024-07-18) openai/gpt-4o-mini-2024-07-18 | 128K | $0.150 | $0.600 | LOW |
OpenAI: GPT-5.4 Nano openai/gpt-5.4-nano | 400K | $0.200 | $1.25 | MED |
OpenAI: GPT-4.1 Mini openai/gpt-4.1-mini | 1.0M | $0.400 | $1.60 | MED |
OpenAI: GPT-3.5 Turbo openai/gpt-3.5-turbo | 16K | $0.500 | $1.50 | MED |
OpenAI: GPT-5.1-Codex-Mini openai/gpt-5.1-codex-mini | 400K | $0.250 | $2.00 | MED |
OpenAI: GPT-5 Mini openai/gpt-5-mini | 400K | $0.250 | $2.00 | MED |
OpenAI: GPT Audio Mini openai/gpt-audio-mini | 128K | $0.600 | $2.40 | MED |
OpenAI: GPT-3.5 Turbo (older v0613) openai/gpt-3.5-turbo-0613 | 4K | $1.00 | $2.00 | MED |
OpenAI: GPT-3.5 Turbo Instruct openai/gpt-3.5-turbo-instruct | 4K | $1.50 | $2.00 | MED |
OpenAI: GPT-5 Image Mini openai/gpt-5-image-mini | 400K | $2.50 | $2.00 | MED |
OpenAI: GPT-5.4 Mini openai/gpt-5.4-mini | 400K | $0.750 | $4.50 | MED |
OpenAI: o4 Mini High openai/o4-mini-high | 200K | $1.10 | $4.40 | MED |
OpenAI: o4 Mini openai/o4-mini | 200K | $1.10 | $4.40 | MED |
OpenAI: o3 Mini High openai/o3-mini-high | 200K | $1.10 | $4.40 | MED |
OpenAI: o3 Mini openai/o3-mini | 200K | $1.10 | $4.40 | MED |
OpenAI: GPT-3.5 Turbo 16k openai/gpt-3.5-turbo-16k | 16K | $3.00 | $4.00 | MED |
OpenAI: o4 Mini Deep Research openai/o4-mini-deep-research | 200K | $2.00 | $8.00 | HIGH |
OpenAI: o3 openai/o3 | 200K | $2.00 | $8.00 | HIGH |
OpenAI: GPT-4.1 openai/gpt-4.1 | 1.0M | $2.00 | $8.00 | HIGH |
OpenAI: GPT-5.1-Codex-Max openai/gpt-5.1-codex-max | 400K | $1.25 | $10 | HIGH |
OpenAI: GPT-5.1 openai/gpt-5.1 | 400K | $1.25 | $10 | HIGH |
OpenAI: GPT-5.1 Chat openai/gpt-5.1-chat | 128K | $1.25 | $10 | HIGH |
OpenAI: GPT-5.1-Codex openai/gpt-5.1-codex | 400K | $1.25 | $10 | HIGH |
OpenAI: GPT-5 Codex openai/gpt-5-codex | 400K | $1.25 | $10 | HIGH |
OpenAI: GPT-5 Chat openai/gpt-5-chat | 128K | $1.25 | $10 | HIGH |
OpenAI: GPT-5 openai/gpt-5 | 400K | $1.25 | $10 | HIGH |
OpenAI: GPT Audio openai/gpt-audio | 128K | $2.50 | $10 | HIGH |
OpenAI: GPT-4o Audio openai/gpt-4o-audio-preview | 128K | $2.50 | $10 | HIGH |
OpenAI: GPT-4o Search Preview openai/gpt-4o-search-preview | 128K | $2.50 | $10 | HIGH |
OpenAI: GPT-4o (2024-11-20) openai/gpt-4o-2024-11-20 | 128K | $2.50 | $10 | HIGH |
OpenAI: GPT-4o (2024-08-06) openai/gpt-4o-2024-08-06 | 128K | $2.50 | $10 | HIGH |
OpenAI: GPT-4o openai/gpt-4o | 128K | $2.50 | $10 | HIGH |
OpenAI: GPT-5.3 Chat openai/gpt-5.3-chat | 128K | $1.75 | $14 | HIGH |
OpenAI: GPT-5.3-Codex openai/gpt-5.3-codex | 400K | $1.75 | $14 | HIGH |
OpenAI: GPT-5.2-Codex openai/gpt-5.2-codex | 400K | $1.75 | $14 | HIGH |
OpenAI: GPT-5.2 Chat openai/gpt-5.2-chat | 128K | $1.75 | $14 | HIGH |
OpenAI: GPT-5.2 openai/gpt-5.2 | 400K | $1.75 | $14 | HIGH |
OpenAI: GPT-5.4 openai/gpt-5.4 | 1.1M | $2.50 | $15 | HIGH |
OpenAI: GPT-5 Image openai/gpt-5-image | 400K | $10 | $10 | HIGH |
OpenAI: GPT-4o (2024-05-13) openai/gpt-4o-2024-05-13 | 128K | $5.00 | $15 | HIGH |
OpenAI: GPT-4o (extended) openai/gpt-4o:extended | 128K | $6.00 | $18 | HIGH |
OpenAI: GPT-4 Turbo openai/gpt-4-turbo | 128K | $10 | $30 | HIGH |
OpenAI: GPT-4 Turbo Preview openai/gpt-4-turbo-preview | 128K | $10 | $30 | HIGH |
OpenAI: GPT-4 Turbo (older v1106) openai/gpt-4-1106-preview | 128K | $10 | $30 | HIGH |
OpenAI: o3 Deep Research openai/o3-deep-research | 200K | $10 | $40 | HIGH |
OpenAI: o1 openai/o1 | 200K | $15 | $60 | HIGH |
OpenAI: GPT-4 openai/gpt-4 | 8K | $30 | $60 | HIGH |
OpenAI: GPT-4 (older v0314) openai/gpt-4-0314 | 8K | $30 | $60 | HIGH |
OpenAI: o3 Pro openai/o3-pro | 200K | $20 | $80 | HIGH |
OpenAI: GPT-5 Pro openai/gpt-5-pro | 400K | $15 | $120 | HIGH |
OpenAI: GPT-5.2 Pro openai/gpt-5.2-pro | 400K | $21 | $168 | HIGH |
OpenAI: GPT-5.4 Pro openai/gpt-5.4-pro | 1.1M | $30 | $180 | HIGH |
OpenAI: o1-pro openai/o1-pro | 200K | $150 | $600 | HIGH |
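
The Input $/M and Output $/M columns are prices in dollars per million tokens, so a row converts to a per-request estimate by scaling each token count by one million. The following is a minimal sketch, not this calculator's own code: the function name `estimate_cost` and the token counts are hypothetical, the prices are copied from the openai/gpt-4o-mini row above, and it assumes flat per-token billing with no caching, batch, or tool-call adjustments.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens, output_tokens, input_per_m, output_per_m):
    """Return the estimated dollar cost of one request.

    Prices are passed as strings (e.g. "0.150") so Decimal keeps them exact.
    """
    input_cost = Decimal(input_tokens) / TOKENS_PER_MILLION * Decimal(input_per_m)
    output_cost = Decimal(output_tokens) / TOKENS_PER_MILLION * Decimal(output_per_m)
    return input_cost + output_cost


# Hypothetical request: 10,000 prompt tokens and 2,000 completion tokens,
# priced with the openai/gpt-4o-mini row ($0.150 in, $0.600 out).
cost = estimate_cost(10_000, 2_000, "0.150", "0.600")
print(f"Estimated cost: ${cost}")
```

Using `Decimal` rather than `float` avoids rounding noise when many small per-request costs are summed.
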

Provider: Claude, Models: 13

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Anthropic: Claude 3 Haiku anthropic/claude-3-haiku | 200K | $0.250 | $1.25 | MED |
Anthropic: Claude 3.5 Haiku anthropic/claude-3.5-haiku | 200K | $0.800 | $4.00 | MED |
Anthropic: Claude Haiku 4.5 anthropic/claude-haiku-4.5 | 200K | $1.00 | $5.00 | MED |
Anthropic: Claude Sonnet 4.6 anthropic/claude-sonnet-4.6 | 1.0M | $3.00 | $15 | HIGH |
Anthropic: Claude Sonnet 4.5 anthropic/claude-sonnet-4.5 | 1.0M | $3.00 | $15 | HIGH |
Anthropic: Claude Sonnet 4 anthropic/claude-sonnet-4 | 1.0M | $3.00 | $15 | HIGH |
Anthropic: Claude 3.7 Sonnet anthropic/claude-3.7-sonnet | 200K | $3.00 | $15 | HIGH |
Anthropic: Claude 3.7 Sonnet (thinking) anthropic/claude-3.7-sonnet:thinking | 200K | $3.00 | $15 | HIGH |
Anthropic: Claude Opus 4.6 anthropic/claude-opus-4.6 | 1.0M | $5.00 | $25 | HIGH |
Anthropic: Claude Opus 4.5 anthropic/claude-opus-4.5 | 200K | $5.00 | $25 | HIGH |
Anthropic: Claude Opus 4.1 anthropic/claude-opus-4.1 | 200K | $15 | $75 | HIGH |
Anthropic: Claude Opus 4 anthropic/claude-opus-4 | 200K | $15 | $75 | HIGH |
Anthropic: Claude Opus 4.6 (Fast) anthropic/claude-opus-4.6-fast | 1.0M | $30 | $150 | HIGH |
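
Each row in these tables follows the same pattern: a display name followed by the model ID in the first cell, then an abbreviated context size, two prices, and a tier label. As a rough sketch of how such a row could be turned into a structured record, assuming that five-column layout holds and that the ID is always the last whitespace-separated token of the first cell (the helper names `parse_context` and `parse_row` are hypothetical):

```python
from decimal import Decimal

# Multipliers for the abbreviated context sizes used in the tables.
SUFFIXES = {"K": 1_000, "M": 1_000_000}


def parse_context(text):
    """Convert an abbreviated size such as '200K' or '1.0M' to a token count.

    The table values are rounded, so the result is approximate.
    """
    number, suffix = text[:-1], text[-1]
    return int(Decimal(number) * SUFFIXES[suffix])


def parse_row(line):
    """Split one table row into a dict; assumes the five-column layout."""
    cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
    model, context, input_price, output_price, tier = cells
    name, model_id = model.rsplit(" ", 1)  # the ID is the last token
    return {
        "name": name,
        "id": model_id,
        "context_tokens": parse_context(context),
        "input_per_m": Decimal(input_price.lstrip("$")),
        "output_per_m": Decimal(output_price.lstrip("$")),
        "tier": tier,
    }


row = parse_row(
    "Anthropic: Claude Opus 4.6 anthropic/claude-opus-4.6 | 1.0M | $5.00 | $25 | HIGH |"
)
print(row["id"], row["context_tokens"], row["input_per_m"], row["output_per_m"])
```

The parser raises an error on a row with a different number of cells, which is a cheap way to notice if the page layout changes.
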

Provider: Gemini, Models: 22

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Google: Gemma 3n 4B google/gemma-3n-e4b-it | 33K | $0.020 | $0.040 | LOW |
Google: Gemma 3 4B google/gemma-3-4b-it | 131K | $0.040 | $0.080 | LOW |
Google: Gemma 3 12B google/gemma-3-12b-it | 131K | $0.040 | $0.130 | LOW |
Google: Gemma 3 27B google/gemma-3-27b-it | 131K | $0.080 | $0.160 | LOW |
Google: Gemini 2.0 Flash Lite google/gemini-2.0-flash-lite-001 | 1.0M | $0.075 | $0.300 | LOW |
Google: Gemma 4 26B A4B google/gemma-4-26b-a4b-it | 262K | $0.080 | $0.350 | LOW |
Google: Gemini 2.5 Flash Lite Preview 09-2025 google/gemini-2.5-flash-lite-preview-09-2025 | 1.0M | $0.100 | $0.400 | LOW |
Google: Gemini 2.5 Flash Lite google/gemini-2.5-flash-lite | 1.0M | $0.100 | $0.400 | LOW |
Google: Gemini 2.0 Flash google/gemini-2.0-flash-001 | 1.0M | $0.100 | $0.400 | LOW |
Google: Gemma 4 31B google/gemma-4-31b-it | 262K | $0.130 | $0.380 | LOW |
Google: Gemma 2 27B google/gemma-2-27b-it | 8K | $0.650 | $0.650 | MED |
Google: Gemini 3.1 Flash Lite Preview google/gemini-3.1-flash-lite-preview | 1.0M | $0.250 | $1.50 | MED |
Google: Nano Banana (Gemini 2.5 Flash Image) google/gemini-2.5-flash-image | 33K | $0.300 | $2.50 | MED |
Google: Gemini 2.5 Flash google/gemini-2.5-flash | 1.0M | $0.300 | $2.50 | MED |
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview) google/gemini-3.1-flash-image-preview | 66K | $0.500 | $3.00 | MED |
Google: Gemini 3 Flash Preview google/gemini-3-flash-preview | 1.0M | $0.500 | $3.00 | MED |
Google: Gemini 2.5 Pro google/gemini-2.5-pro | 1.0M | $1.25 | $10 | HIGH |
Google: Gemini 2.5 Pro Preview 06-05 google/gemini-2.5-pro-preview | 1.0M | $1.25 | $10 | HIGH |
Google: Gemini 2.5 Pro Preview 05-06 google/gemini-2.5-pro-preview-05-06 | 1.0M | $1.25 | $10 | HIGH |
Google: Gemini 3.1 Pro Preview Custom Tools google/gemini-3.1-pro-preview-customtools | 1.0M | $2.00 | $12 | HIGH |
Google: Gemini 3.1 Pro Preview google/gemini-3.1-pro-preview | 1.0M | $2.00 | $12 | HIGH |
Google: Nano Banana Pro (Gemini 3 Pro Image Preview) google/gemini-3-pro-image-preview | 66K | $2.00 | $12 | HIGH |

Provider: Grok, Models: 10

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
xAI: Grok 4.1 Fast x-ai/grok-4.1-fast | 2.0M | $0.200 | $0.500 | LOW |
xAI: Grok 4 Fast x-ai/grok-4-fast | 2.0M | $0.200 | $0.500 | LOW |
xAI: Grok 3 Mini x-ai/grok-3-mini | 131K | $0.300 | $0.500 | LOW |
xAI: Grok 3 Mini Beta x-ai/grok-3-mini-beta | 131K | $0.300 | $0.500 | LOW |
xAI: Grok Code Fast 1 x-ai/grok-code-fast-1 | 256K | $0.200 | $1.50 | MED |
xAI: Grok 4.20 Multi-Agent x-ai/grok-4.20-multi-agent | 2.0M | $2.00 | $6.00 | MED |
xAI: Grok 4.20 x-ai/grok-4.20 | 2.0M | $2.00 | $6.00 | MED |
xAI: Grok 4 x-ai/grok-4 | 256K | $3.00 | $15 | HIGH |
xAI: Grok 3 x-ai/grok-3 | 131K | $3.00 | $15 | HIGH |
xAI: Grok 3 Beta x-ai/grok-3-beta | 131K | $3.00 | $15 | HIGH |
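
Because input and output are priced separately, which model is cheapest depends on the mix of prompt and completion tokens, not on either column alone. The sketch below illustrates this with two rows copied from the Grok table above; the function name `cheapest` and the workload sizes are hypothetical, and flat per-token billing is assumed.

```python
from decimal import Decimal

# (input $/M, output $/M) copied from the Grok table above.
PRICES = {
    "x-ai/grok-code-fast-1": (Decimal("0.200"), Decimal("1.50")),
    "x-ai/grok-3-mini": (Decimal("0.300"), Decimal("0.500")),
}


def cheapest(input_tokens, output_tokens, prices=PRICES):
    """Return (model_id, cost) for the lowest-cost model on this workload."""
    million = Decimal(1_000_000)
    costs = {
        model: Decimal(input_tokens) / million * in_price
        + Decimal(output_tokens) / million * out_price
        for model, (in_price, out_price) in prices.items()
    }
    best = min(costs, key=costs.get)
    return best, costs[best]


# Hypothetical workloads: same prompt volume, different completion volume.
input_heavy = cheapest(1_000_000, 50_000)
output_heavy = cheapest(1_000_000, 500_000)
print(input_heavy)
print(output_heavy)
```

With these two rows, the lower input price wins for the prompt-heavy workload and the lower output price wins once completions make up a larger share.
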

Provider: Qwen, Models: 46

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Qwen: Qwen2.5 7B Instruct qwen/qwen-2.5-7b-instruct | 33K | $0.040 | $0.100 | LOW |
Qwen: Qwen-Turbo qwen/qwen-turbo | 131K | $0.033 | $0.130 | LOW |
Qwen: Qwen3 235B A22B Instruct 2507 qwen/qwen3-235b-a22b-2507 | 262K | $0.071 | $0.100 | LOW |
Qwen: Qwen3.5-9B qwen/qwen3.5-9b | 256K | $0.050 | $0.150 | LOW |
Qwen: Qwen3 14B qwen/qwen3-14b | 41K | $0.060 | $0.240 | LOW |
Qwen: Qwen3 32B qwen/qwen3-32b | 41K | $0.080 | $0.240 | LOW |
Qwen: Qwen3.5-Flash qwen/qwen3.5-flash-02-23 | 1.0M | $0.065 | $0.260 | LOW |
Qwen: Qwen3 Coder 30B A3B Instruct qwen/qwen3-coder-30b-a3b-instruct | 160K | $0.070 | $0.270 | LOW |
Qwen: Qwen3 30B A3B qwen/qwen3-30b-a3b | 41K | $0.080 | $0.280 | LOW |
Qwen: Qwen3 30B A3B Instruct 2507 qwen/qwen3-30b-a3b-instruct-2507 | 262K | $0.090 | $0.300 | LOW |
Qwen: Qwen3 8B qwen/qwen3-8b | 41K | $0.050 | $0.400 | LOW |
Qwen: Qwen3 30B A3B Thinking 2507 qwen/qwen3-30b-a3b-thinking-2507 | 131K | $0.080 | $0.400 | LOW |
Qwen2.5 72B Instruct qwen/qwen-2.5-72b-instruct | 33K | $0.120 | $0.390 | LOW |
Qwen: Qwen3 VL 32B Instruct qwen/qwen3-vl-32b-instruct | 131K | $0.104 | $0.416 | LOW |
Qwen: Qwen VL Plus qwen/qwen-vl-plus | 131K | $0.137 | $0.410 | LOW |
Qwen: Qwen3 VL 8B Instruct qwen/qwen3-vl-8b-instruct | 131K | $0.080 | $0.500 | LOW |
Qwen: Qwen3 VL 30B A3B Instruct qwen/qwen3-vl-30b-a3b-instruct | 131K | $0.130 | $0.520 | LOW |
Qwen: QwQ 32B qwen/qwq-32b | 131K | $0.150 | $0.580 | LOW |
Qwen: Qwen2.5 VL 32B Instruct qwen/qwen2.5-vl-32b-instruct | 128K | $0.200 | $0.600 | LOW |
Qwen: Qwen3 Next 80B A3B Thinking qwen/qwen3-next-80b-a3b-thinking | 131K | $0.098 | $0.780 | LOW |
Qwen: Qwen3 Coder Next qwen/qwen3-coder-next | 262K | $0.150 | $0.800 | LOW |
Qwen: Qwen Plus 0728 (thinking) qwen/qwen-plus-2025-07-28:thinking | 1.0M | $0.260 | $0.780 | MED |
Qwen: Qwen Plus 0728 qwen/qwen-plus-2025-07-28 | 1.0M | $0.260 | $0.780 | MED |
Qwen: Qwen-Plus qwen/qwen-plus | 1.0M | $0.260 | $0.780 | MED |
Qwen: Qwen3 VL 235B A22B Instruct qwen/qwen3-vl-235b-a22b-instruct | 262K | $0.200 | $0.880 | MED |
Qwen: Qwen3 Coder Flash qwen/qwen3-coder-flash | 1.0M | $0.195 | $0.975 | MED |
Qwen: Qwen3 Next 80B A3B Instruct qwen/qwen3-next-80b-a3b-instruct | 262K | $0.090 | $1.10 | MED |
Qwen: Qwen3 Coder 480B A35B qwen/qwen3-coder | 262K | $0.220 | $1.00 | MED |
Qwen: Qwen3.5-35B-A3B qwen/qwen3.5-35b-a3b | 262K | $0.163 | $1.30 | MED |
Qwen: Qwen3 VL 8B Thinking qwen/qwen3-vl-8b-thinking | 131K | $0.117 | $1.36 | MED |
Qwen: Qwen2.5 VL 72B Instruct qwen/qwen2.5-vl-72b-instruct | 33K | $0.800 | $0.800 | MED |
Qwen: Qwen3 235B A22B Thinking 2507 qwen/qwen3-235b-a22b-thinking-2507 | 131K | $0.150 | $1.50 | MED |
Qwen2.5 Coder 32B Instruct qwen/qwen-2.5-coder-32b-instruct | 33K | $0.660 | $1.00 | MED |
Qwen: Qwen3 VL 30B A3B Thinking qwen/qwen3-vl-30b-a3b-thinking | 131K | $0.130 | $1.56 | MED |
Qwen: Qwen3.5-27B qwen/qwen3.5-27b | 262K | $0.195 | $1.56 | MED |
Qwen: Qwen3.5 Plus 2026-02-15 qwen/qwen3.5-plus-02-15 | 1.0M | $0.260 | $1.56 | MED |
Qwen: Qwen3.6 Plus qwen/qwen3.6-plus | 1.0M | $0.325 | $1.95 | MED |
Qwen: Qwen3 235B A22B qwen/qwen3-235b-a22b | 131K | $0.455 | $1.82 | MED |
Qwen: Qwen3.5-122B-A10B qwen/qwen3.5-122b-a10b | 262K | $0.260 | $2.08 | MED |
Qwen: Qwen VL Max qwen/qwen-vl-max | 131K | $0.520 | $2.08 | MED |
Qwen: Qwen3.5 397B A17B qwen/qwen3.5-397b-a17b | 262K | $0.390 | $2.34 | MED |
Qwen: Qwen3 VL 235B A22B Thinking qwen/qwen3-vl-235b-a22b-thinking | 131K | $0.260 | $2.60 | MED |
Qwen: Qwen3 Coder Plus qwen/qwen3-coder-plus | 1.0M | $0.650 | $3.25 | MED |
Qwen: Qwen3 Max Thinking qwen/qwen3-max-thinking | 262K | $0.780 | $3.90 | MED |
Qwen: Qwen3 Max qwen/qwen3-max | 262K | $0.780 | $3.90 | MED |
Qwen: Qwen-Max qwen/qwen-max | 33K | $1.04 | $4.16 | MED |

Provider: DeepSeek, Models: 11

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
DeepSeek: R1 Distill Qwen 32B deepseek/deepseek-r1-distill-qwen-32b | 33K | $0.290 | $0.290 | LOW |
DeepSeek: DeepSeek V3.2 deepseek/deepseek-v3.2 | 164K | $0.260 | $0.380 | LOW |
DeepSeek: DeepSeek V3.2 Exp deepseek/deepseek-v3.2-exp | 164K | $0.270 | $0.410 | LOW |
DeepSeek: DeepSeek V3.1 deepseek/deepseek-chat-v3.1 | 33K | $0.150 | $0.750 | LOW |
DeepSeek: DeepSeek V3 0324 deepseek/deepseek-chat-v3-0324 | 164K | $0.200 | $0.770 | LOW |
DeepSeek: DeepSeek V3.1 Terminus deepseek/deepseek-v3.1-terminus | 164K | $0.210 | $0.790 | LOW |
DeepSeek: DeepSeek V3 deepseek/deepseek-chat | 164K | $0.320 | $0.890 | MED |
DeepSeek: R1 Distill Llama 70B deepseek/deepseek-r1-distill-llama-70b | 131K | $0.700 | $0.800 | MED |
DeepSeek: DeepSeek V3.2 Speciale deepseek/deepseek-v3.2-speciale | 164K | $0.400 | $1.20 | MED |
DeepSeek: R1 0528 deepseek/deepseek-r1-0528 | 164K | $0.500 | $2.15 | MED |
DeepSeek: R1 deepseek/deepseek-r1 | 64K | $0.700 | $2.50 | MED |

Provider: Mistral, Models: 25

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Mistral: Mistral Nemo mistralai/mistral-nemo | 131K | $0.020 | $0.040 | LOW |
Mistral: Mistral Small 3 mistralai/mistral-small-24b-instruct-2501 | 33K | $0.050 | $0.080 | LOW |
Mistral: Ministral 3 3B 2512 mistralai/ministral-3b-2512 | 131K | $0.100 | $0.100 | LOW |
Mistral: Mistral Small 3.2 24B mistralai/mistral-small-3.2-24b-instruct | 128K | $0.075 | $0.200 | LOW |
Mistral: Ministral 3 8B 2512 mistralai/ministral-8b-2512 | 262K | $0.150 | $0.150 | LOW |
Mistral: Mistral 7B Instruct v0.1 mistralai/mistral-7b-instruct-v0.1 | 3K | $0.110 | $0.190 | LOW |
Mistral: Mistral Small Creative mistralai/mistral-small-creative | 33K | $0.100 | $0.300 | LOW |
Mistral: Ministral 3 14B 2512 mistralai/ministral-14b-2512 | 262K | $0.200 | $0.200 | LOW |
Mistral: Voxtral Small 24B 2507 mistralai/voxtral-small-24b-2507 | 32K | $0.100 | $0.300 | LOW |
Mistral: Devstral Small 1.1 mistralai/devstral-small | 131K | $0.100 | $0.300 | LOW |
Mistral: Mistral Small 4 mistralai/mistral-small-2603 | 262K | $0.150 | $0.600 | LOW |
Mistral: Saba mistralai/mistral-saba | 33K | $0.200 | $0.600 | LOW |
Mistral: Mistral Small 3.1 24B mistralai/mistral-small-3.1-24b-instruct | 128K | $0.350 | $0.560 | LOW |
Mistral: Mixtral 8x7B Instruct mistralai/mixtral-8x7b-instruct | 33K | $0.540 | $0.540 | MED |
Mistral: Codestral 2508 mistralai/codestral-2508 | 256K | $0.300 | $0.900 | MED |
Mistral: Mistral Large 3 2512 mistralai/mistral-large-2512 | 262K | $0.500 | $1.50 | MED |
Mistral: Devstral 2 2512 mistralai/devstral-2512 | 262K | $0.400 | $2.00 | MED |
Mistral: Mistral Medium 3.1 mistralai/mistral-medium-3.1 | 131K | $0.400 | $2.00 | MED |
Mistral: Devstral Medium mistralai/devstral-medium | 131K | $0.400 | $2.00 | MED |
Mistral: Mistral Medium 3 mistralai/mistral-medium-3 | 131K | $0.400 | $2.00 | MED |
Mistral Large 2411 mistralai/mistral-large-2411 | 131K | $2.00 | $6.00 | MED |
Mistral Large 2407 mistralai/mistral-large-2407 | 131K | $2.00 | $6.00 | MED |
Mistral: Pixtral Large 2411 mistralai/pixtral-large-2411 | 131K | $2.00 | $6.00 | MED |
Mistral: Mixtral 8x22B Instruct mistralai/mixtral-8x22b-instruct | 66K | $2.00 | $6.00 | MED |
Mistral Large mistralai/mistral-large | 128K | $2.00 | $6.00 | MED |

Provider: Cohere, Models: 4

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Cohere: Command R7B (12-2024) cohere/command-r7b-12-2024 | 128K | $0.037 | $0.150 | LOW |
Cohere: Command R (08-2024) cohere/command-r-08-2024 | 128K | $0.150 | $0.600 | LOW |
Cohere: Command A cohere/command-a | 256K | $2.50 | $10 | HIGH |
Cohere: Command R+ (08-2024) cohere/command-r-plus-08-2024 | 128K | $2.50 | $10 | HIGH |

Provider: MoonshotAI, Models: 4

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
MoonshotAI: Kimi K2.5 moonshotai/kimi-k2.5 | 262K | $0.383 | $1.72 | MED |
MoonshotAI: Kimi K2 0905 moonshotai/kimi-k2-0905 | 262K | $0.400 | $2.00 | MED |
MoonshotAI: Kimi K2 0711 moonshotai/kimi-k2 | 131K | $0.570 | $2.30 | MED |
MoonshotAI: Kimi K2 Thinking moonshotai/kimi-k2-thinking | 262K | $0.600 | $2.50 | MED |

Provider: ByteDance, Models: 1

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
ByteDance: UI-TARS 7B bytedance/ui-tars-1.5-7b | 128K | $0.100 | $0.200 | LOW |

Provider: DeepCogito, Models: 1

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Deep Cogito: Cogito v2.1 671B deepcogito/cogito-v2.1-671b | 128K | $1.25 | $1.25 | MED |

Provider: Baidu, Models: 5

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Baidu: ERNIE 4.5 21B A3B Thinking baidu/ernie-4.5-21b-a3b-thinking | 131K | $0.070 | $0.280 | LOW |
Baidu: ERNIE 4.5 21B A3B baidu/ernie-4.5-21b-a3b | 120K | $0.070 | $0.280 | LOW |
Baidu: ERNIE 4.5 VL 28B A3B baidu/ernie-4.5-vl-28b-a3b | 30K | $0.140 | $0.560 | LOW |
Baidu: ERNIE 4.5 300B A47B baidu/ernie-4.5-300b-a47b | 123K | $0.280 | $1.10 | MED |
Baidu: ERNIE 4.5 VL 424B A47B baidu/ernie-4.5-vl-424b-a47b | 123K | $0.420 | $1.25 | MED |

Provider: Z-AI, Models: 12

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Z.ai: GLM 4 32B z-ai/glm-4-32b | 128K | $0.100 | $0.100 | LOW |
Z.ai: GLM 4.7 Flash z-ai/glm-4.7-flash | 203K | $0.060 | $0.400 | LOW |
Z.ai: GLM 4.5 Air z-ai/glm-4.5-air | 131K | $0.130 | $0.850 | LOW |
Z.ai: GLM 4.6V z-ai/glm-4.6v | 131K | $0.300 | $0.900 | MED |
Z.ai: GLM 4.7 z-ai/glm-4.7 | 203K | $0.390 | $1.75 | MED |
Z.ai: GLM 4.6 z-ai/glm-4.6 | 205K | $0.390 | $1.90 | MED |
Z.ai: GLM 4.5V z-ai/glm-4.5v | 66K | $0.600 | $1.80 | MED |
Z.ai: GLM 4.5 z-ai/glm-4.5 | 131K | $0.600 | $2.20 | MED |
Z.ai: GLM 5 z-ai/glm-5 | 80K | $0.720 | $2.30 | MED |
Z.ai: GLM 5.1 z-ai/glm-5.1 | 203K | $0.950 | $3.15 | MED |
Z.ai: GLM 5V Turbo z-ai/glm-5v-turbo | 203K | $1.20 | $4.00 | MED |
Z.ai: GLM 5 Turbo z-ai/glm-5-turbo | 203K | $1.20 | $4.00 | MED |

Provider: Tencent, Models: 1

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Tencent: Hunyuan A13B Instruct tencent/hunyuan-a13b-instruct | 131K | $0.140 | $0.570 | LOW |

Provider: MiniMax, Models: 7

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
MiniMax: MiniMax M2.5 minimax/minimax-m2.5 | 197K | $0.118 | $0.990 | MED |
MiniMax: MiniMax M2.1 minimax/minimax-m2.1 | 197K | $0.290 | $0.950 | MED |
MiniMax: MiniMax M2 minimax/minimax-m2 | 197K | $0.255 | $1.00 | MED |
MiniMax: MiniMax-01 minimax/minimax-01 | 1.0M | $0.200 | $1.10 | MED |
MiniMax: MiniMax M2.7 minimax/minimax-m2.7 | 197K | $0.300 | $1.20 | MED |
MiniMax: MiniMax M2-her minimax/minimax-m2-her | 66K | $0.300 | $1.20 | MED |
MiniMax: MiniMax M1 minimax/minimax-m1 | 1.0M | $0.400 | $2.20 | MED |

Provider: Meta-Llama, Models: 12

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Meta: Llama 3.1 8B Instruct meta-llama/llama-3.1-8b-instruct | 16K | $0.020 | $0.050 | LOW |
Meta: Llama 3 8B Instruct meta-llama/llama-3-8b-instruct | 8K | $0.030 | $0.040 | LOW |
Meta: Llama 3.2 1B Instruct meta-llama/llama-3.2-1b-instruct | 60K | $0.027 | $0.200 | LOW |
Meta: Llama Guard 4 12B meta-llama/llama-guard-4-12b | 164K | $0.180 | $0.180 | LOW |
Meta: Llama 4 Scout meta-llama/llama-4-scout | 328K | $0.080 | $0.300 | LOW |
Meta: Llama 3.2 3B Instruct meta-llama/llama-3.2-3b-instruct | 80K | $0.051 | $0.340 | LOW |
Meta: Llama 3.3 70B Instruct meta-llama/llama-3.3-70b-instruct | 131K | $0.100 | $0.320 | LOW |
Meta: Llama 3.2 11B Vision Instruct meta-llama/llama-3.2-11b-vision-instruct | 131K | $0.245 | $0.245 | LOW |
Llama Guard 3 8B meta-llama/llama-guard-3-8b | 131K | $0.480 | $0.030 | LOW |
Meta: Llama 4 Maverick meta-llama/llama-4-maverick | 1.0M | $0.150 | $0.600 | LOW |
Meta: Llama 3.1 70B Instruct meta-llama/llama-3.1-70b-instruct | 131K | $0.400 | $0.400 | LOW |
Meta: Llama 3 70B Instruct meta-llama/llama-3-70b-instruct | 8K | $0.510 | $0.740 | MED |

Provider: Microsoft, Models: 2

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Microsoft: Phi 4 microsoft/phi-4 | 16K | $0.065 | $0.140 | LOW |
WizardLM-2 8x22B microsoft/wizardlm-2-8x22b | 66K | $0.620 | $0.620 | MED |

Provider: NVIDIA, Models: 6

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
NVIDIA: Nemotron Nano 9B V2 nvidia/nemotron-nano-9b-v2 | 131K | $0.040 | $0.160 | LOW |
NVIDIA: Nemotron 3 Nano 30B A3B nvidia/nemotron-3-nano-30b-a3b | 262K | $0.050 | $0.200 | LOW |
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 nvidia/llama-3.3-nemotron-super-49b-v1.5 | 131K | $0.100 | $0.400 | LOW |
NVIDIA: Nemotron 3 Super nvidia/nemotron-3-super-120b-a12b | 262K | $0.100 | $0.500 | LOW |
NVIDIA: Nemotron Nano 12B 2 VL nvidia/nemotron-nano-12b-v2-vl | 131K | $0.200 | $0.600 | LOW |
NVIDIA: Llama 3.1 Nemotron 70B Instruct nvidia/llama-3.1-nemotron-70b-instruct | 131K | $1.20 | $1.20 | MED |

Provider: Perplexity, Models: 5

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Perplexity: Sonar perplexity/sonar | 127K | $1.00 | $1.00 | MED |
Perplexity: Sonar Reasoning Pro perplexity/sonar-reasoning-pro | 128K | $2.00 | $8.00 | HIGH |
Perplexity: Sonar Deep Research perplexity/sonar-deep-research | 128K | $2.00 | $8.00 | HIGH |
Perplexity: Sonar Pro Search perplexity/sonar-pro-search | 200K | $3.00 | $15 | HIGH |
Perplexity: Sonar Pro perplexity/sonar-pro | 200K | $3.00 | $15 | HIGH |

Provider: Amazon, Models: 5

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Amazon: Nova Micro 1.0 amazon/nova-micro-v1 | 128K | $0.035 | $0.140 | LOW |
Amazon: Nova Lite 1.0 amazon/nova-lite-v1 | 300K | $0.060 | $0.240 | LOW |
Amazon: Nova 2 Lite amazon/nova-2-lite-v1 | 1.0M | $0.300 | $2.50 | MED |
Amazon: Nova Pro 1.0 amazon/nova-pro-v1 | 300K | $0.800 | $3.20 | MED |
Amazon: Nova Premier 1.0 amazon/nova-premier-v1 | 1.0M | $2.50 | $13 | HIGH |

Provider: arcee-ai, Models: 6

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Arcee AI: Trinity Mini arcee-ai/trinity-mini | 131K | $0.045 | $0.150 | LOW |
Arcee AI: Spotlight arcee-ai/spotlight | 131K | $0.180 | $0.180 | LOW |
Arcee AI: Trinity Large Thinking arcee-ai/trinity-large-thinking | 262K | $0.220 | $0.850 | MED |
Arcee AI: Coder Large arcee-ai/coder-large | 33K | $0.500 | $0.800 | MED |
Arcee AI: Virtuoso Large arcee-ai/virtuoso-large | 131K | $0.750 | $1.20 | MED |
Arcee AI: Maestro Reasoning arcee-ai/maestro-reasoning | 131K | $0.900 | $3.30 | MED |

Provider: kwaipilot, Models: 1

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Kwaipilot: KAT-Coder-Pro V2 kwaipilot/kat-coder-pro-v2 | 256K | $0.300 | $1.20 | MED |

Provider: rekaai, Models: 2

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Reka Edge rekaai/reka-edge | 16K | $0.100 | $0.100 | LOW |
Reka Flash 3 rekaai/reka-flash-3 | 66K | $0.100 | $0.200 | LOW |

Provider: xiaomi, Models: 3

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Xiaomi: MiMo-V2-Flash xiaomi/mimo-v2-flash | 262K | $0.090 | $0.290 | LOW |
Xiaomi: MiMo-V2-Omni xiaomi/mimo-v2-omni | 262K | $0.400 | $2.00 | MED |
Xiaomi: MiMo-V2-Pro xiaomi/mimo-v2-pro | 1.0M | $1.00 | $3.00 | MED |

Provider: bytedance-seed, Models: 4

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
ByteDance Seed: Seed 1.6 Flash bytedance-seed/seed-1.6-flash | 262K | $0.075 | $0.300 | LOW |
ByteDance Seed: Seed-2.0-Mini bytedance-seed/seed-2.0-mini | 262K | $0.100 | $0.400 | LOW |
ByteDance Seed: Seed-2.0-Lite bytedance-seed/seed-2.0-lite | 262K | $0.250 | $2.00 | MED |
ByteDance Seed: Seed 1.6 bytedance-seed/seed-1.6 | 262K | $0.250 | $2.00 | MED |

Provider: inception, Models: 1

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Inception: Mercury 2 inception/mercury-2 | 128K | $0.250 | $0.750 | MED |

Provider: liquid, Models: 1

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
LiquidAI: LFM2-24B-A2B liquid/lfm-2-24b-a2b | 33K | $0.030 | $0.120 | LOW |

Provider: aion-labs, Models: 4

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
AionLabs: Aion-1.0-Mini aion-labs/aion-1.0-mini | 131K | $0.700 | $1.40 | MED |
AionLabs: Aion-2.0 aion-labs/aion-2.0 | 131K | $0.800 | $1.60 | MED |
AionLabs: Aion-RP 1.0 (8B) aion-labs/aion-rp-llama-3.1-8b | 33K | $0.800 | $1.60 | MED |
AionLabs: Aion-1.0 aion-labs/aion-1.0 | 131K | $4.00 | $8.00 | HIGH |

Provider: stepfun, Models: 1

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
StepFun: Step 3.5 Flash stepfun/step-3.5-flash | 262K | $0.100 | $0.300 | LOW |

Provider: upstage, Models: 1

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Upstage: Solar Pro 3 upstage/solar-pro-3 | 128K | $0.150 | $0.600 | LOW |

Provider: writer, Models: 1

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Writer: Palmyra X5 writer/palmyra-x5 | 1.0M | $0.600 | $6.00 | MED |

Provider: allenai, Models: 3

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
AllenAI: Olmo 2 32B Instruct allenai/olmo-2-0325-32b-instruct | 128K | $0.050 | $0.200 | LOW |
AllenAI: Olmo 3 32B Think allenai/olmo-3-32b-think | 66K | $0.150 | $0.500 | LOW |
AllenAI: Olmo 3.1 32B Instruct allenai/olmo-3.1-32b-instruct | 66K | $0.200 | $0.600 | LOW |

Provider: relace, Models: 2

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Relace: Relace Apply 3 relace/relace-apply-3 | 256K | $0.850 | $1.25 | MED |
Relace: Relace Search relace/relace-search | 256K | $1.00 | $3.00 | MED |

Provider: nex-agi, Models: 1

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Nex AGI: DeepSeek V3.1 Nex N1 nex-agi/deepseek-v3.1-nex-n1 | 131K | $0.135 | $0.500 | LOW |

Provider: essentialai, Models: 1

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
EssentialAI: Rnj 1 Instruct essentialai/rnj-1-instruct | 33K | $0.150 | $0.150 | LOW |

Provider: prime-intellect, Models: 1

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Prime Intellect: INTELLECT-3 prime-intellect/intellect-3 | 131K | $0.200 | $1.10 | MED |

Provider: ibm-granite, Models: 1

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
IBM: Granite 4.0 Micro ibm-granite/granite-4.0-h-micro | 131K | $0.017 | $0.110 | LOW |

Provider: thedrummer, Models: 4

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
TheDrummer: Rocinante 12B thedrummer/rocinante-12b | 33K | $0.170 | $0.430 | LOW |
TheDrummer: UnslopNemo 12B thedrummer/unslopnemo-12b | 33K | $0.400 | $0.400 | LOW |
TheDrummer: Cydonia 24B V4.1 thedrummer/cydonia-24b-v4.1 | 131K | $0.300 | $0.500 | LOW |
TheDrummer: Skyfall 36B V2 thedrummer/skyfall-36b-v2 | 33K | $0.550 | $0.800 | MED |

Provider: alibaba, Models: 1

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Tongyi DeepResearch 30B A3B alibaba/tongyi-deepresearch-30b-a3b | 131K | $0.090 | $0.450 | LOW |

Provider: nousresearch, Models: 5

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
NousResearch: Hermes 2 Pro - Llama-3 8B nousresearch/hermes-2-pro-llama-3-8b | 8K | $0.140 | $0.140 | LOW |
Nous: Hermes 4 70B nousresearch/hermes-4-70b | 131K | $0.130 | $0.400 | LOW |
Nous: Hermes 3 70B Instruct nousresearch/hermes-3-llama-3.1-70b | 131K | $0.300 | $0.300 | LOW |
Nous: Hermes 3 405B Instruct nousresearch/hermes-3-llama-3.1-405b | 131K | $1.00 | $1.00 | MED |
Nous: Hermes 4 405B nousresearch/hermes-4-405b | 131K | $1.00 | $3.00 | MED |

Provider: ai21, Models: 1

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
AI21: Jamba Large 1.7 ai21/jamba-large-1.7 | 256K | $2.00 | $8.00 | HIGH |

Provider: switchpoint, Models: 1

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Switchpoint Router switchpoint/router | 131K | $0.850 | $3.40 | MED |

Provider: tngtech, Models: 1

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
TNG: DeepSeek R1T2 Chimera tngtech/deepseek-r1t2-chimera | 164K | $0.300 | $1.10 | MED |

Provider: morph, Models: 2

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Morph: Morph V3 Fast morph/morph-v3-fast | 82K | $0.800 | $1.20 | MED |
Morph: Morph V3 Large morph/morph-v3-large | 262K | $0.900 | $1.90 | MED |

Provider: alfredpros, Models: 1

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
AlfredPros: CodeLLaMa 7B Instruct Solidity alfredpros/codellama-7b-instruct-solidity | 4K | $0.800 | $1.20 | MED |

Provider: sao10k, Models: 5

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Sao10K: Llama 3 8B Lunaris sao10k/l3-lunaris-8b | 8K | $0.040 | $0.050 | LOW |
Sao10K: Llama 3.3 Euryale 70B sao10k/l3.3-euryale-70b | 131K | $0.650 | $0.750 | MED |
Sao10K: Llama 3.1 Euryale 70B v2.2 sao10k/l3.1-euryale-70b | 131K | $0.850 | $0.850 | MED |
Sao10K: Llama 3 Euryale 70B v2.1 sao10k/l3-euryale-70b | 8K | $1.48 | $1.48 | MED |
Sao10K: Llama 3.1 70B Hanami x1 sao10k/l3.1-70b-hanami-x1 | 16K | $3.00 | $3.00 | MED |

Provider: anthracite-org, Models: 1

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Magnum v4 72B anthracite-org/magnum-v4-72b | 16K | $3.00 | $5.00 | MED |

Provider: inflection, Models: 2

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Inflection: Inflection 3 Pi inflection/inflection-3-pi | 8K | $2.50 | $10 | HIGH |
Inflection: Inflection 3 Productivity inflection/inflection-3-productivity | 8K | $2.50 | $10 | HIGH |

Provider: alpindale, Models: 1

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Goliath 120B alpindale/goliath-120b | 6K | $3.75 | $7.50 | HIGH |

Provider: mancer, Models: 1

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Mancer: Weaver (alpha) mancer/weaver | 8K | $0.750 | $1.00 | MED |

Provider: undi95, Models: 1

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
ReMM SLERP 13B undi95/remm-slerp-l2-13b | 6K | $0.450 | $0.650 | MED |

Provider: gryphe, Models: 1

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
MythoMax 13B gryphe/mythomax-l2-13b | 4K | $0.060 | $0.060 | LOW |