LLM TOKEN
COST CALC
Compare token pricing across 311 LLM models from 53 AI providers
Total Models
311
Providers
53
Last Updated
10 minutes ago
Price data loaded successfully. Showing 311 models from 53 providers.
Provider
GPT
Models
55
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
OpenAI: gpt-oss-20b openai/gpt-oss-20b | 131K | $0.020 | $0.100 | LOW |
OpenAI: gpt-oss-120b openai/gpt-oss-120b | 131K | $0.039 | $0.190 | LOW |
OpenAI: gpt-oss-120b (exacto) openai/gpt-oss-120b:exacto | 131K | $0.039 | $0.190 | LOW |
OpenAI: gpt-oss-safeguard-20b openai/gpt-oss-safeguard-20b | 131K | $0.075 | $0.300 | LOW |
OpenAI: GPT-5 Nano openai/gpt-5-nano | 400K | $0.050 | $0.400 | LOW |
OpenAI: GPT-4.1 Nano openai/gpt-4.1-nano | 1M | $0.100 | $0.400 | LOW |
OpenAI: GPT-4o-mini Search Preview openai/gpt-4o-mini-search-preview | 128K | $0.150 | $0.600 | LOW |
OpenAI: GPT-4o-mini openai/gpt-4o-mini | 128K | $0.150 | $0.600 | LOW |
OpenAI: GPT-4o-mini (2024-07-18) openai/gpt-4o-mini-2024-07-18 | 128K | $0.150 | $0.600 | LOW |
OpenAI: GPT-4.1 Mini openai/gpt-4.1-mini | 1M | $0.400 | $1.60 | MED |
OpenAI: GPT-3.5 Turbo openai/gpt-3.5-turbo | 16K | $0.500 | $1.50 | MED |
OpenAI: GPT-5.1-Codex-Mini openai/gpt-5.1-codex-mini | 400K | $0.250 | $2.00 | MED |
OpenAI: GPT-5 Mini openai/gpt-5-mini | 400K | $0.250 | $2.00 | MED |
OpenAI: GPT-3.5 Turbo (older v0613) openai/gpt-3.5-turbo-0613 | 4K | $1.00 | $2.00 | MED |
OpenAI: GPT-3.5 Turbo Instruct openai/gpt-3.5-turbo-instruct | 4K | $1.50 | $2.00 | MED |
OpenAI: GPT-5 Image Mini openai/gpt-5-image-mini | 400K | $2.50 | $2.00 | MED |
OpenAI: o4 Mini High openai/o4-mini-high | 200K | $1.10 | $4.40 | MED |
OpenAI: o4 Mini openai/o4-mini | 200K | $1.10 | $4.40 | MED |
OpenAI: o3 Mini High openai/o3-mini-high | 200K | $1.10 | $4.40 | MED |
OpenAI: o3 Mini openai/o3-mini | 200K | $1.10 | $4.40 | MED |
OpenAI: GPT-3.5 Turbo 16k openai/gpt-3.5-turbo-16k | 16K | $3.00 | $4.00 | MED |
OpenAI: Codex Mini openai/codex-mini | 200K | $1.50 | $6.00 | MED |
OpenAI: o4 Mini Deep Research openai/o4-mini-deep-research | 200K | $2.00 | $8.00 | HIGH |
OpenAI: o3 openai/o3 | 200K | $2.00 | $8.00 | HIGH |
OpenAI: GPT-4.1 openai/gpt-4.1 | 1M | $2.00 | $8.00 | HIGH |
OpenAI: GPT-5.1-Codex-Max openai/gpt-5.1-codex-max | 400K | $1.25 | $10 | HIGH |
OpenAI: GPT-5.1 openai/gpt-5.1 | 400K | $1.25 | $10 | HIGH |
OpenAI: GPT-5.1 Chat openai/gpt-5.1-chat | 128K | $1.25 | $10 | HIGH |
OpenAI: GPT-5.1-Codex openai/gpt-5.1-codex | 400K | $1.25 | $10 | HIGH |
OpenAI: GPT-5 Codex openai/gpt-5-codex | 400K | $1.25 | $10 | HIGH |
OpenAI: GPT-5 Chat openai/gpt-5-chat | 128K | $1.25 | $10 | HIGH |
OpenAI: GPT-5 openai/gpt-5 | 400K | $1.25 | $10 | HIGH |
OpenAI: GPT-4o Audio openai/gpt-4o-audio-preview | 128K | $2.50 | $10 | HIGH |
OpenAI: GPT-4o Search Preview openai/gpt-4o-search-preview | 128K | $2.50 | $10 | HIGH |
OpenAI: GPT-4o (2024-11-20) openai/gpt-4o-2024-11-20 | 128K | $2.50 | $10 | HIGH |
OpenAI: GPT-4o (2024-08-06) openai/gpt-4o-2024-08-06 | 128K | $2.50 | $10 | HIGH |
OpenAI: GPT-4o openai/gpt-4o | 128K | $2.50 | $10 | HIGH |
OpenAI: GPT-5.2-Codex openai/gpt-5.2-codex | 400K | $1.75 | $14 | HIGH |
OpenAI: GPT-5.2 Chat openai/gpt-5.2-chat | 128K | $1.75 | $14 | HIGH |
OpenAI: GPT-5.2 openai/gpt-5.2 | 400K | $1.75 | $14 | HIGH |
OpenAI: GPT-5 Image openai/gpt-5-image | 400K | $10 | $10 | HIGH |
OpenAI: ChatGPT-4o openai/chatgpt-4o-latest | 128K | $5.00 | $15 | HIGH |
OpenAI: GPT-4o (2024-05-13) openai/gpt-4o-2024-05-13 | 128K | $5.00 | $15 | HIGH |
OpenAI: GPT-4o (extended) openai/gpt-4o:extended | 128K | $6.00 | $18 | HIGH |
OpenAI: GPT-4 Turbo openai/gpt-4-turbo | 128K | $10 | $30 | HIGH |
OpenAI: GPT-4 Turbo Preview openai/gpt-4-turbo-preview | 128K | $10 | $30 | HIGH |
OpenAI: GPT-4 Turbo (older v1106) openai/gpt-4-1106-preview | 128K | $10 | $30 | HIGH |
OpenAI: o3 Deep Research openai/o3-deep-research | 200K | $10 | $40 | HIGH |
OpenAI: o1 openai/o1 | 200K | $15 | $60 | HIGH |
OpenAI: GPT-4 openai/gpt-4 | 8K | $30 | $60 | HIGH |
OpenAI: GPT-4 (older v0314) openai/gpt-4-0314 | 8K | $30 | $60 | HIGH |
OpenAI: o3 Pro openai/o3-pro | 200K | $20 | $80 | HIGH |
OpenAI: GPT-5 Pro openai/gpt-5-pro | 400K | $15 | $120 | HIGH |
OpenAI: GPT-5.2 Pro openai/gpt-5.2-pro | 400K | $21 | $168 | HIGH |
OpenAI: o1-pro openai/o1-pro | 200K | $150 | $600 | HIGH |
Provider
Claude
Models
11
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Anthropic: Claude 3 Haiku anthropic/claude-3-haiku | 200K | $0.250 | $1.25 | MED |
Anthropic: Claude 3.5 Haiku anthropic/claude-3.5-haiku | 200K | $0.800 | $4.00 | MED |
Anthropic: Claude Haiku 4.5 anthropic/claude-haiku-4.5 | 200K | $1.00 | $5.00 | MED |
Anthropic: Claude Sonnet 4.5 anthropic/claude-sonnet-4.5 | 1M | $3.00 | $15 | HIGH |
Anthropic: Claude Sonnet 4 anthropic/claude-sonnet-4 | 1M | $3.00 | $15 | HIGH |
Anthropic: Claude 3.7 Sonnet (thinking) anthropic/claude-3.7-sonnet:thinking | 200K | $3.00 | $15 | HIGH |
Anthropic: Claude 3.7 Sonnet anthropic/claude-3.7-sonnet | 200K | $3.00 | $15 | HIGH |
Anthropic: Claude Opus 4.5 anthropic/claude-opus-4.5 | 200K | $5.00 | $25 | HIGH |
Anthropic: Claude 3.5 Sonnet anthropic/claude-3.5-sonnet | 200K | $6.00 | $30 | HIGH |
Anthropic: Claude Opus 4.1 anthropic/claude-opus-4.1 | 200K | $15 | $75 | HIGH |
Anthropic: Claude Opus 4 anthropic/claude-opus-4 | 200K | $15 | $75 | HIGH |
Provider
Gemini
Models
19
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Google: Gemma 3n 4B google/gemma-3n-e4b-it | 33K | $0.020 | $0.040 | LOW |
Google: Gemma 3 4B google/gemma-3-4b-it | 96K | $0.017 | $0.068 | LOW |
Google: Gemma 2 9B google/gemma-2-9b-it | 8K | $0.030 | $0.090 | LOW |
Google: Gemma 3 12B google/gemma-3-12b-it | 131K | $0.030 | $0.100 | LOW |
Google: Gemma 3 27B google/gemma-3-27b-it | 96K | $0.040 | $0.150 | LOW |
Google: Gemini 2.0 Flash Lite google/gemini-2.0-flash-lite-001 | 1M | $0.075 | $0.300 | LOW |
Google: Gemini 2.5 Flash Lite Preview 09-2025 google/gemini-2.5-flash-lite-preview-09-2025 | 1M | $0.100 | $0.400 | LOW |
Google: Gemini 2.5 Flash Lite google/gemini-2.5-flash-lite | 1M | $0.100 | $0.400 | LOW |
Google: Gemini 2.0 Flash google/gemini-2.0-flash-001 | 1M | $0.100 | $0.400 | LOW |
Google: Gemma 2 27B google/gemma-2-27b-it | 8K | $0.650 | $0.650 | MED |
Google: Gemini 2.5 Flash Image (Nano Banana) google/gemini-2.5-flash-image | 33K | $0.300 | $2.50 | MED |
Google: Gemini 2.5 Flash Image Preview (Nano Banana) google/gemini-2.5-flash-image-preview | 33K | $0.300 | $2.50 | MED |
Google: Gemini 2.5 Flash google/gemini-2.5-flash | 1M | $0.300 | $2.50 | MED |
Google: Gemini 3 Flash Preview google/gemini-3-flash-preview | 1M | $0.500 | $3.00 | MED |
Google: Gemini 2.5 Pro google/gemini-2.5-pro | 1M | $1.25 | $10 | HIGH |
Google: Gemini 2.5 Pro Preview 06-05 google/gemini-2.5-pro-preview | 1M | $1.25 | $10 | HIGH |
Google: Gemini 2.5 Pro Preview 05-06 google/gemini-2.5-pro-preview-05-06 | 1M | $1.25 | $10 | HIGH |
Google: Nano Banana Pro (Gemini 3 Pro Image Preview) google/gemini-3-pro-image-preview | 66K | $2.00 | $12 | HIGH |
Google: Gemini 3 Pro Preview google/gemini-3-pro-preview | 1M | $2.00 | $12 | HIGH |
Provider
Grok
Models
8
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
xAI: Grok 4.1 Fast x-ai/grok-4.1-fast | 2M | $0.200 | $0.500 | LOW |
xAI: Grok 4 Fast x-ai/grok-4-fast | 2M | $0.200 | $0.500 | LOW |
xAI: Grok 3 Mini x-ai/grok-3-mini | 131K | $0.300 | $0.500 | LOW |
xAI: Grok 3 Mini Beta x-ai/grok-3-mini-beta | 131K | $0.300 | $0.500 | LOW |
xAI: Grok Code Fast 1 x-ai/grok-code-fast-1 | 256K | $0.200 | $1.50 | MED |
xAI: Grok 4 x-ai/grok-4 | 256K | $3.00 | $15 | HIGH |
xAI: Grok 3 x-ai/grok-3 | 131K | $3.00 | $15 | HIGH |
xAI: Grok 3 Beta x-ai/grok-3-beta | 131K | $3.00 | $15 | HIGH |
Provider
Qwen
Models
39
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Qwen: Qwen2.5 Coder 7B Instruct qwen/qwen2.5-coder-7b-instruct | 33K | $0.030 | $0.090 | LOW |
Qwen: Qwen2.5 7B Instruct qwen/qwen-2.5-7b-instruct | 33K | $0.040 | $0.100 | LOW |
Qwen2.5 Coder 32B Instruct qwen/qwen-2.5-coder-32b-instruct | 33K | $0.030 | $0.110 | LOW |
Qwen: Qwen-Turbo qwen/qwen-turbo | 1M | $0.050 | $0.200 | LOW |
Qwen: Qwen3 14B qwen/qwen3-14b | 41K | $0.050 | $0.220 | LOW |
Qwen: Qwen2.5 VL 32B Instruct qwen/qwen2.5-vl-32b-instruct | 16K | $0.050 | $0.220 | LOW |
Qwen: Qwen3 30B A3B qwen/qwen3-30b-a3b | 41K | $0.060 | $0.220 | LOW |
Qwen: Qwen3 8B qwen/qwen3-8b | 32K | $0.050 | $0.250 | LOW |
Qwen: Qwen3 32B qwen/qwen3-32b | 41K | $0.080 | $0.240 | LOW |
Qwen: Qwen3 Coder 30B A3B Instruct qwen/qwen3-coder-30b-a3b-instruct | 160K | $0.070 | $0.270 | LOW |
Qwen: Qwen3 30B A3B Thinking 2507 qwen/qwen3-30b-a3b-thinking-2507 | 33K | $0.051 | $0.340 | LOW |
Qwen: Qwen2.5-VL 7B Instruct qwen/qwen-2.5-vl-7b-instruct | 33K | $0.200 | $0.200 | LOW |
Qwen: Qwen3 30B A3B Instruct 2507 qwen/qwen3-30b-a3b-instruct-2507 | 262K | $0.080 | $0.330 | LOW |
Qwen2.5 72B Instruct qwen/qwen-2.5-72b-instruct | 33K | $0.120 | $0.390 | LOW |
Qwen: Qwen3 235B A22B Instruct 2507 qwen/qwen3-235b-a22b-2507 | 262K | $0.071 | $0.463 | LOW |
Qwen: QwQ 32B qwen/qwq-32b | 33K | $0.150 | $0.400 | LOW |
Qwen: Qwen3 VL 8B Instruct qwen/qwen3-vl-8b-instruct | 131K | $0.080 | $0.500 | LOW |
Qwen: Qwen3 235B A22B Thinking 2507 qwen/qwen3-235b-a22b-thinking-2507 | 262K | $0.110 | $0.600 | LOW |
Qwen: Qwen3 235B A22B qwen/qwen3-235b-a22b | 41K | $0.180 | $0.540 | LOW |
Qwen: Qwen3 VL 30B A3B Instruct qwen/qwen3-vl-30b-a3b-instruct | 262K | $0.150 | $0.600 | LOW |
Qwen: Qwen2.5 VL 72B Instruct qwen/qwen2.5-vl-72b-instruct | 33K | $0.150 | $0.600 | LOW |
Qwen: Qwen VL Plus qwen/qwen-vl-plus | 8K | $0.210 | $0.630 | LOW |
Qwen: Qwen3 Coder 480B A35B qwen/qwen3-coder | 262K | $0.220 | $0.950 | MED |
Qwen: Qwen3 Next 80B A3B Instruct qwen/qwen3-next-80b-a3b-instruct | 262K | $0.090 | $1.10 | MED |
Qwen: Qwen3 VL 30B A3B Thinking qwen/qwen3-vl-30b-a3b-thinking | 131K | $0.200 | $1.00 | MED |
Qwen: Qwen3 Next 80B A3B Thinking qwen/qwen3-next-80b-a3b-thinking | 262K | $0.150 | $1.20 | MED |
Qwen: Qwen3 VL 235B A22B Instruct qwen/qwen3-vl-235b-a22b-instruct | 262K | $0.200 | $1.20 | MED |
Qwen: Qwen Plus 0728 qwen/qwen-plus-2025-07-28 | 1M | $0.400 | $1.20 | MED |
Qwen: Qwen-Plus qwen/qwen-plus | 131K | $0.400 | $1.20 | MED |
Qwen: Qwen3 Coder Flash qwen/qwen3-coder-flash | 128K | $0.300 | $1.50 | MED |
Qwen: Qwen3 VL 32B Instruct qwen/qwen3-vl-32b-instruct | 262K | $0.500 | $1.50 | MED |
Qwen: Qwen3 Coder 480B A35B (exacto) qwen/qwen3-coder:exacto | 262K | $0.220 | $1.80 | MED |
Qwen: Qwen3 VL 8B Thinking qwen/qwen3-vl-8b-thinking | 256K | $0.180 | $2.10 | MED |
Qwen: Qwen3 VL 235B A22B Thinking qwen/qwen3-vl-235b-a22b-thinking | 262K | $0.450 | $3.50 | MED |
Qwen: Qwen VL Max qwen/qwen-vl-max | 131K | $0.800 | $3.20 | MED |
Qwen: Qwen Plus 0728 (thinking) qwen/qwen-plus-2025-07-28:thinking | 1M | $0.400 | $4.00 | MED |
Qwen: Qwen3 Coder Plus qwen/qwen3-coder-plus | 128K | $1.00 | $5.00 | MED |
Qwen: Qwen3 Max qwen/qwen3-max | 256K | $1.20 | $6.00 | MED |
Qwen: Qwen-Max qwen/qwen-max | 33K | $1.60 | $6.40 | MED |
Provider
DeepSeek
Models
13
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
DeepSeek: R1 Distill Llama 70B deepseek/deepseek-r1-distill-llama-70b | 131K | $0.030 | $0.110 | LOW |
DeepSeek: DeepSeek V3.2 Exp deepseek/deepseek-v3.2-exp | 164K | $0.210 | $0.320 | LOW |
DeepSeek: R1 Distill Qwen 32B deepseek/deepseek-r1-distill-qwen-32b | 131K | $0.270 | $0.270 | LOW |
DeepSeek: DeepSeek V3.2 deepseek/deepseek-v3.2 | 164K | $0.250 | $0.380 | LOW |
DeepSeek: DeepSeek V3.2 Speciale deepseek/deepseek-v3.2-speciale | 164K | $0.270 | $0.410 | LOW |
DeepSeek: DeepSeek V3.1 deepseek/deepseek-chat-v3.1 | 33K | $0.150 | $0.750 | LOW |
DeepSeek: DeepSeek V3.1 Terminus (exacto) deepseek/deepseek-v3.1-terminus:exacto | 164K | $0.210 | $0.790 | LOW |
DeepSeek: DeepSeek V3.1 Terminus deepseek/deepseek-v3.1-terminus | 164K | $0.210 | $0.790 | LOW |
DeepSeek: DeepSeek V3 0324 deepseek/deepseek-chat-v3-0324 | 164K | $0.190 | $0.870 | MED |
DeepSeek: DeepSeek V3 deepseek/deepseek-chat | 164K | $0.300 | $1.20 | MED |
DeepSeek: R1 0528 deepseek/deepseek-r1-0528 | 131K | $0.450 | $2.15 | MED |
DeepSeek: DeepSeek Prover V2 deepseek/deepseek-prover-v2 | 164K | $0.500 | $2.18 | MED |
DeepSeek: R1 deepseek/deepseek-r1 | 164K | $0.700 | $2.40 | MED |
Provider
Mistral
Models
32
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Mistral: Mistral Nemo mistralai/mistral-nemo | 131K | $0.020 | $0.040 | LOW |
Mistral: Ministral 3B mistralai/ministral-3b | 131K | $0.040 | $0.040 | LOW |
Mistral: Mistral 7B Instruct mistralai/mistral-7b-instruct | 33K | $0.028 | $0.054 | LOW |
Mistral: Mistral Small 3.1 24B mistralai/mistral-small-3.1-24b-instruct | 131K | $0.030 | $0.110 | LOW |
Mistral: Mistral Small 3 mistralai/mistral-small-24b-instruct-2501 | 33K | $0.030 | $0.110 | LOW |
Mistral: Devstral Small 2505 mistralai/devstral-small-2505 | 128K | $0.060 | $0.120 | LOW |
Mistral: Ministral 3 3B 2512 mistralai/ministral-3b-2512 | 131K | $0.100 | $0.100 | LOW |
Mistral: Ministral 8B mistralai/ministral-8b | 131K | $0.100 | $0.100 | LOW |
Mistral: Pixtral 12B mistralai/pixtral-12b | 33K | $0.100 | $0.100 | LOW |
Mistral: Mistral Small 3.2 24B mistralai/mistral-small-3.2-24b-instruct | 131K | $0.060 | $0.180 | LOW |
Mistral: Devstral 2 2512 mistralai/devstral-2512 | 262K | $0.050 | $0.220 | LOW |
Mistral: Ministral 3 8B 2512 mistralai/ministral-8b-2512 | 262K | $0.150 | $0.150 | LOW |
Mistral: Mistral 7B Instruct v0.1 mistralai/mistral-7b-instruct-v0.1 | 3K | $0.110 | $0.190 | LOW |
Mistral: Devstral Small 1.1 mistralai/devstral-small | 128K | $0.070 | $0.280 | LOW |
Mistral: Mistral Small Creative mistralai/mistral-small-creative | 33K | $0.100 | $0.300 | LOW |
Mistral: Ministral 3 14B 2512 mistralai/ministral-14b-2512 | 262K | $0.200 | $0.200 | LOW |
Mistral: Voxtral Small 24B 2507 mistralai/voxtral-small-24b-2507 | 32K | $0.100 | $0.300 | LOW |
Mistral: Mistral 7B Instruct v0.3 mistralai/mistral-7b-instruct-v0.3 | 33K | $0.200 | $0.200 | LOW |
Mistral: Mistral 7B Instruct v0.2 mistralai/mistral-7b-instruct-v0.2 | 33K | $0.200 | $0.200 | LOW |
Mistral Tiny mistralai/mistral-tiny | 33K | $0.250 | $0.250 | LOW |
Mistral: Saba mistralai/mistral-saba | 33K | $0.200 | $0.600 | LOW |
Mistral: Mixtral 8x7B Instruct mistralai/mixtral-8x7b-instruct | 33K | $0.540 | $0.540 | MED |
Mistral: Codestral 2508 mistralai/codestral-2508 | 256K | $0.300 | $0.900 | MED |
Mistral: Mistral Large 3 2512 mistralai/mistral-large-2512 | 262K | $0.500 | $1.50 | MED |
Mistral: Mistral Medium 3.1 mistralai/mistral-medium-3.1 | 131K | $0.400 | $2.00 | MED |
Mistral: Devstral Medium mistralai/devstral-medium | 131K | $0.400 | $2.00 | MED |
Mistral: Mistral Medium 3 mistralai/mistral-medium-3 | 131K | $0.400 | $2.00 | MED |
Mistral Large 2411 mistralai/mistral-large-2411 | 131K | $2.00 | $6.00 | MED |
Mistral Large 2407 mistralai/mistral-large-2407 | 131K | $2.00 | $6.00 | MED |
Mistral: Pixtral Large 2411 mistralai/pixtral-large-2411 | 131K | $2.00 | $6.00 | MED |
Mistral: Mixtral 8x22B Instruct mistralai/mixtral-8x22b-instruct | 66K | $2.00 | $6.00 | MED |
Mistral Large mistralai/mistral-large | 128K | $2.00 | $6.00 | MED |
Provider
Cohere
Models
4
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Cohere: Command R7B (12-2024) cohere/command-r7b-12-2024 | 128K | $0.037 | $0.150 | LOW |
Cohere: Command R (08-2024) cohere/command-r-08-2024 | 128K | $0.150 | $0.600 | LOW |
Cohere: Command A cohere/command-a | 256K | $2.50 | $10 | HIGH |
Cohere: Command R+ (08-2024) cohere/command-r-plus-08-2024 | 128K | $2.50 | $10 | HIGH |
Provider
MoonshotAI
Models
5
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
MoonshotAI: Kimi Dev 72B moonshotai/kimi-dev-72b | 131K | $0.290 | $1.15 | MED |
MoonshotAI: Kimi K2 Thinking moonshotai/kimi-k2-thinking | 262K | $0.400 | $1.75 | MED |
MoonshotAI: Kimi K2 0905 moonshotai/kimi-k2-0905 | 262K | $0.390 | $1.90 | MED |
MoonshotAI: Kimi K2 0711 moonshotai/kimi-k2 | 131K | $0.500 | $2.40 | MED |
MoonshotAI: Kimi K2 0905 (exacto) moonshotai/kimi-k2-0905:exacto | 262K | $0.600 | $2.50 | MED |
Provider
ByteDance
Models
1
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
ByteDance: UI-TARS 7B bytedance/ui-tars-1.5-7b | 128K | $0.100 | $0.200 | LOW |
Provider
DeepCogito
Models
4
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Cogito V2 Preview Llama 109B deepcogito/cogito-v2-preview-llama-109b-moe | 33K | $0.180 | $0.590 | LOW |
Deep Cogito: Cogito V2 Preview Llama 70B deepcogito/cogito-v2-preview-llama-70b | 33K | $0.880 | $0.880 | MED |
Deep Cogito: Cogito v2.1 671B deepcogito/cogito-v2.1-671b | 128K | $1.25 | $1.25 | MED |
Deep Cogito: Cogito V2 Preview Llama 405B deepcogito/cogito-v2-preview-llama-405b | 33K | $3.50 | $3.50 | MED |
Provider
Baidu
Models
5
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Baidu: ERNIE 4.5 21B A3B Thinking baidu/ernie-4.5-21b-a3b-thinking | 131K | $0.070 | $0.280 | LOW |
Baidu: ERNIE 4.5 21B A3B baidu/ernie-4.5-21b-a3b | 120K | $0.070 | $0.280 | LOW |
Baidu: ERNIE 4.5 VL 28B A3B baidu/ernie-4.5-vl-28b-a3b | 30K | $0.140 | $0.560 | LOW |
Baidu: ERNIE 4.5 300B A47B baidu/ernie-4.5-300b-a47b | 123K | $0.280 | $1.10 | MED |
Baidu: ERNIE 4.5 VL 424B A47B baidu/ernie-4.5-vl-424b-a47b | 123K | $0.420 | $1.25 | MED |
Provider
Z-AI
Models
8
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Z.AI: GLM 4 32B z-ai/glm-4-32b | 128K | $0.100 | $0.100 | LOW |
Z.AI: GLM 4.5 Air z-ai/glm-4.5-air | 131K | $0.050 | $0.220 | LOW |
Z.AI: GLM 4.6V z-ai/glm-4.6v | 131K | $0.300 | $0.900 | MED |
Z.AI: GLM 4.6 z-ai/glm-4.6 | 203K | $0.350 | $1.50 | MED |
Z.AI: GLM 4.7 z-ai/glm-4.7 | 203K | $0.400 | $1.50 | MED |
Z.AI: GLM 4.5 z-ai/glm-4.5 | 131K | $0.350 | $1.55 | MED |
Z.AI: GLM 4.6 (exacto) z-ai/glm-4.6:exacto | 205K | $0.440 | $1.76 | MED |
Z.AI: GLM 4.5V z-ai/glm-4.5v | 66K | $0.600 | $1.80 | MED |
Provider
Tencent
Models
1
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Tencent: Hunyuan A13B Instruct tencent/hunyuan-a13b-instruct | 131K | $0.140 | $0.570 | LOW |
Provider
MiniMax
Models
4
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
MiniMax: MiniMax M2 minimax/minimax-m2 | 197K | $0.200 | $1.00 | MED |
MiniMax: MiniMax-01 minimax/minimax-01 | 1M | $0.200 | $1.10 | MED |
MiniMax: MiniMax M2.1 minimax/minimax-m2.1 | 197K | $0.270 | $1.12 | MED |
MiniMax: MiniMax M1 minimax/minimax-m1 | 1M | $0.400 | $2.20 | MED |
Provider
Meta-Llama
Models
16
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Meta: Llama 3.2 3B Instruct meta-llama/llama-3.2-3b-instruct | 131K | $0.020 | $0.020 | LOW |
Meta: Llama 3.1 8B Instruct meta-llama/llama-3.1-8b-instruct | 16K | $0.020 | $0.050 | LOW |
Llama Guard 3 8B meta-llama/llama-guard-3-8b | 131K | $0.020 | $0.060 | LOW |
Meta: Llama 3 8B Instruct meta-llama/llama-3-8b-instruct | 8K | $0.030 | $0.060 | LOW |
Meta: Llama 3.2 11B Vision Instruct meta-llama/llama-3.2-11b-vision-instruct | 131K | $0.049 | $0.049 | LOW |
Meta: Llama 3.2 1B Instruct meta-llama/llama-3.2-1b-instruct | 60K | $0.027 | $0.200 | LOW |
Meta: Llama Guard 4 12B meta-llama/llama-guard-4-12b | 164K | $0.180 | $0.180 | LOW |
Meta: Llama 4 Scout meta-llama/llama-4-scout | 328K | $0.080 | $0.300 | LOW |
Meta: LlamaGuard 2 8B meta-llama/llama-guard-2-8b | 8K | $0.200 | $0.200 | LOW |
Meta: Llama 3.3 70B Instruct meta-llama/llama-3.3-70b-instruct | 131K | $0.100 | $0.320 | LOW |
Meta: Llama 3 70B Instruct meta-llama/llama-3-70b-instruct | 8K | $0.300 | $0.400 | LOW |
Meta: Llama 4 Maverick meta-llama/llama-4-maverick | 1M | $0.150 | $0.600 | LOW |
Meta: Llama 3.2 90B Vision Instruct meta-llama/llama-3.2-90b-vision-instruct | 33K | $0.350 | $0.400 | LOW |
Meta: Llama 3.1 70B Instruct meta-llama/llama-3.1-70b-instruct | 131K | $0.400 | $0.400 | LOW |
Meta: Llama 3.1 405B Instruct meta-llama/llama-3.1-405b-instruct | 10K | $3.50 | $3.50 | MED |
Meta: Llama 3.1 405B (base) meta-llama/llama-3.1-405b | 33K | $4.00 | $4.00 | MED |
Provider
Microsoft
Models
4
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Microsoft: Phi 4 Multimodal Instruct microsoft/phi-4-multimodal-instruct | 131K | $0.050 | $0.100 | LOW |
Microsoft: Phi 4 microsoft/phi-4 | 16K | $0.060 | $0.140 | LOW |
Microsoft: Phi 4 Reasoning Plus microsoft/phi-4-reasoning-plus | 33K | $0.070 | $0.350 | LOW |
WizardLM-2 8x22B microsoft/wizardlm-2-8x22b | 66K | $0.480 | $0.480 | LOW |
Provider
NVIDIA
Models
6
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
NVIDIA: Nemotron Nano 9B V2 nvidia/nemotron-nano-9b-v2 | 131K | $0.040 | $0.160 | LOW |
NVIDIA: Nemotron 3 Nano 30B A3B nvidia/nemotron-3-nano-30b-a3b | 262K | $0.060 | $0.240 | LOW |
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 nvidia/llama-3.3-nemotron-super-49b-v1.5 | 131K | $0.100 | $0.400 | LOW |
NVIDIA: Nemotron Nano 12B 2 VL nvidia/nemotron-nano-12b-v2-vl | 131K | $0.200 | $0.600 | LOW |
NVIDIA: Llama 3.1 Nemotron Ultra 253B v1 nvidia/llama-3.1-nemotron-ultra-253b-v1 | 131K | $0.600 | $1.80 | MED |
NVIDIA: Llama 3.1 Nemotron 70B Instruct nvidia/llama-3.1-nemotron-70b-instruct | 131K | $1.20 | $1.20 | MED |
Provider
Perplexity
Models
5
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Perplexity: Sonar perplexity/sonar | 127K | $1.00 | $1.00 | MED |
Perplexity: Sonar Reasoning Pro perplexity/sonar-reasoning-pro | 128K | $2.00 | $8.00 | HIGH |
Perplexity: Sonar Deep Research perplexity/sonar-deep-research | 128K | $2.00 | $8.00 | HIGH |
Perplexity: Sonar Pro Search perplexity/sonar-pro-search | 200K | $3.00 | $15 | HIGH |
Perplexity: Sonar Pro perplexity/sonar-pro | 200K | $3.00 | $15 | HIGH |
Provider
Amazon
Models
5
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Amazon: Nova Micro 1.0 amazon/nova-micro-v1 | 128K | $0.035 | $0.140 | LOW |
Amazon: Nova Lite 1.0 amazon/nova-lite-v1 | 300K | $0.060 | $0.240 | LOW |
Amazon: Nova 2 Lite amazon/nova-2-lite-v1 | 1M | $0.300 | $2.50 | MED |
Amazon: Nova Pro 1.0 amazon/nova-pro-v1 | 300K | $0.800 | $3.20 | MED |
Amazon: Nova Premier 1.0 amazon/nova-premier-v1 | 1M | $2.50 | $13 | HIGH |
Provider
allenai
Models
6
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
AllenAI: Olmo 2 32B Instruct allenai/olmo-2-0325-32b-instruct | 128K | $0.050 | $0.200 | LOW |
AllenAI: Olmo 3 7B Instruct allenai/olmo-3-7b-instruct | 66K | $0.100 | $0.200 | LOW |
AllenAI: Olmo 3 7B Think allenai/olmo-3-7b-think | 66K | $0.120 | $0.200 | LOW |
AllenAI: Olmo 3.1 32B Think allenai/olmo-3.1-32b-think | 66K | $0.150 | $0.500 | LOW |
AllenAI: Olmo 3 32B Think allenai/olmo-3-32b-think | 66K | $0.150 | $0.500 | LOW |
AllenAI: Olmo 3.1 32B Instruct allenai/olmo-3.1-32b-instruct | 66K | $0.200 | $0.600 | LOW |
Provider
bytedance-seed
Models
2
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
ByteDance Seed: Seed 1.6 Flash bytedance-seed/seed-1.6-flash | 262K | $0.075 | $0.300 | LOW |
ByteDance Seed: Seed 1.6 bytedance-seed/seed-1.6 | 262K | $0.250 | $2.00 | MED |
Provider
relace
Models
2
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Relace: Relace Apply 3 relace/relace-apply-3 | 256K | $0.850 | $1.25 | MED |
Relace: Relace Search relace/relace-search | 256K | $1.00 | $3.00 | MED |
Provider
nex-agi
Models
1
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Nex AGI: DeepSeek V3.1 Nex N1 nex-agi/deepseek-v3.1-nex-n1 | 131K | $0.270 | $1.00 | MED |
Provider
essentialai
Models
1
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
EssentialAI: Rnj 1 Instruct essentialai/rnj-1-instruct | 33K | $0.150 | $0.150 | LOW |
Provider
arcee-ai
Models
5
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Arcee AI: Trinity Mini arcee-ai/trinity-mini | 131K | $0.045 | $0.150 | LOW |
Arcee AI: Spotlight arcee-ai/spotlight | 131K | $0.180 | $0.180 | LOW |
Arcee AI: Coder Large arcee-ai/coder-large | 33K | $0.500 | $0.800 | MED |
Arcee AI: Virtuoso Large arcee-ai/virtuoso-large | 131K | $0.750 | $1.20 | MED |
Arcee AI: Maestro Reasoning arcee-ai/maestro-reasoning | 131K | $0.900 | $3.30 | MED |
Provider
prime-intellect
Models
1
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Prime Intellect: INTELLECT-3 prime-intellect/intellect-3 | 131K | $0.200 | $1.10 | MED |
Provider
tngtech
Models
3
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
TNG: R1T Chimera tngtech/tng-r1t-chimera | 164K | $0.250 | $0.850 | MED |
TNG: DeepSeek R1T2 Chimera tngtech/deepseek-r1t2-chimera | 164K | $0.250 | $0.850 | MED |
TNG: DeepSeek R1T Chimera tngtech/deepseek-r1t-chimera | 164K | $0.300 | $1.20 | MED |
Provider
kwaipilot
Models
1
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Kwaipilot: KAT-Coder-Pro V1 kwaipilot/kat-coder-pro | 256K | $0.207 | $0.828 | MED |
Provider
liquid
Models
2
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
LiquidAI/LFM2-8B-A1B liquid/lfm2-8b-a1b | 33K | $0.010 | $0.020 | LOW |
LiquidAI/LFM2-2.6B liquid/lfm-2.2-6b | 33K | $0.010 | $0.020 | LOW |
Provider
ibm-granite
Models
1
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
IBM: Granite 4.0 Micro ibm-granite/granite-4.0-h-micro | 131K | $0.017 | $0.110 | LOW |
Provider
thedrummer
Models
4
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
TheDrummer: Rocinante 12B thedrummer/rocinante-12b | 33K | $0.170 | $0.430 | LOW |
TheDrummer: UnslopNemo 12B thedrummer/unslopnemo-12b | 33K | $0.400 | $0.400 | LOW |
TheDrummer: Cydonia 24B V4.1 thedrummer/cydonia-24b-v4.1 | 131K | $0.300 | $0.500 | LOW |
TheDrummer: Skyfall 36B V2 thedrummer/skyfall-36b-v2 | 33K | $0.550 | $0.800 | MED |
Provider
alibaba
Models
1
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Tongyi DeepResearch 30B A3B alibaba/tongyi-deepresearch-30b-a3b | 131K | $0.090 | $0.400 | LOW |
Provider
opengvlab
Models
1
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
OpenGVLab: InternVL3 78B opengvlab/internvl3-78b | 33K | $0.100 | $0.390 | LOW |
Provider
meituan
Models
1
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Meituan: LongCat Flash Chat meituan/longcat-flash-chat | 131K | $0.200 | $0.800 | LOW |
Provider
stepfun-ai
Models
1
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
StepFun: Step3 stepfun-ai/step3 | 66K | $0.570 | $1.42 | MED |
Provider
nousresearch
Models
6
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
NousResearch: Hermes 2 Pro - Llama-3 8B nousresearch/hermes-2-pro-llama-3-8b | 8K | $0.025 | $0.080 | LOW |
Nous: DeepHermes 3 Mistral 24B Preview nousresearch/deephermes-3-mistral-24b-preview | 33K | $0.020 | $0.100 | LOW |
Nous: Hermes 4 70B nousresearch/hermes-4-70b | 131K | $0.110 | $0.380 | LOW |
Nous: Hermes 3 70B Instruct nousresearch/hermes-3-llama-3.1-70b | 66K | $0.300 | $0.300 | LOW |
Nous: Hermes 3 405B Instruct nousresearch/hermes-3-llama-3.1-405b | 131K | $1.00 | $1.00 | MED |
Nous: Hermes 4 405B nousresearch/hermes-4-405b | 131K | $1.00 | $3.00 | MED |
Provider
ai21
Models
2
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
AI21: Jamba Mini 1.7 ai21/jamba-mini-1.7 | 256K | $0.200 | $0.400 | LOW |
AI21: Jamba Large 1.7 ai21/jamba-large-1.7 | 256K | $2.00 | $8.00 | HIGH |
Provider
switchpoint
Models
1
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Switchpoint Router switchpoint/router | 131K | $0.850 | $3.40 | MED |
Provider
morph
Models
2
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Morph: Morph V3 Fast morph/morph-v3-fast | 82K | $0.800 | $1.20 | MED |
Morph: Morph V3 Large morph/morph-v3-large | 262K | $0.900 | $1.90 | MED |
Provider
inception
Models
2
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Inception: Mercury inception/mercury | 128K | $0.250 | $1.00 | MED |
Inception: Mercury Coder inception/mercury-coder | 128K | $0.250 | $1.00 | MED |
Provider
eleutherai
Models
1
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
EleutherAI: Llemma 7b eleutherai/llemma_7b | 4K | $0.800 | $1.20 | MED |
Provider
alfredpros
Models
1
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
AlfredPros: CodeLLaMa 7B Instruct Solidity alfredpros/codellama-7b-instruct-solidity | 4K | $0.800 | $1.20 | MED |
Provider
aion-labs
Models
3
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
AionLabs: Aion-1.0-Mini aion-labs/aion-1.0-mini | 131K | $0.700 | $1.40 | MED |
AionLabs: Aion-RP 1.0 (8B) aion-labs/aion-rp-llama-3.1-8b | 33K | $0.800 | $1.60 | MED |
AionLabs: Aion-1.0 aion-labs/aion-1.0 | 131K | $4.00 | $8.00 | HIGH |
Provider
sao10k
Models
5
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Sao10K: Llama 3 8B Lunaris sao10k/l3-lunaris-8b | 8K | $0.040 | $0.050 | LOW |
Sao10K: Llama 3.3 Euryale 70B sao10k/l3.3-euryale-70b | 131K | $0.650 | $0.750 | MED |
Sao10K: Llama 3.1 Euryale 70B v2.2 sao10k/l3.1-euryale-70b | 33K | $0.650 | $0.750 | MED |
Sao10k: Llama 3 Euryale 70B v2.1 sao10k/l3-euryale-70b | 8K | $1.48 | $1.48 | MED |
Sao10K: Llama 3.1 70B Hanami x1 sao10k/l3.1-70b-hanami-x1 | 16K | $3.00 | $3.00 | MED |
Provider
raifle
Models
1
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
SorcererLM 8x22B raifle/sorcererlm-8x22b | 16K | $4.50 | $4.50 | MED |
Provider
anthracite-org
Models
1
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Magnum v4 72B anthracite-org/magnum-v4-72b | 16K | $3.00 | $5.00 | MED |
Provider
inflection
Models
2
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Inflection: Inflection 3 Pi inflection/inflection-3-pi | 8K | $2.50 | $10 | HIGH |
Inflection: Inflection 3 Productivity inflection/inflection-3-productivity | 8K | $2.50 | $10 | HIGH |
Provider
neversleep
Models
2
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
NeverSleep: Lumimaid v0.2 8B neversleep/llama-3.1-lumimaid-8b | 33K | $0.090 | $0.600 | LOW |
Noromaid 20B neversleep/noromaid-20b | 4K | $1.00 | $1.75 | MED |
Provider
alpindale
Models
1
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Goliath 120B alpindale/goliath-120b | 6K | $6.00 | $8.00 | HIGH |
Provider
mancer
Models
1
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Mancer: Weaver (alpha) mancer/weaver | 8K | $0.750 | $1.00 | MED |
Provider
undi95
Models
1
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
ReMM SLERP 13B undi95/remm-slerp-l2-13b | 6K | $0.450 | $0.650 | MED |
Provider
gryphe
Models
1
| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
MythoMax 13B gryphe/mythomax-l2-13b | 4K | $0.060 | $0.060 | LOW |