LLM TOKEN COST CALC
Compare token pricing across 301 LLM models from 51 AI providers
Total Models: 301
Providers: 51
Last Updated: 33 minutes ago
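
Prices in the tables are quoted in US dollars per million tokens, so the cost of a request is each token count divided by one million and multiplied by the matching price. A minimal sketch of that arithmetic follows; the function name `request_cost` and the token counts are illustrative assumptions, and the prices are the GPT-4o-mini row ($0.150 input, $0.600 output).

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)


def request_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Return the USD cost of one request given per-million-token prices.

    Prices are passed as strings so Decimal keeps them exact.
    """
    input_cost = Decimal(input_tokens) / TOKENS_PER_MILLION * Decimal(input_price_per_m)
    output_cost = Decimal(output_tokens) / TOKENS_PER_MILLION * Decimal(output_price_per_m)
    return input_cost + output_cost


# Hypothetical request: 12,000 prompt tokens and 800 completion tokens
# at the GPT-4o-mini prices listed in the table.
cost = request_cost(12_000, 800, "0.150", "0.600")
print(f"${cost:.6f}")  # prints $0.002280
```

`Decimal` is used instead of `float` so that small per-request amounts do not pick up binary rounding noise when summed over many requests.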

Provider: GPT (50 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
OpenAI: gpt-oss-20b openai/gpt-oss-20b | 131K | $0.030 | $0.140 | LOW |
OpenAI: gpt-oss-120b (exacto) openai/gpt-oss-120b:exacto | 131K | $0.040 | $0.200 | LOW |
OpenAI: gpt-oss-120b openai/gpt-oss-120b | 131K | $0.040 | $0.200 | LOW |
OpenAI: gpt-oss-safeguard-20b openai/gpt-oss-safeguard-20b | 131K | $0.075 | $0.300 | LOW |
OpenAI: GPT-5 Nano openai/gpt-5-nano | 400K | $0.050 | $0.400 | LOW |
OpenAI: GPT-4.1 Nano openai/gpt-4.1-nano | 1M | $0.100 | $0.400 | LOW |
OpenAI: GPT-4o-mini Search Preview openai/gpt-4o-mini-search-preview | 128K | $0.150 | $0.600 | LOW |
OpenAI: GPT-4o-mini openai/gpt-4o-mini | 128K | $0.150 | $0.600 | LOW |
OpenAI: GPT-4o-mini (2024-07-18) openai/gpt-4o-mini-2024-07-18 | 128K | $0.150 | $0.600 | LOW |
OpenAI: GPT-4.1 Mini openai/gpt-4.1-mini | 1M | $0.400 | $1.60 | MED |
OpenAI: GPT-3.5 Turbo openai/gpt-3.5-turbo | 16K | $0.500 | $1.50 | MED |
OpenAI: GPT-5.1-Codex-Mini openai/gpt-5.1-codex-mini | 400K | $0.250 | $2.00 | MED |
OpenAI: GPT-5 Mini openai/gpt-5-mini | 400K | $0.250 | $2.00 | MED |
OpenAI: GPT-3.5 Turbo (older v0613) openai/gpt-3.5-turbo-0613 | 4K | $1.00 | $2.00 | MED |
OpenAI: GPT-3.5 Turbo Instruct openai/gpt-3.5-turbo-instruct | 4K | $1.50 | $2.00 | MED |
OpenAI: GPT-5 Image Mini openai/gpt-5-image-mini | 400K | $2.50 | $2.00 | MED |
OpenAI: o4 Mini High openai/o4-mini-high | 200K | $1.10 | $4.40 | MED |
OpenAI: o4 Mini openai/o4-mini | 200K | $1.10 | $4.40 | MED |
OpenAI: o3 Mini High openai/o3-mini-high | 200K | $1.10 | $4.40 | MED |
OpenAI: o3 Mini openai/o3-mini | 200K | $1.10 | $4.40 | MED |
OpenAI: GPT-3.5 Turbo 16k openai/gpt-3.5-turbo-16k | 16K | $3.00 | $4.00 | MED |
OpenAI: Codex Mini openai/codex-mini | 200K | $1.50 | $6.00 | MED |
OpenAI: o4 Mini Deep Research openai/o4-mini-deep-research | 200K | $2.00 | $8.00 | HIGH |
OpenAI: o3 openai/o3 | 200K | $2.00 | $8.00 | HIGH |
OpenAI: GPT-4.1 openai/gpt-4.1 | 1M | $2.00 | $8.00 | HIGH |
OpenAI: GPT-5.1 openai/gpt-5.1 | 400K | $1.25 | $10 | HIGH |
OpenAI: GPT-5.1 Chat openai/gpt-5.1-chat | 128K | $1.25 | $10 | HIGH |
OpenAI: GPT-5.1-Codex openai/gpt-5.1-codex | 400K | $1.25 | $10 | HIGH |
OpenAI: GPT-5 Codex openai/gpt-5-codex | 400K | $1.25 | $10 | HIGH |
OpenAI: GPT-5 Chat openai/gpt-5-chat | 128K | $1.25 | $10 | HIGH |
OpenAI: GPT-5 openai/gpt-5 | 400K | $1.25 | $10 | HIGH |
OpenAI: GPT-4o Audio openai/gpt-4o-audio-preview | 128K | $2.50 | $10 | HIGH |
OpenAI: GPT-4o Search Preview openai/gpt-4o-search-preview | 128K | $2.50 | $10 | HIGH |
OpenAI: GPT-4o (2024-11-20) openai/gpt-4o-2024-11-20 | 128K | $2.50 | $10 | HIGH |
OpenAI: GPT-4o (2024-08-06) openai/gpt-4o-2024-08-06 | 128K | $2.50 | $10 | HIGH |
OpenAI: GPT-4o openai/gpt-4o | 128K | $2.50 | $10 | HIGH |
OpenAI: GPT-5 Image openai/gpt-5-image | 400K | $10 | $10 | HIGH |
OpenAI: ChatGPT-4o openai/chatgpt-4o-latest | 128K | $5.00 | $15 | HIGH |
OpenAI: GPT-4o (2024-05-13) openai/gpt-4o-2024-05-13 | 128K | $5.00 | $15 | HIGH |
OpenAI: GPT-4o (extended) openai/gpt-4o:extended | 128K | $6.00 | $18 | HIGH |
OpenAI: GPT-4 Turbo openai/gpt-4-turbo | 128K | $10 | $30 | HIGH |
OpenAI: GPT-4 Turbo Preview openai/gpt-4-turbo-preview | 128K | $10 | $30 | HIGH |
OpenAI: GPT-4 Turbo (older v1106) openai/gpt-4-1106-preview | 128K | $10 | $30 | HIGH |
OpenAI: o3 Deep Research openai/o3-deep-research | 200K | $10 | $40 | HIGH |
OpenAI: o1 openai/o1 | 200K | $15 | $60 | HIGH |
OpenAI: GPT-4 (older v0314) openai/gpt-4-0314 | 8K | $30 | $60 | HIGH |
OpenAI: GPT-4 openai/gpt-4 | 8K | $30 | $60 | HIGH |
OpenAI: o3 Pro openai/o3-pro | 200K | $20 | $80 | HIGH |
OpenAI: GPT-5 Pro openai/gpt-5-pro | 400K | $15 | $120 | HIGH |
OpenAI: o1-pro openai/o1-pro | 200K | $150 | $600 | HIGH |
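
Each table row packs the display name and the model ID into the first cell, followed by context size, input price, output price, and tier. The sketch below shows one way to split a row into typed fields. `ModelPrice`, `parse_context`, and `parse_row` are hypothetical names, and the parser assumes the model ID never contains whitespace, which holds for the rows listed here.

```python
from dataclasses import dataclass
from decimal import Decimal


@dataclass(frozen=True)
class ModelPrice:
    name: str
    model_id: str
    context_tokens: int      # rounded, as displayed in the table
    input_per_m: Decimal     # USD per million input tokens
    output_per_m: Decimal    # USD per million output tokens
    tier: str


_SUFFIX = {"K": 1_000, "M": 1_000_000}


def parse_context(text):
    """Convert a display value such as '131K' or '1M' to a token count."""
    text = text.strip()
    if text and text[-1] in _SUFFIX:
        return int(Decimal(text[:-1]) * _SUFFIX[text[-1]])
    return int(text)


def parse_row(line):
    """Parse one data row; header and separator rows are not handled."""
    cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
    label, context, input_price, output_price, tier = cells
    # The model ID is the last whitespace-separated token of the first cell.
    name, model_id = label.rsplit(None, 1)
    return ModelPrice(
        name=name,
        model_id=model_id,
        context_tokens=parse_context(context),
        input_per_m=Decimal(input_price.lstrip("$")),
        output_per_m=Decimal(output_price.lstrip("$")),
        tier=tier,
    )


row = parse_row("OpenAI: GPT-4o openai/gpt-4o | 128K | $2.50 | $10 | HIGH |")
print(row.model_id, row.context_tokens, row.input_per_m, row.output_per_m)
```

The context column is a rounded display value, so `context_tokens` is an approximation of the real limit rather than an exact figure.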

Provider: Claude (13 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Anthropic: Claude 3 Haiku anthropic/claude-3-haiku | 200K | $0.250 | $1.25 | MED |
Anthropic: Claude 3.5 Haiku (2024-10-22) anthropic/claude-3.5-haiku-20241022 | 200K | $0.800 | $4.00 | MED |
Anthropic: Claude 3.5 Haiku anthropic/claude-3.5-haiku | 200K | $0.800 | $4.00 | MED |
Anthropic: Claude Haiku 4.5 anthropic/claude-haiku-4.5 | 200K | $1.00 | $5.00 | MED |
Anthropic: Claude Sonnet 4.5 anthropic/claude-sonnet-4.5 | 1M | $3.00 | $15 | HIGH |
Anthropic: Claude Sonnet 4 anthropic/claude-sonnet-4 | 1M | $3.00 | $15 | HIGH |
Anthropic: Claude 3.7 Sonnet (thinking) anthropic/claude-3.7-sonnet:thinking | 200K | $3.00 | $15 | HIGH |
Anthropic: Claude 3.7 Sonnet anthropic/claude-3.7-sonnet | 200K | $3.00 | $15 | HIGH |
Anthropic: Claude 3.5 Sonnet anthropic/claude-3.5-sonnet | 200K | $3.00 | $15 | HIGH |
Anthropic: Claude Opus 4.5 anthropic/claude-opus-4.5 | 200K | $5.00 | $25 | HIGH |
Anthropic: Claude Opus 4.1 anthropic/claude-opus-4.1 | 200K | $15 | $75 | HIGH |
Anthropic: Claude Opus 4 anthropic/claude-opus-4 | 200K | $15 | $75 | HIGH |
Anthropic: Claude 3 Opus anthropic/claude-3-opus | 200K | $15 | $75 | HIGH |

Provider: Gemini (19 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Google: Gemma 3n 4B google/gemma-3n-e4b-it | 33K | $0.020 | $0.040 | LOW |
Google: Gemma 3 4B google/gemma-3-4b-it | 96K | $0.017 | $0.068 | LOW |
Google: Gemma 2 9B google/gemma-2-9b-it | 8K | $0.030 | $0.090 | LOW |
Google: Gemma 3 12B google/gemma-3-12b-it | 131K | $0.030 | $0.100 | LOW |
Google: Gemini 2.0 Flash Lite google/gemini-2.0-flash-lite-001 | 1M | $0.075 | $0.300 | LOW |
Google: Gemini 2.5 Flash Lite Preview 09-2025 google/gemini-2.5-flash-lite-preview-09-2025 | 1M | $0.100 | $0.400 | LOW |
Google: Gemini 2.5 Flash Lite google/gemini-2.5-flash-lite | 1M | $0.100 | $0.400 | LOW |
Google: Gemini 2.0 Flash google/gemini-2.0-flash-001 | 1M | $0.100 | $0.400 | LOW |
Google: Gemma 3 27B google/gemma-3-27b-it | 131K | $0.070 | $0.500 | LOW |
Google: Gemma 2 27B google/gemma-2-27b-it | 8K | $0.650 | $0.650 | MED |
Google: Gemini 2.5 Flash Image (Nano Banana) google/gemini-2.5-flash-image | 33K | $0.300 | $2.50 | MED |
Google: Gemini 2.5 Flash Preview 09-2025 google/gemini-2.5-flash-preview-09-2025 | 1M | $0.300 | $2.50 | MED |
Google: Gemini 2.5 Flash Image Preview (Nano Banana) google/gemini-2.5-flash-image-preview | 33K | $0.300 | $2.50 | MED |
Google: Gemini 2.5 Flash google/gemini-2.5-flash | 1M | $0.300 | $2.50 | MED |
Google: Gemini 2.5 Pro google/gemini-2.5-pro | 1M | $1.25 | $10 | HIGH |
Google: Gemini 2.5 Pro Preview 06-05 google/gemini-2.5-pro-preview | 1M | $1.25 | $10 | HIGH |
Google: Gemini 2.5 Pro Preview 05-06 google/gemini-2.5-pro-preview-05-06 | 1M | $1.25 | $10 | HIGH |
Google: Nano Banana Pro (Gemini 3 Pro Image Preview) google/gemini-3-pro-image-preview | 66K | $2.00 | $12 | HIGH |
Google: Gemini 3 Pro Preview google/gemini-3-pro-preview | 1M | $2.00 | $12 | HIGH |

Provider: Grok (7 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
xAI: Grok 4 Fast x-ai/grok-4-fast | 2M | $0.200 | $0.500 | LOW |
xAI: Grok 3 Mini x-ai/grok-3-mini | 131K | $0.300 | $0.500 | LOW |
xAI: Grok 3 Mini Beta x-ai/grok-3-mini-beta | 131K | $0.300 | $0.500 | LOW |
xAI: Grok Code Fast 1 x-ai/grok-code-fast-1 | 256K | $0.200 | $1.50 | MED |
xAI: Grok 4 x-ai/grok-4 | 256K | $3.00 | $15 | HIGH |
xAI: Grok 3 x-ai/grok-3 | 131K | $3.00 | $15 | HIGH |
xAI: Grok 3 Beta x-ai/grok-3-beta | 131K | $3.00 | $15 | HIGH |

Provider: Qwen (38 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Qwen: Qwen2.5 Coder 7B Instruct qwen/qwen2.5-coder-7b-instruct | 33K | $0.030 | $0.090 | LOW |
Qwen: Qwen3 8B qwen/qwen3-8b | 128K | $0.028 | $0.110 | LOW |
Qwen: Qwen2.5 7B Instruct qwen/qwen-2.5-7b-instruct | 33K | $0.040 | $0.100 | LOW |
Qwen2.5 Coder 32B Instruct qwen/qwen-2.5-coder-32b-instruct | 33K | $0.030 | $0.110 | LOW |
Qwen: Qwen2.5 VL 72B Instruct qwen/qwen2.5-vl-72b-instruct | 33K | $0.030 | $0.130 | LOW |
Qwen: Qwen-Turbo qwen/qwen-turbo | 1M | $0.050 | $0.200 | LOW |
Qwen: Qwen3 14B qwen/qwen3-14b | 41K | $0.050 | $0.220 | LOW |
Qwen: Qwen2.5 VL 32B Instruct qwen/qwen2.5-vl-32b-instruct | 16K | $0.050 | $0.220 | LOW |
Qwen: Qwen3 30B A3B qwen/qwen3-30b-a3b | 41K | $0.060 | $0.220 | LOW |
Qwen: Qwen3 Coder 30B A3B Instruct qwen/qwen3-coder-30b-a3b-instruct | 262K | $0.060 | $0.250 | LOW |
Qwen: Qwen3 32B qwen/qwen3-32b | 41K | $0.080 | $0.240 | LOW |
Qwen2.5 72B Instruct qwen/qwen-2.5-72b-instruct | 33K | $0.070 | $0.260 | LOW |
Qwen: Qwen3 30B A3B Thinking 2507 qwen/qwen3-30b-a3b-thinking-2507 | 33K | $0.051 | $0.340 | LOW |
Qwen: Qwen2.5-VL 7B Instruct qwen/qwen-2.5-vl-7b-instruct | 33K | $0.200 | $0.200 | LOW |
Qwen: Qwen3 30B A3B Instruct 2507 qwen/qwen3-30b-a3b-instruct-2507 | 262K | $0.080 | $0.330 | LOW |
Qwen: Qwen3 VL 8B Instruct qwen/qwen3-vl-8b-instruct | 131K | $0.064 | $0.400 | LOW |
Qwen: Qwen3 235B A22B Instruct 2507 qwen/qwen3-235b-a22b-2507 | 131K | $0.072 | $0.464 | LOW |
Qwen: QwQ 32B qwen/qwq-32b | 33K | $0.150 | $0.400 | LOW |
Qwen: Qwen3 235B A22B Thinking 2507 qwen/qwen3-235b-a22b-thinking-2507 | 262K | $0.110 | $0.600 | LOW |
Qwen: Qwen3 235B A22B qwen/qwen3-235b-a22b | 41K | $0.180 | $0.540 | LOW |
Qwen: Qwen3 VL 30B A3B Instruct qwen/qwen3-vl-30b-a3b-instruct | 262K | $0.150 | $0.600 | LOW |
Qwen: Qwen VL Plus qwen/qwen-vl-plus | 8K | $0.210 | $0.630 | LOW |
Qwen: Qwen3 Next 80B A3B Instruct qwen/qwen3-next-80b-a3b-instruct | 262K | $0.100 | $0.800 | LOW |
Qwen: Qwen3 VL 30B A3B Thinking qwen/qwen3-vl-30b-a3b-thinking | 131K | $0.160 | $0.800 | LOW |
Qwen: Qwen3 Coder 480B A35B qwen/qwen3-coder | 262K | $0.220 | $0.950 | MED |
Qwen: Qwen3 Next 80B A3B Thinking qwen/qwen3-next-80b-a3b-thinking | 131K | $0.120 | $1.20 | MED |
Qwen: Qwen3 VL 235B A22B Thinking qwen/qwen3-vl-235b-a22b-thinking | 262K | $0.300 | $1.20 | MED |
Qwen: Qwen Plus 0728 qwen/qwen-plus-2025-07-28 | 1M | $0.400 | $1.20 | MED |
Qwen: Qwen-Plus qwen/qwen-plus | 131K | $0.400 | $1.20 | MED |
Qwen: Qwen3 Coder Flash qwen/qwen3-coder-flash | 128K | $0.300 | $1.50 | MED |
Qwen: Qwen3 Coder 480B A35B (exacto) qwen/qwen3-coder:exacto | 262K | $0.380 | $1.53 | MED |
Qwen: Qwen3 VL 235B A22B Instruct qwen/qwen3-vl-235b-a22b-instruct | 131K | $0.210 | $1.90 | MED |
Qwen: Qwen3 VL 8B Thinking qwen/qwen3-vl-8b-thinking | 256K | $0.180 | $2.10 | MED |
Qwen: Qwen VL Max qwen/qwen-vl-max | 131K | $0.800 | $3.20 | MED |
Qwen: Qwen Plus 0728 (thinking) qwen/qwen-plus-2025-07-28:thinking | 1M | $0.400 | $4.00 | MED |
Qwen: Qwen3 Coder Plus qwen/qwen3-coder-plus | 128K | $1.00 | $5.00 | MED |
Qwen: Qwen3 Max qwen/qwen3-max | 256K | $1.20 | $6.00 | MED |
Qwen: Qwen-Max qwen/qwen-max | 33K | $1.60 | $6.40 | MED |

Provider: DeepSeek (13 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
DeepSeek: DeepSeek R1 0528 Qwen3 8B deepseek/deepseek-r1-0528-qwen3-8b | 33K | $0.020 | $0.100 | LOW |
DeepSeek: R1 Distill Llama 70B deepseek/deepseek-r1-distill-llama-70b | 131K | $0.030 | $0.130 | LOW |
DeepSeek: R1 Distill Qwen 14B deepseek/deepseek-r1-distill-qwen-14b | 33K | $0.120 | $0.120 | LOW |
DeepSeek: R1 Distill Qwen 32B deepseek/deepseek-r1-distill-qwen-32b | 64K | $0.240 | $0.240 | LOW |
DeepSeek: DeepSeek V3.2 Exp deepseek/deepseek-v3.2-exp | 164K | $0.216 | $0.328 | LOW |
DeepSeek: DeepSeek V3.1 deepseek/deepseek-chat-v3.1 | 164K | $0.200 | $0.800 | LOW |
DeepSeek: DeepSeek V3.1 Terminus (exacto) deepseek/deepseek-v3.1-terminus:exacto | 131K | $0.216 | $0.800 | MED |
DeepSeek: DeepSeek V3.1 Terminus deepseek/deepseek-v3.1-terminus | 131K | $0.216 | $0.800 | MED |
DeepSeek: DeepSeek V3 0324 deepseek/deepseek-chat-v3-0324 | 164K | $0.200 | $0.880 | MED |
DeepSeek: R1 deepseek/deepseek-r1 | 164K | $0.300 | $1.20 | MED |
DeepSeek: DeepSeek V3 deepseek/deepseek-chat | 164K | $0.300 | $1.20 | MED |
DeepSeek: DeepSeek Prover V2 deepseek/deepseek-prover-v2 | 164K | $0.500 | $2.18 | MED |
DeepSeek: R1 0528 deepseek/deepseek-r1-0528 | 164K | $0.200 | $4.50 | MED |
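
The page does not say how the Tier column is assigned. One rule that matches the rows in this listing uses the mean of the input and output price: LOW up to $0.50, HIGH from $5.00, MED in between. The sketch below encodes that rule. Both thresholds are assumptions inferred from the tables, not a documented formula, and the HIGH cutoff in particular could sit anywhere between $4.50 and $5.00 without contradicting the listed rows.

```python
from decimal import Decimal

# Assumed thresholds on the mean of input and output $/M, inferred from the
# listed rows rather than taken from any documented rule.
LOW_MAX_MEAN = Decimal("0.50")
HIGH_MIN_MEAN = Decimal("5.00")


def infer_tier(input_per_m, output_per_m):
    """Guess the tier label from per-million-token prices (given as strings)."""
    mean = (Decimal(input_per_m) + Decimal(output_per_m)) / 2
    if mean <= LOW_MAX_MEAN:
        return "LOW"
    if mean >= HIGH_MIN_MEAN:
        return "HIGH"
    return "MED"


# DeepSeek V3.1 ($0.200 / $0.800) sits exactly on the assumed LOW boundary,
# while V3.1 Terminus ($0.216 / $0.800) falls just above it.
print(infer_tier("0.200", "0.800"))  # LOW
print(infer_tier("0.216", "0.800"))  # MED
```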

Provider: Mistral (31 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Mistral: Mistral Nemo mistralai/mistral-nemo | 131K | $0.020 | $0.040 | LOW |
Mistral: Ministral 3B mistralai/ministral-3b | 131K | $0.040 | $0.040 | LOW |
Mistral: Mistral 7B Instruct mistralai/mistral-7b-instruct | 33K | $0.028 | $0.054 | LOW |
Mistral: Mistral Small 3 mistralai/mistral-small-24b-instruct-2501 | 33K | $0.050 | $0.080 | LOW |
Mistral: Mistral Small 3.1 24B mistralai/mistral-small-3.1-24b-instruct | 131K | $0.030 | $0.110 | LOW |
Mistral: Devstral Small 2505 mistralai/devstral-small-2505 | 128K | $0.060 | $0.120 | LOW |
Mistral: Ministral 8B mistralai/ministral-8b | 131K | $0.100 | $0.100 | LOW |
Mistral: Pixtral 12B mistralai/pixtral-12b | 33K | $0.100 | $0.100 | LOW |
Mistral: Mistral Small 3.2 24B mistralai/mistral-small-3.2-24b-instruct | 131K | $0.060 | $0.180 | LOW |
Mistral: Mistral 7B Instruct v0.1 mistralai/mistral-7b-instruct-v0.1 | 3K | $0.110 | $0.190 | LOW |
Mistral: Devstral Small 1.1 mistralai/devstral-small | 128K | $0.070 | $0.280 | LOW |
Mistral: Voxtral Small 24B 2507 mistralai/voxtral-small-24b-2507 | 32K | $0.100 | $0.300 | LOW |
Mistral: Mistral 7B Instruct v0.3 mistralai/mistral-7b-instruct-v0.3 | 33K | $0.200 | $0.200 | LOW |
Mistral: Mistral 7B Instruct v0.2 mistralai/mistral-7b-instruct-v0.2 | 33K | $0.200 | $0.200 | LOW |
Mistral Tiny mistralai/mistral-tiny | 33K | $0.250 | $0.250 | LOW |
Mistral: Saba mistralai/mistral-saba | 33K | $0.200 | $0.600 | LOW |
Mistral Small mistralai/mistral-small | 33K | $0.200 | $0.600 | LOW |
Mistral: Mixtral 8x7B Instruct mistralai/mixtral-8x7b-instruct | 33K | $0.540 | $0.540 | MED |
Mistral: Codestral 2508 mistralai/codestral-2508 | 256K | $0.300 | $0.900 | MED |
Mistral: Codestral 2501 mistralai/codestral-2501 | 256K | $0.300 | $0.900 | MED |
Mistral: Magistral Small 2506 mistralai/magistral-small-2506 | 40K | $0.500 | $1.50 | MED |
Mistral: Mistral Medium 3.1 mistralai/mistral-medium-3.1 | 131K | $0.400 | $2.00 | MED |
Mistral: Devstral Medium mistralai/devstral-medium | 131K | $0.400 | $2.00 | MED |
Mistral: Mistral Medium 3 mistralai/mistral-medium-3 | 131K | $0.400 | $2.00 | MED |
Mistral: Magistral Medium 2506 (thinking) mistralai/magistral-medium-2506:thinking | 41K | $2.00 | $5.00 | MED |
Mistral: Magistral Medium 2506 mistralai/magistral-medium-2506 | 41K | $2.00 | $5.00 | MED |
Mistral Large 2411 mistralai/mistral-large-2411 | 131K | $2.00 | $6.00 | MED |
Mistral Large 2407 mistralai/mistral-large-2407 | 131K | $2.00 | $6.00 | MED |
Mistral: Pixtral Large 2411 mistralai/pixtral-large-2411 | 131K | $2.00 | $6.00 | MED |
Mistral: Mixtral 8x22B Instruct mistralai/mixtral-8x22b-instruct | 66K | $2.00 | $6.00 | MED |
Mistral Large mistralai/mistral-large | 128K | $2.00 | $6.00 | MED |

Provider: Cohere (4 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Cohere: Command R7B (12-2024) cohere/command-r7b-12-2024 | 128K | $0.037 | $0.150 | LOW |
Cohere: Command R (08-2024) cohere/command-r-08-2024 | 128K | $0.150 | $0.600 | LOW |
Cohere: Command A cohere/command-a | 256K | $2.50 | $10 | HIGH |
Cohere: Command R+ (08-2024) cohere/command-r-plus-08-2024 | 128K | $2.50 | $10 | HIGH |

Provider: MoonshotAI (6 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
MoonshotAI: Kimi Linear 48B A3B Instruct moonshotai/kimi-linear-48b-a3b-instruct | 1M | $0.500 | $0.600 | MED |
MoonshotAI: Kimi Dev 72B moonshotai/kimi-dev-72b | 131K | $0.290 | $1.15 | MED |
MoonshotAI: Kimi K2 0905 moonshotai/kimi-k2-0905 | 262K | $0.390 | $1.90 | MED |
MoonshotAI: Kimi K2 0711 moonshotai/kimi-k2 | 131K | $0.456 | $1.84 | MED |
MoonshotAI: Kimi K2 Thinking moonshotai/kimi-k2-thinking | 262K | $0.450 | $2.35 | MED |
MoonshotAI: Kimi K2 0905 (exacto) moonshotai/kimi-k2-0905:exacto | 262K | $0.600 | $2.50 | MED |

Provider: ByteDance (1 model)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
ByteDance: UI-TARS 7B bytedance/ui-tars-1.5-7b | 128K | $0.100 | $0.200 | LOW |

Provider: DeepCogito (5 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Cogito V2 Preview Llama 109B deepcogito/cogito-v2-preview-llama-109b-moe | 33K | $0.180 | $0.590 | LOW |
Deep Cogito: Cogito V2 Preview Llama 70B deepcogito/cogito-v2-preview-llama-70b | 33K | $0.880 | $0.880 | MED |
Deep Cogito: Cogito v2.1 671B deepcogito/cogito-v2.1-671b | 128K | $1.25 | $1.25 | MED |
Deep Cogito: Cogito V2 Preview Deepseek 671B deepcogito/cogito-v2-preview-deepseek-671b | 164K | $1.25 | $1.25 | MED |
Deep Cogito: Cogito V2 Preview Llama 405B deepcogito/cogito-v2-preview-llama-405b | 33K | $3.50 | $3.50 | MED |

Provider: Baidu (5 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Baidu: ERNIE 4.5 21B A3B Thinking baidu/ernie-4.5-21b-a3b-thinking | 131K | $0.056 | $0.224 | LOW |
Baidu: ERNIE 4.5 21B A3B baidu/ernie-4.5-21b-a3b | 120K | $0.056 | $0.224 | LOW |
Baidu: ERNIE 4.5 VL 28B A3B baidu/ernie-4.5-vl-28b-a3b | 30K | $0.112 | $0.448 | LOW |
Baidu: ERNIE 4.5 300B A47B baidu/ernie-4.5-300b-a47b | 123K | $0.224 | $0.880 | MED |
Baidu: ERNIE 4.5 VL 424B A47B baidu/ernie-4.5-vl-424b-a47b | 123K | $0.336 | $1.00 | MED |

Provider: Z-AI (6 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Z.AI: GLM 4 32B z-ai/glm-4-32b | 128K | $0.100 | $0.100 | LOW |
Z.AI: GLM 4.5 Air z-ai/glm-4.5-air | 131K | $0.104 | $0.680 | LOW |
Z.AI: GLM 4.5 z-ai/glm-4.5 | 131K | $0.350 | $1.55 | MED |
Z.AI: GLM 4.5V z-ai/glm-4.5v | 66K | $0.480 | $1.44 | MED |
Z.AI: GLM 4.6 z-ai/glm-4.6 | 203K | $0.400 | $1.75 | MED |
Z.AI: GLM 4.6 (exacto) z-ai/glm-4.6:exacto | 205K | $0.440 | $1.76 | MED |

Provider: Tencent (1 model)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Tencent: Hunyuan A13B Instruct tencent/hunyuan-a13b-instruct | 131K | $0.140 | $0.570 | LOW |

Provider: MiniMax (3 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
MiniMax: MiniMax M2 minimax/minimax-m2 | 205K | $0.255 | $1.02 | MED |
MiniMax: MiniMax-01 minimax/minimax-01 | 1M | $0.200 | $1.10 | MED |
MiniMax: MiniMax M1 minimax/minimax-m1 | 1M | $0.400 | $2.20 | MED |

Provider: Meta-Llama (16 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Meta: Llama 3.2 3B Instruct meta-llama/llama-3.2-3b-instruct | 131K | $0.020 | $0.020 | LOW |
Meta: Llama 3.1 8B Instruct meta-llama/llama-3.1-8b-instruct | 131K | $0.020 | $0.030 | LOW |
Llama Guard 3 8B meta-llama/llama-guard-3-8b | 131K | $0.020 | $0.060 | LOW |
Meta: Llama 3 8B Instruct meta-llama/llama-3-8b-instruct | 8K | $0.030 | $0.060 | LOW |
Meta: Llama 3.2 11B Vision Instruct meta-llama/llama-3.2-11b-vision-instruct | 131K | $0.049 | $0.049 | LOW |
Meta: Llama 3.2 1B Instruct meta-llama/llama-3.2-1b-instruct | 60K | $0.027 | $0.200 | LOW |
Meta: Llama Guard 4 12B meta-llama/llama-guard-4-12b | 164K | $0.180 | $0.180 | LOW |
Meta: Llama 4 Scout meta-llama/llama-4-scout | 328K | $0.080 | $0.300 | LOW |
Meta: LlamaGuard 2 8B meta-llama/llama-guard-2-8b | 8K | $0.200 | $0.200 | LOW |
Meta: Llama 3.3 70B Instruct meta-llama/llama-3.3-70b-instruct | 131K | $0.104 | $0.312 | LOW |
Meta: Llama 3 70B Instruct meta-llama/llama-3-70b-instruct | 8K | $0.300 | $0.400 | LOW |
Meta: Llama 3.2 90B Vision Instruct meta-llama/llama-3.2-90b-vision-instruct | 33K | $0.350 | $0.400 | LOW |
Meta: Llama 3.1 70B Instruct meta-llama/llama-3.1-70b-instruct | 131K | $0.400 | $0.400 | LOW |
Meta: Llama 4 Maverick meta-llama/llama-4-maverick | 1M | $0.136 | $0.680 | LOW |
Meta: Llama 3.1 405B Instruct meta-llama/llama-3.1-405b-instruct | 131K | $3.50 | $3.50 | MED |
Meta: Llama 3.1 405B (base) meta-llama/llama-3.1-405b | 33K | $4.00 | $4.00 | MED |

Provider: Microsoft (8 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Microsoft: Phi 4 Multimodal Instruct microsoft/phi-4-multimodal-instruct | 131K | $0.050 | $0.100 | LOW |
Microsoft: Phi-3.5 Mini 128K Instruct microsoft/phi-3.5-mini-128k-instruct | 128K | $0.100 | $0.100 | LOW |
Microsoft: Phi-3 Mini 128K Instruct microsoft/phi-3-mini-128k-instruct | 128K | $0.100 | $0.100 | LOW |
Microsoft: Phi 4 microsoft/phi-4 | 16K | $0.060 | $0.140 | LOW |
Microsoft: Phi 4 Reasoning Plus microsoft/phi-4-reasoning-plus | 33K | $0.070 | $0.350 | LOW |
WizardLM-2 8x22B microsoft/wizardlm-2-8x22b | 66K | $0.480 | $0.480 | LOW |
Microsoft: MAI DS R1 microsoft/mai-ds-r1 | 164K | $0.300 | $1.20 | MED |
Microsoft: Phi-3 Medium 128K Instruct microsoft/phi-3-medium-128k-instruct | 128K | $1.00 | $1.00 | MED |

Provider: NVIDIA (5 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
NVIDIA: Nemotron Nano 9B V2 nvidia/nemotron-nano-9b-v2 | 131K | $0.040 | $0.160 | LOW |
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 nvidia/llama-3.3-nemotron-super-49b-v1.5 | 131K | $0.100 | $0.400 | LOW |
NVIDIA: Nemotron Nano 12B 2 VL nvidia/nemotron-nano-12b-v2-vl | 131K | $0.200 | $0.600 | LOW |
NVIDIA: Llama 3.1 Nemotron Ultra 253B v1 nvidia/llama-3.1-nemotron-ultra-253b-v1 | 131K | $0.600 | $1.80 | MED |
NVIDIA: Llama 3.1 Nemotron 70B Instruct nvidia/llama-3.1-nemotron-70b-instruct | 131K | $1.20 | $1.20 | MED |

Provider: Perplexity (6 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Perplexity: Sonar perplexity/sonar | 127K | $1.00 | $1.00 | MED |
Perplexity: Sonar Reasoning perplexity/sonar-reasoning | 127K | $1.00 | $5.00 | MED |
Perplexity: Sonar Reasoning Pro perplexity/sonar-reasoning-pro | 128K | $2.00 | $8.00 | HIGH |
Perplexity: Sonar Deep Research perplexity/sonar-deep-research | 128K | $2.00 | $8.00 | HIGH |
Perplexity: Sonar Pro Search perplexity/sonar-pro-search | 200K | $3.00 | $15 | HIGH |
Perplexity: Sonar Pro perplexity/sonar-pro | 200K | $3.00 | $15 | HIGH |

Provider: Amazon (4 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Amazon: Nova Micro 1.0 amazon/nova-micro-v1 | 128K | $0.035 | $0.140 | LOW |
Amazon: Nova Lite 1.0 amazon/nova-lite-v1 | 300K | $0.060 | $0.240 | LOW |
Amazon: Nova Pro 1.0 amazon/nova-pro-v1 | 300K | $0.800 | $3.20 | MED |
Amazon: Nova Premier 1.0 amazon/nova-premier-v1 | 1M | $2.50 | $13 | HIGH |

Provider: prime-intellect (1 model)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Prime Intellect: INTELLECT-3 prime-intellect/intellect-3 | 131K | $0.200 | $1.10 | MED |

Provider: tngtech (3 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
TNG: R1T Chimera tngtech/tng-r1t-chimera | 164K | $0.300 | $1.20 | MED |
TNG: DeepSeek R1T2 Chimera tngtech/deepseek-r1t2-chimera | 164K | $0.300 | $1.20 | MED |
TNG: DeepSeek R1T Chimera tngtech/deepseek-r1t-chimera | 164K | $0.300 | $1.20 | MED |

Provider: allenai (4 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
AllenAI: Olmo 2 32B Instruct allenai/olmo-2-0325-32b-instruct | 128K | $0.050 | $0.200 | LOW |
AllenAI: Olmo 3 7B Instruct allenai/olmo-3-7b-instruct | 66K | $0.100 | $0.200 | LOW |
AllenAI: Olmo 3 7B Think allenai/olmo-3-7b-think | 66K | $0.120 | $0.200 | LOW |
AllenAI: Olmo 3 32B Think allenai/olmo-3-32b-think | 66K | $0.300 | $0.550 | LOW |

Provider: liquid (2 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
LiquidAI/LFM2-8B-A1B liquid/lfm2-8b-a1b | 33K | $0.050 | $0.100 | LOW |
LiquidAI/LFM2-2.6B liquid/lfm-2.2-6b | 33K | $0.050 | $0.100 | LOW |

Provider: ibm-granite (1 model)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
IBM: Granite 4.0 Micro ibm-granite/granite-4.0-h-micro | 131K | $0.017 | $0.110 | LOW |

Provider: thedrummer (5 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
TheDrummer: Rocinante 12B thedrummer/rocinante-12b | 33K | $0.170 | $0.430 | LOW |
TheDrummer: UnslopNemo 12B thedrummer/unslopnemo-12b | 33K | $0.400 | $0.400 | LOW |
TheDrummer: Cydonia 24B V4.1 thedrummer/cydonia-24b-v4.1 | 131K | $0.300 | $0.500 | LOW |
TheDrummer: Skyfall 36B V2 thedrummer/skyfall-36b-v2 | 33K | $0.500 | $0.800 | MED |
TheDrummer: Anubis 70B V1.1 thedrummer/anubis-70b-v1.1 | 131K | $0.750 | $1.00 | MED |

Provider: relace (1 model)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Relace: Relace Apply 3 relace/relace-apply-3 | 256K | $0.850 | $1.25 | MED |

Provider: alibaba (1 model)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Tongyi DeepResearch 30B A3B alibaba/tongyi-deepresearch-30b-a3b | 131K | $0.090 | $0.400 | LOW |

Provider: opengvlab (1 model)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
OpenGVLab: InternVL3 78B opengvlab/internvl3-78b | 33K | $0.070 | $0.260 | LOW |

Provider: meituan (1 model)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Meituan: LongCat Flash Chat meituan/longcat-flash-chat | 131K | $0.150 | $0.750 | LOW |

Provider: stepfun-ai (1 model)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
StepFun: Step3 stepfun-ai/step3 | 66K | $0.570 | $1.42 | MED |

Provider: nousresearch (6 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
NousResearch: Hermes 2 Pro - Llama-3 8B nousresearch/hermes-2-pro-llama-3-8b | 8K | $0.025 | $0.080 | LOW |
Nous: DeepHermes 3 Mistral 24B Preview nousresearch/deephermes-3-mistral-24b-preview | 33K | $0.050 | $0.200 | LOW |
Nous: Hermes 4 70B nousresearch/hermes-4-70b | 131K | $0.110 | $0.380 | LOW |
Nous: Hermes 3 70B Instruct nousresearch/hermes-3-llama-3.1-70b | 66K | $0.300 | $0.300 | LOW |
Nous: Hermes 4 405B nousresearch/hermes-4-405b | 131K | $0.300 | $1.20 | MED |
Nous: Hermes 3 405B Instruct nousresearch/hermes-3-llama-3.1-405b | 131K | $1.00 | $1.00 | MED |

Provider: ai21 (2 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
AI21: Jamba Mini 1.7 ai21/jamba-mini-1.7 | 256K | $0.200 | $0.400 | LOW |
AI21: Jamba Large 1.7 ai21/jamba-large-1.7 | 256K | $2.00 | $8.00 | HIGH |

Provider: switchpoint (1 model)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Switchpoint Router switchpoint/router | 131K | $0.850 | $3.40 | MED |

Provider: thudm (1 model)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
THUDM: GLM 4.1V 9B Thinking thudm/glm-4.1v-9b-thinking | 66K | $0.028 | $0.110 | LOW |

Provider: morph (2 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Morph: Morph V3 Fast morph/morph-v3-fast | 82K | $0.800 | $1.20 | MED |
Morph: Morph V3 Large morph/morph-v3-large | 262K | $0.900 | $1.90 | MED |

Provider: inception (2 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Inception: Mercury inception/mercury | 128K | $0.250 | $1.00 | MED |
Inception: Mercury Coder inception/mercury-coder | 128K | $0.250 | $1.00 | MED |

Provider: arcee-ai (4 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Arcee AI: Spotlight arcee-ai/spotlight | 131K | $0.180 | $0.180 | LOW |
Arcee AI: Coder Large arcee-ai/coder-large | 33K | $0.500 | $0.800 | MED |
Arcee AI: Virtuoso Large arcee-ai/virtuoso-large | 131K | $0.750 | $1.20 | MED |
Arcee AI: Maestro Reasoning arcee-ai/maestro-reasoning | 131K | $0.900 | $3.30 | MED |

Provider: eleutherai (1 model)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
EleutherAI: Llemma 7b eleutherai/llemma_7b | 4K | $0.800 | $1.20 | MED |

Provider: alfredpros (1 model)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
AlfredPros: CodeLLaMa 7B Instruct Solidity alfredpros/codellama-7b-instruct-solidity | 4K | $0.800 | $1.20 | MED |

Provider: arliai (1 model)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
ArliAI: QwQ 32B RpR v1 arliai/qwq-32b-arliai-rpr-v1 | 33K | $0.030 | $0.110 | LOW |

Provider: aion-labs (3 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
AionLabs: Aion-RP 1.0 (8B) aion-labs/aion-rp-llama-3.1-8b | 33K | $0.200 | $0.200 | LOW |
AionLabs: Aion-1.0-Mini aion-labs/aion-1.0-mini | 131K | $0.700 | $1.40 | MED |
AionLabs: Aion-1.0 aion-labs/aion-1.0 | 131K | $4.00 | $8.00 | HIGH |

Provider: sao10k (5 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Sao10K: Llama 3 8B Lunaris sao10k/l3-lunaris-8b | 8K | $0.040 | $0.050 | LOW |
Sao10K: Llama 3.3 Euryale 70B sao10k/l3.3-euryale-70b | 131K | $0.650 | $0.750 | MED |
Sao10K: Llama 3.1 Euryale 70B v2.2 sao10k/l3.1-euryale-70b | 33K | $0.650 | $0.750 | MED |
Sao10k: Llama 3 Euryale 70B v2.1 sao10k/l3-euryale-70b | 8K | $1.48 | $1.48 | MED |
Sao10K: Llama 3.1 70B Hanami x1 sao10k/l3.1-70b-hanami-x1 | 16K | $3.00 | $3.00 | MED |

Provider: raifle (1 model)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
SorcererLM 8x22B raifle/sorcererlm-8x22b | 16K | $4.50 | $4.50 | MED |

Provider: anthracite-org (1 model)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Magnum v4 72B anthracite-org/magnum-v4-72b | 16K | $3.00 | $5.00 | MED |

Provider: inflection (2 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Inflection: Inflection 3 Productivity inflection/inflection-3-productivity | 8K | $2.50 | $10 | HIGH |
Inflection: Inflection 3 Pi inflection/inflection-3-pi | 8K | $2.50 | $10 | HIGH |

Provider: neversleep (2 models)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
NeverSleep: Lumimaid v0.2 8B neversleep/llama-3.1-lumimaid-8b | 33K | $0.090 | $0.600 | LOW |
Noromaid 20B neversleep/noromaid-20b | 4K | $1.00 | $1.75 | MED |

Provider: alpindale (1 model)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Goliath 120B alpindale/goliath-120b | 6K | $6.00 | $8.00 | HIGH |

Provider: mancer (1 model)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
Mancer: Weaver (alpha) mancer/weaver | 8K | $1.13 | $1.13 | MED |

Provider: undi95 (1 model)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
ReMM SLERP 13B undi95/remm-slerp-l2-13b | 6K | $0.450 | $0.650 | MED |

Provider: gryphe (1 model)

| Model | Context | Input $/M | Output $/M | Tier |
|---|---|---|---|---|
MythoMax 13B gryphe/mythomax-l2-13b | 4K | $0.060 | $0.060 | LOW |
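
Because input and output tokens are priced separately, the cheapest model depends on the traffic mix and not on either column alone. The sketch below ranks four rows from the tables above for a hypothetical workload of 50 million input tokens and 10 million output tokens. The workload figures and the name `rank_by_cost` are assumptions for illustration; the prices are copied from the tables.

```python
from decimal import Decimal

# (input $/M, output $/M), copied from the tables above.
PRICES = {
    "openai/gpt-5-mini": (Decimal("0.250"), Decimal("2.00")),
    "anthropic/claude-haiku-4.5": (Decimal("1.00"), Decimal("5.00")),
    "google/gemini-2.5-flash": (Decimal("0.300"), Decimal("2.50")),
    "deepseek/deepseek-r1-0528": (Decimal("0.200"), Decimal("4.50")),
}


def rank_by_cost(prices, input_millions, output_millions):
    """Return (model_id, total USD) pairs sorted from cheapest to most expensive."""
    totals = {
        model_id: input_price * input_millions + output_price * output_millions
        for model_id, (input_price, output_price) in prices.items()
    }
    return sorted(totals.items(), key=lambda item: item[1])


# Hypothetical workload: 50M input tokens and 10M output tokens.
for model_id, total in rank_by_cost(PRICES, 50, 10):
    print(f"{model_id:32} ${total:.2f}")
```

With this mix, `deepseek/deepseek-r1-0528` has the lowest input price of the four yet ranks third at $55.00, because its $4.50 output price outweighs the input saving.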