LLM Token Cost Calculator
Compare token pricing across 267 LLMs from 51 active AI providers. Prices are listed per million tokens ($/M).
Data auto-refreshes every 12 hours
40 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
OpenAI: gpt-oss-20b openai/gpt-oss-20b | 131K | $0.040 | $0.150 | budget |
OpenAI: gpt-oss-120b openai/gpt-oss-120b | 131K | $0.072 | $0.280 | budget |
OpenAI: GPT-5 Nano openai/gpt-5-nano | 400K | $0.050 | $0.400 | budget |
OpenAI: GPT-4.1 Nano openai/gpt-4.1-nano | 1.0M | $0.100 | $0.400 | budget |
OpenAI: GPT-4o-mini Search Preview openai/gpt-4o-mini-search-preview | 128K | $0.150 | $0.600 | budget |
OpenAI: GPT-4o-mini openai/gpt-4o-mini | 128K | $0.150 | $0.600 | budget |
OpenAI: GPT-4o-mini (2024-07-18) openai/gpt-4o-mini-2024-07-18 | 128K | $0.150 | $0.600 | budget |
OpenAI: GPT-4.1 Mini openai/gpt-4.1-mini | 1.0M | $0.400 | $1.60 | standard |
OpenAI: GPT-3.5 Turbo openai/gpt-3.5-turbo | 16K | $0.500 | $1.50 | standard |
OpenAI: GPT-5 Mini openai/gpt-5-mini | 400K | $0.250 | $2.00 | standard |
OpenAI: GPT-3.5 Turbo (older v0613) openai/gpt-3.5-turbo-0613 | 4K | $1.00 | $2.00 | standard |
OpenAI: GPT-3.5 Turbo Instruct openai/gpt-3.5-turbo-instruct | 4K | $1.50 | $2.00 | standard |
OpenAI: o4 Mini High openai/o4-mini-high | 200K | $1.10 | $4.40 | standard |
OpenAI: o4 Mini openai/o4-mini | 200K | $1.10 | $4.40 | standard |
OpenAI: o3 Mini High openai/o3-mini-high | 200K | $1.10 | $4.40 | standard |
OpenAI: o3 Mini openai/o3-mini | 200K | $1.10 | $4.40 | standard |
OpenAI: o1-mini openai/o1-mini | 128K | $1.10 | $4.40 | standard |
OpenAI: o1-mini (2024-09-12) openai/o1-mini-2024-09-12 | 128K | $1.10 | $4.40 | standard |
OpenAI: GPT-3.5 Turbo 16k openai/gpt-3.5-turbo-16k | 16K | $3.00 | $4.00 | standard |
OpenAI: Codex Mini openai/codex-mini | 200K | $1.50 | $6.00 | standard |
OpenAI: o3 openai/o3 | 200K | $2.00 | $8.00 | premium |
OpenAI: GPT-4.1 openai/gpt-4.1 | 1.0M | $2.00 | $8.00 | premium |
OpenAI: GPT-5 Chat openai/gpt-5-chat | 128K | $1.25 | $10 | premium |
OpenAI: GPT-5 openai/gpt-5 | 400K | $1.25 | $10 | premium |
OpenAI: GPT-4o Audio openai/gpt-4o-audio-preview | 128K | $2.50 | $10 | premium |
OpenAI: GPT-4o Search Preview openai/gpt-4o-search-preview | 128K | $2.50 | $10 | premium |
OpenAI: GPT-4o (2024-11-20) openai/gpt-4o-2024-11-20 | 128K | $2.50 | $10 | premium |
OpenAI: GPT-4o (2024-08-06) openai/gpt-4o-2024-08-06 | 128K | $2.50 | $10 | premium |
OpenAI: GPT-4o openai/gpt-4o | 128K | $2.50 | $10 | premium |
OpenAI: ChatGPT-4o openai/chatgpt-4o-latest | 128K | $5.00 | $15 | premium |
OpenAI: GPT-4o (2024-05-13) openai/gpt-4o-2024-05-13 | 128K | $5.00 | $15 | premium |
OpenAI: GPT-4o (extended) openai/gpt-4o:extended | 128K | $6.00 | $18 | premium |
OpenAI: GPT-4 Turbo openai/gpt-4-turbo | 128K | $10 | $30 | premium |
OpenAI: GPT-4 Turbo Preview openai/gpt-4-turbo-preview | 128K | $10 | $30 | premium |
OpenAI: GPT-4 Turbo (older v1106) openai/gpt-4-1106-preview | 128K | $10 | $30 | premium |
OpenAI: o1 openai/o1 | 200K | $15 | $60 | premium |
OpenAI: GPT-4 openai/gpt-4 | 8K | $30 | $60 | premium |
OpenAI: GPT-4 (older v0314) openai/gpt-4-0314 | 8K | $30 | $60 | premium |
OpenAI: o3 Pro openai/o3-pro | 200K | $20 | $80 | premium |
OpenAI: o1-pro openai/o1-pro | 200K | $150 | $600 | premium |
11 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Anthropic: Claude 3 Haiku anthropic/claude-3-haiku | 200K | $0.250 | $1.25 | standard |
Anthropic: Claude 3.5 Haiku anthropic/claude-3.5-haiku | 200K | $0.800 | $4.00 | standard |
Anthropic: Claude 3.5 Haiku (2024-10-22) anthropic/claude-3.5-haiku-20241022 | 200K | $0.800 | $4.00 | standard |
Anthropic: Claude Sonnet 4 anthropic/claude-sonnet-4 | 1.0M | $3.00 | $15 | premium |
Anthropic: Claude 3.7 Sonnet anthropic/claude-3.7-sonnet | 200K | $3.00 | $15 | premium |
Anthropic: Claude 3.7 Sonnet (thinking) anthropic/claude-3.7-sonnet:thinking | 200K | $3.00 | $15 | premium |
Anthropic: Claude 3.5 Sonnet anthropic/claude-3.5-sonnet | 200K | $3.00 | $15 | premium |
Anthropic: Claude 3.5 Sonnet (2024-06-20) anthropic/claude-3.5-sonnet-20240620 | 200K | $3.00 | $15 | premium |
Anthropic: Claude Opus 4.1 anthropic/claude-opus-4.1 | 200K | $15 | $75 | premium |
Anthropic: Claude Opus 4 anthropic/claude-opus-4 | 200K | $15 | $75 | premium |
Anthropic: Claude 3 Opus anthropic/claude-3-opus | 200K | $15 | $75 | premium |
18 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Google: Gemma 2 9B google/gemma-2-9b-it | 8K | $0.010 | $0.010 | budget |
Google: Gemma 3n 4B google/gemma-3n-e4b-it | 33K | $0.020 | $0.040 | budget |
Google: Gemma 3 4B google/gemma-3-4b-it | 131K | $0.040 | $0.080 | budget |
Google: Gemini 1.5 Flash 8B google/gemini-flash-1.5-8b | 1.0M | $0.037 | $0.150 | budget |
Google: Gemma 3 12B google/gemma-3-12b-it | 96K | $0.048 | $0.193 | budget |
Google: Gemma 3 27B google/gemma-3-27b-it | 96K | $0.067 | $0.267 | budget |
Google: Gemini 2.0 Flash Lite google/gemini-2.0-flash-lite-001 | 1.0M | $0.075 | $0.300 | budget |
Google: Gemini 1.5 Flash google/gemini-flash-1.5 | 1.0M | $0.075 | $0.300 | budget |
Google: Gemini 2.5 Flash Lite google/gemini-2.5-flash-lite | 1.0M | $0.100 | $0.400 | budget |
Google: Gemini 2.5 Flash Lite Preview 06-17 google/gemini-2.5-flash-lite-preview-06-17 | 1.0M | $0.100 | $0.400 | budget |
Google: Gemini 2.0 Flash google/gemini-2.0-flash-001 | 1.0M | $0.100 | $0.400 | budget |
Google: Gemma 2 27B google/gemma-2-27b-it | 8K | $0.650 | $0.650 | standard |
Google: Gemini 2.5 Flash Image Preview google/gemini-2.5-flash-image-preview | 33K | $0.300 | $2.50 | standard |
Google: Gemini 2.5 Flash google/gemini-2.5-flash | 1.0M | $0.300 | $2.50 | standard |
Google: Gemini 1.5 Pro google/gemini-pro-1.5 | 2.0M | $1.25 | $5.00 | standard |
Google: Gemini 2.5 Pro google/gemini-2.5-pro | 1.0M | $1.25 | $10 | premium |
Google: Gemini 2.5 Pro Preview 06-05 google/gemini-2.5-pro-preview | 1.0M | $1.25 | $10 | premium |
Google: Gemini 2.5 Pro Preview 05-06 google/gemini-2.5-pro-preview-05-06 | 1.0M | $1.25 | $10 | premium |
8 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
xAI: Grok 3 Mini x-ai/grok-3-mini | 131K | $0.300 | $0.500 | budget |
xAI: Grok 3 Mini Beta x-ai/grok-3-mini-beta | 131K | $0.300 | $0.500 | budget |
xAI: Grok Code Fast 1 x-ai/grok-code-fast-1 | 256K | $0.200 | $1.50 | standard |
xAI: Grok 2 Vision 1212 x-ai/grok-2-vision-1212 | 33K | $2.00 | $10 | premium |
xAI: Grok 2 1212 x-ai/grok-2-1212 | 131K | $2.00 | $10 | premium |
xAI: Grok 4 x-ai/grok-4 | 256K | $3.00 | $15 | premium |
xAI: Grok 3 x-ai/grok-3 | 131K | $3.00 | $15 | premium |
xAI: Grok 3 Beta x-ai/grok-3-beta | 131K | $3.00 | $15 | premium |
25 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Qwen: Qwen3 32B qwen/qwen3-32b | 41K | $0.018 | $0.072 | budget |
Qwen: Qwen3 30B A3B qwen/qwen3-30b-a3b | 41K | $0.020 | $0.080 | budget |
Qwen: Qwen2.5 VL 32B Instruct qwen/qwen2.5-vl-32b-instruct | 16K | $0.020 | $0.080 | budget |
Qwen2.5 7B Instruct qwen/qwen-2.5-7b-instruct | 66K | $0.040 | $0.100 | budget |
Qwen: Qwen3 8B qwen/qwen3-8b | 128K | $0.035 | $0.138 | budget |
Qwen2.5 Coder 32B Instruct qwen/qwen-2.5-coder-32b-instruct | 33K | $0.050 | $0.200 | budget |
Qwen: Qwen-Turbo qwen/qwen-turbo | 1.0M | $0.050 | $0.200 | budget |
Qwen: Qwen3 Coder 30B A3B Instruct qwen/qwen3-coder-30b-a3b-instruct | 262K | $0.052 | $0.207 | budget |
Qwen: Qwen3 30B A3B Instruct 2507 qwen/qwen3-30b-a3b-instruct-2507 | 262K | $0.052 | $0.207 | budget |
Qwen2.5 72B Instruct qwen/qwen-2.5-72b-instruct | 33K | $0.052 | $0.207 | budget |
Qwen: Qwen3 14B qwen/qwen3-14b | 41K | $0.060 | $0.240 | budget |
Qwen: Qwen3 30B A3B Thinking 2507 qwen/qwen3-30b-a3b-thinking-2507 | 262K | $0.071 | $0.285 | budget |
Qwen: Qwen3 235B A22B Thinking 2507 qwen/qwen3-235b-a22b-thinking-2507 | 262K | $0.078 | $0.312 | budget |
Qwen: Qwen3 235B A22B Instruct 2507 qwen/qwen3-235b-a22b-2507 | 262K | $0.078 | $0.312 | budget |
Qwen: QwQ 32B Preview qwen/qwq-32b-preview | 33K | $0.200 | $0.200 | budget |
Qwen: Qwen2.5-VL 7B Instruct qwen/qwen-2.5-vl-7b-instruct | 33K | $0.200 | $0.200 | budget |
Qwen: Qwen2.5 VL 72B Instruct qwen/qwen2.5-vl-72b-instruct | 33K | $0.100 | $0.400 | budget |
Qwen: QwQ 32B qwen/qwq-32b | 33K | $0.150 | $0.400 | budget |
Qwen: Qwen3 235B A22B qwen/qwen3-235b-a22b | 41K | $0.130 | $0.600 | budget |
Qwen: Qwen VL Plus qwen/qwen-vl-plus | 8K | $0.210 | $0.630 | budget |
Qwen: Qwen3 Coder 480B A35B qwen/qwen3-coder | 262K | $0.200 | $0.800 | budget |
Qwen: Qwen-Plus qwen/qwen-plus | 131K | $0.400 | $1.20 | standard |
Qwen: Qwen VL Max qwen/qwen-vl-max | 8K | $0.800 | $3.20 | standard |
Qwen: Qwen3 Max qwen/qwen3-max | 256K | $1.20 | $6.00 | standard |
Qwen: Qwen-Max qwen/qwen-max | 33K | $1.60 | $6.40 | standard |
12 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
DeepSeek: R1 Distill Llama 8B deepseek/deepseek-r1-distill-llama-8b | 32K | $0.040 | $0.040 | budget |
DeepSeek: Deepseek R1 0528 Qwen3 8B deepseek/deepseek-r1-0528-qwen3-8b | 131K | $0.017 | $0.068 | budget |
DeepSeek: R1 Distill Llama 70B deepseek/deepseek-r1-distill-llama-70b | 131K | $0.026 | $0.104 | budget |
DeepSeek: R1 Distill Qwen 32B deepseek/deepseek-r1-distill-qwen-32b | 131K | $0.075 | $0.150 | budget |
DeepSeek: R1 Distill Qwen 14B deepseek/deepseek-r1-distill-qwen-14b | 64K | $0.150 | $0.150 | budget |
DeepSeek: R1 0528 deepseek/deepseek-r1-0528 | 164K | $0.200 | $0.800 | budget |
DeepSeek: DeepSeek V3 0324 deepseek/deepseek-chat-v3-0324 | 164K | $0.200 | $0.800 | budget |
DeepSeek: DeepSeek V3 deepseek/deepseek-chat | 164K | $0.200 | $0.800 | budget |
DeepSeek: DeepSeek V3.1 deepseek/deepseek-chat-v3.1 | 164K | $0.200 | $0.800 | budget |
DeepSeek: DeepSeek V3.1 Base deepseek/deepseek-v3.1-base | 164K | $0.200 | $0.800 | budget |
DeepSeek: R1 deepseek/deepseek-r1 | 164K | $0.400 | $2.00 | standard |
DeepSeek: DeepSeek Prover V2 deepseek/deepseek-prover-v2 | 164K | $0.500 | $2.18 | standard |
29 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Mistral: Mistral Nemo mistralai/mistral-nemo | 131K | $0.010 | $0.040 | budget |
Mistral: Ministral 3B mistralai/ministral-3b | 33K | $0.040 | $0.040 | budget |
Mistral: Mistral 7B Instruct mistralai/mistral-7b-instruct | 33K | $0.028 | $0.054 | budget |
Mistral: Mistral 7B Instruct v0.3 mistralai/mistral-7b-instruct-v0.3 | 33K | $0.028 | $0.054 | budget |
Mistral: Devstral Small 2505 mistralai/devstral-small-2505 | 131K | $0.020 | $0.080 | budget |
Mistral: Mistral Small 3.1 24B mistralai/mistral-small-3.1-24b-instruct | 131K | $0.020 | $0.080 | budget |
Mistral: Mistral Small 3 mistralai/mistral-small-24b-instruct-2501 | 33K | $0.020 | $0.080 | budget |
Mistral: Mistral Small 3.2 24B mistralai/mistral-small-3.2-24b-instruct | 128K | $0.050 | $0.100 | budget |
Mistral: Ministral 8B mistralai/ministral-8b | 128K | $0.100 | $0.100 | budget |
Mistral: Pixtral 12B mistralai/pixtral-12b | 33K | $0.100 | $0.100 | budget |
Mistral: Mistral 7B Instruct v0.1 mistralai/mistral-7b-instruct-v0.1 | 3K | $0.110 | $0.190 | budget |
Mistral: Mixtral 8x7B Instruct mistralai/mixtral-8x7b-instruct | 33K | $0.080 | $0.240 | budget |
Mistral: Devstral Small 1.1 mistralai/devstral-small | 128K | $0.070 | $0.280 | budget |
Mistral Tiny mistralai/mistral-tiny | 33K | $0.250 | $0.250 | budget |
Mistral: Saba mistralai/mistral-saba | 33K | $0.200 | $0.600 | budget |
Mistral Small mistralai/mistral-small | 33K | $0.200 | $0.600 | budget |
Mistral: Codestral 2508 mistralai/codestral-2508 | 256K | $0.300 | $0.900 | standard |
Mistral: Codestral 2501 mistralai/codestral-2501 | 262K | $0.300 | $0.900 | standard |
Mistral: Mixtral 8x22B Instruct mistralai/mixtral-8x22b-instruct | 66K | $0.900 | $0.900 | standard |
Mistral: Magistral Small 2506 mistralai/magistral-small-2506 | 40K | $0.500 | $1.50 | standard |
Mistral: Mistral Medium 3.1 mistralai/mistral-medium-3.1 | 131K | $0.400 | $2.00 | standard |
Mistral: Devstral Medium mistralai/devstral-medium | 131K | $0.400 | $2.00 | standard |
Mistral: Mistral Medium 3 mistralai/mistral-medium-3 | 131K | $0.400 | $2.00 | standard |
Mistral: Magistral Medium 2506 mistralai/magistral-medium-2506 | 41K | $2.00 | $5.00 | standard |
Mistral: Magistral Medium 2506 (thinking) mistralai/magistral-medium-2506:thinking | 41K | $2.00 | $5.00 | standard |
Mistral Large 2411 mistralai/mistral-large-2411 | 131K | $2.00 | $6.00 | standard |
Mistral Large 2407 mistralai/mistral-large-2407 | 131K | $2.00 | $6.00 | standard |
Mistral: Pixtral Large 2411 mistralai/pixtral-large-2411 | 131K | $2.00 | $6.00 | standard |
Mistral Large mistralai/mistral-large | 128K | $2.00 | $6.00 | standard |
9 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Cohere: Command R7B (12-2024) cohere/command-r7b-12-2024 | 128K | $0.037 | $0.150 | budget |
Cohere: Command R (08-2024) cohere/command-r-08-2024 | 128K | $0.150 | $0.600 | budget |
Cohere: Command R cohere/command-r | 128K | $0.500 | $1.50 | standard |
Cohere: Command R (03-2024) cohere/command-r-03-2024 | 128K | $0.500 | $1.50 | standard |
Cohere: Command cohere/command | 4K | $1.00 | $2.00 | standard |
Cohere: Command A cohere/command-a | 256K | $2.50 | $10 | premium |
Cohere: Command R+ (08-2024) cohere/command-r-plus-08-2024 | 128K | $2.50 | $10 | premium |
Cohere: Command R+ cohere/command-r-plus | 128K | $3.00 | $15 | premium |
Cohere: Command R+ (04-2024) cohere/command-r-plus-04-2024 | 128K | $3.00 | $15 | premium |
4 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
MoonshotAI: Kimi VL A3B Thinking moonshotai/kimi-vl-a3b-thinking | 131K | $0.025 | $0.100 | budget |
MoonshotAI: Kimi Dev 72B moonshotai/kimi-dev-72b | 131K | $0.290 | $1.15 | standard |
MoonshotAI: Kimi K2 0905 moonshotai/kimi-k2-0905 | 262K | $0.296 | $1.19 | standard |
MoonshotAI: Kimi K2 0711 moonshotai/kimi-k2 | 63K | $0.140 | $2.49 | standard |
2 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
ByteDance: UI-TARS 7B bytedance/ui-tars-1.5-7b | 128K | $0.100 | $0.200 | budget |
ByteDance: Seed OSS 36B Instruct bytedance/seed-oss-36b-instruct | 131K | $0.104 | $0.415 | budget |
2 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Cogito V2 Preview Llama 109B deepcogito/cogito-v2-preview-llama-109b-moe | 33K | $0.180 | $0.590 | budget |
Deep Cogito: Cogito V2 Preview Deepseek 671B deepcogito/cogito-v2-preview-deepseek-671b | 164K | $1.25 | $1.25 | standard |
4 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Baidu: ERNIE 4.5 21B A3B baidu/ernie-4.5-21b-a3b | 120K | $0.070 | $0.280 | budget |
Baidu: ERNIE 4.5 VL 28B A3B baidu/ernie-4.5-vl-28b-a3b | 30K | $0.140 | $0.560 | budget |
Baidu: ERNIE 4.5 300B A47B baidu/ernie-4.5-300b-a47b | 123K | $0.280 | $1.10 | standard |
Baidu: ERNIE 4.5 VL 424B A47B baidu/ernie-4.5-vl-424b-a47b | 123K | $0.420 | $1.25 | standard |
4 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Z.AI: GLM 4 32B z-ai/glm-4-32b | 128K | $0.100 | $0.100 | budget |
Z.AI: GLM 4.5 Air z-ai/glm-4.5-air | 131K | $0.140 | $0.860 | standard |
Z.AI: GLM 4.5 z-ai/glm-4.5 | 131K | $0.330 | $1.32 | standard |
Z.AI: GLM 4.5V z-ai/glm-4.5v | 66K | $0.500 | $1.80 | standard |
1 model available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Tencent: Hunyuan A13B Instruct tencent/hunyuan-a13b-instruct | 33K | $0.030 | $0.030 | budget |
2 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
MiniMax: MiniMax-01 minimax/minimax-01 | 1.0M | $0.200 | $1.10 | standard |
MiniMax: MiniMax M1 minimax/minimax-m1 | 1.0M | $0.300 | $1.65 | standard |
16 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Meta: Llama 3.2 1B Instruct meta-llama/llama-3.2-1b-instruct | 131K | $0.0050 | $0.010 | budget |
Meta: Llama 3.1 8B Instruct meta-llama/llama-3.1-8b-instruct | 131K | $0.015 | $0.020 | budget |
Meta: Llama 3.2 3B Instruct meta-llama/llama-3.2-3b-instruct | 131K | $0.012 | $0.024 | budget |
Llama Guard 3 8B meta-llama/llama-guard-3-8b | 131K | $0.020 | $0.060 | budget |
Meta: Llama 3 8B Instruct meta-llama/llama-3-8b-instruct | 8K | $0.030 | $0.060 | budget |
Meta: Llama 3.2 11B Vision Instruct meta-llama/llama-3.2-11b-vision-instruct | 131K | $0.049 | $0.049 | budget |
Meta: Llama 3.3 70B Instruct meta-llama/llama-3.3-70b-instruct | 131K | $0.038 | $0.120 | budget |
Meta: Llama Guard 4 12B meta-llama/llama-guard-4-12b | 164K | $0.180 | $0.180 | budget |
Meta: Llama 4 Scout meta-llama/llama-4-scout | 1.0M | $0.080 | $0.300 | budget |
Meta: Llama 3.1 70B Instruct meta-llama/llama-3.1-70b-instruct | 131K | $0.100 | $0.280 | budget |
Meta: LlamaGuard 2 8B meta-llama/llama-guard-2-8b | 8K | $0.200 | $0.200 | budget |
Meta: Llama 3 70B Instruct meta-llama/llama-3-70b-instruct | 8K | $0.300 | $0.400 | budget |
Meta: Llama 4 Maverick meta-llama/llama-4-maverick | 1.0M | $0.150 | $0.600 | budget |
Meta: Llama 3.2 90B Vision Instruct meta-llama/llama-3.2-90b-vision-instruct | 33K | $0.350 | $0.400 | budget |
Meta: Llama 3.1 405B Instruct meta-llama/llama-3.1-405b-instruct | 33K | $0.800 | $0.800 | standard |
Meta: Llama 3.1 405B (base) meta-llama/llama-3.1-405b | 33K | $2.00 | $2.00 | standard |
8 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Microsoft: Phi 4 Multimodal Instruct microsoft/phi-4-multimodal-instruct | 131K | $0.050 | $0.100 | budget |
Microsoft: Phi-3.5 Mini 128K Instruct microsoft/phi-3.5-mini-128k-instruct | 128K | $0.100 | $0.100 | budget |
Microsoft: Phi-3 Mini 128K Instruct microsoft/phi-3-mini-128k-instruct | 128K | $0.100 | $0.100 | budget |
Microsoft: Phi 4 microsoft/phi-4 | 16K | $0.060 | $0.140 | budget |
Microsoft: Phi 4 Reasoning Plus microsoft/phi-4-reasoning-plus | 33K | $0.070 | $0.350 | budget |
WizardLM-2 8x22B microsoft/wizardlm-2-8x22b | 66K | $0.480 | $0.480 | budget |
Microsoft: MAI DS R1 microsoft/mai-ds-r1 | 164K | $0.200 | $0.800 | budget |
Microsoft: Phi-3 Medium 128K Instruct microsoft/phi-3-medium-128k-instruct | 128K | $1.00 | $1.00 | standard |
2 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
NVIDIA: Llama 3.1 Nemotron 70B Instruct nvidia/llama-3.1-nemotron-70b-instruct | 131K | $0.120 | $0.300 | budget |
NVIDIA: Llama 3.1 Nemotron Ultra 253B v1 nvidia/llama-3.1-nemotron-ultra-253b-v1 | 131K | $0.600 | $1.80 | standard |
6 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Perplexity: Sonar perplexity/sonar | 127K | $1.00 | $1.00 | standard |
Perplexity: Sonar Reasoning perplexity/sonar-reasoning | 127K | $1.00 | $5.00 | standard |
Perplexity: Sonar Reasoning Pro perplexity/sonar-reasoning-pro | 128K | $2.00 | $8.00 | premium |
Perplexity: Sonar Deep Research perplexity/sonar-deep-research | 128K | $2.00 | $8.00 | premium |
Perplexity: R1 1776 perplexity/r1-1776 | 128K | $2.00 | $8.00 | premium |
Perplexity: Sonar Pro perplexity/sonar-pro | 200K | $3.00 | $15 | premium |
3 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Amazon: Nova Micro 1.0 amazon/nova-micro-v1 | 128K | $0.035 | $0.140 | budget |
Amazon: Nova Lite 1.0 amazon/nova-lite-v1 | 300K | $0.060 | $0.240 | budget |
Amazon: Nova Pro 1.0 amazon/nova-pro-v1 | 300K | $0.800 | $3.20 | standard |
1 model available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
StepFun: Step3 stepfun-ai/step3 | 66K | $0.570 | $1.42 | standard |
6 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
NousResearch: Hermes 2 Pro - Llama-3 8B nousresearch/hermes-2-pro-llama-3-8b | 131K | $0.025 | $0.040 | budget |
Nous: Hermes 3 70B Instruct nousresearch/hermes-3-llama-3.1-70b | 131K | $0.100 | $0.280 | budget |
Nous: Hermes 4 70B nousresearch/hermes-4-70b | 131K | $0.093 | $0.373 | budget |
Nous: DeepHermes 3 Mistral 24B Preview nousresearch/deephermes-3-mistral-24b-preview | 33K | $0.093 | $0.373 | budget |
Nous: Hermes 4 405B nousresearch/hermes-4-405b | 131K | $0.200 | $0.800 | budget |
Nous: Hermes 3 405B Instruct nousresearch/hermes-3-llama-3.1-405b | 131K | $0.700 | $0.800 | standard |
2 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
AI21: Jamba Mini 1.7 ai21/jamba-mini-1.7 | 256K | $0.200 | $0.400 | budget |
AI21: Jamba Large 1.7 ai21/jamba-large-1.7 | 256K | $2.00 | $8.00 | premium |
1 model available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Switchpoint Router switchpoint/router | 131K | $0.850 | $3.40 | standard |
3 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
THUDM: GLM Z1 32B thudm/glm-z1-32b | 33K | $0.020 | $0.080 | budget |
THUDM: GLM 4.1V 9B Thinking thudm/glm-4.1v-9b-thinking | 66K | $0.035 | $0.138 | budget |
THUDM: GLM 4 32B thudm/glm-4-32b | 32K | $0.550 | $1.66 | standard |
3 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Dolphin3.0 R1 Mistral 24B cognitivecomputations/dolphin3.0-r1-mistral-24b | 33K | $0.010 | $0.034 | budget |
Dolphin3.0 Mistral 24B cognitivecomputations/dolphin3.0-mistral-24b | 33K | $0.037 | $0.148 | budget |
Dolphin 2.9.2 Mixtral 8x22B 🐬 cognitivecomputations/dolphin-mixtral-8x22b | 16K | $0.900 | $0.900 | standard |
1 model available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
TNG: DeepSeek R1T Chimera tngtech/deepseek-r1t-chimera | 164K | $0.200 | $0.800 | budget |
2 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Morph: Morph V3 Large morph/morph-v3-large | 82K | $0.900 | $1.90 | standard |
Morph: Morph V3 Fast morph/morph-v3-fast | 82K | $0.900 | $1.90 | standard |
5 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
TheDrummer: Skyfall 36B V2 thedrummer/skyfall-36b-v2 | 33K | $0.048 | $0.193 | budget |
TheDrummer: Rocinante 12B thedrummer/rocinante-12b | 33K | $0.170 | $0.430 | budget |
TheDrummer: UnslopNemo 12B thedrummer/unslopnemo-12b | 33K | $0.400 | $0.400 | budget |
TheDrummer: Anubis 70B V1.1 thedrummer/anubis-70b-v1.1 | 16K | $0.400 | $0.700 | standard |
TheDrummer: Anubis Pro 105B V1 thedrummer/anubis-pro-105b-v1 | 131K | $0.500 | $1.00 | standard |
2 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Inception: Mercury inception/mercury | 128K | $0.250 | $1.00 | standard |
Inception: Mercury Coder inception/mercury-coder | 128K | $0.250 | $1.00 | standard |
4 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Arcee AI: Spotlight arcee-ai/spotlight | 131K | $0.180 | $0.180 | budget |
Arcee AI: Coder Large arcee-ai/coder-large | 33K | $0.500 | $0.800 | standard |
Arcee AI: Virtuoso Large arcee-ai/virtuoso-large | 131K | $0.750 | $1.20 | standard |
Arcee AI: Maestro Reasoning arcee-ai/maestro-reasoning | 131K | $0.900 | $3.30 | standard |
1 model available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Shisa AI: Shisa V2 Llama 3.3 70B shisa-ai/shisa-v2-llama3.3-70b | 33K | $0.020 | $0.080 | budget |
1 model available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
EleutherAI: Llemma 7b eleutherai/llemma_7b | 4K | $0.800 | $1.20 | standard |
1 model available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
AlfredPros: CodeLLaMa 7B Instruct Solidity alfredpros/codellama-7b-instruct-solidity | 8K | $0.700 | $1.10 | standard |
1 model available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
ArliAI: QwQ 32B RpR v1 arliai/qwq-32b-arliai-rpr-v1 | 33K | $0.010 | $0.040 | budget |
1 model available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Agentica: Deepcoder 14B Preview agentica-org/deepcoder-14b-preview | 96K | $0.015 | $0.015 | budget |
2 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
AllenAI: Molmo 7B D allenai/molmo-7b-d | 4K | $0.100 | $0.200 | budget |
AllenAI: Olmo 2 32B Instruct allenai/olmo-2-0325-32b-instruct | 4K | $1.00 | $1.50 | standard |
3 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
AionLabs: Aion-RP 1.0 (8B) aion-labs/aion-rp-llama-3.1-8b | 33K | $0.200 | $0.200 | budget |
AionLabs: Aion-1.0-Mini aion-labs/aion-1.0-mini | 131K | $0.700 | $1.40 | standard |
AionLabs: Aion-1.0 aion-labs/aion-1.0 | 131K | $4.00 | $8.00 | premium |
2 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Liquid: LFM 7B liquid/lfm-7b | 33K | $0.010 | $0.010 | budget |
Liquid: LFM 3B liquid/lfm-3b | 33K | $0.020 | $0.020 | budget |
4 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Sao10K: Llama 3 8B Lunaris sao10k/l3-lunaris-8b | 8K | $0.020 | $0.050 | budget |
Sao10K: Llama 3.3 Euryale 70B sao10k/l3.3-euryale-70b | 131K | $0.650 | $0.750 | standard |
Sao10K: Llama 3.1 Euryale 70B v2.2 sao10k/l3.1-euryale-70b | 33K | $0.650 | $0.750 | standard |
Sao10k: Llama 3 Euryale 70B v2.1 sao10k/l3-euryale-70b | 8K | $1.48 | $1.48 | standard |
1 model available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Infermatic: Mistral Nemo Inferor 12B infermatic/mn-inferor-12b | 8K | $0.600 | $1.00 | standard |
1 model available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
SorcererLM 8x22B raifle/sorcererlm-8x22b | 16K | $4.50 | $4.50 | standard |
2 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Magnum v2 72B anthracite-org/magnum-v2-72b | 33K | $3.00 | $3.00 | standard |
Magnum v4 72B anthracite-org/magnum-v4-72b | 16K | $2.00 | $5.00 | standard |
2 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Inflection: Inflection 3 Productivity inflection/inflection-3-productivity | 8K | $2.50 | $10 | premium |
Inflection: Inflection 3 Pi inflection/inflection-3-pi | 8K | $2.50 | $10 | premium |
3 models available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
NeverSleep: Lumimaid v0.2 8B neversleep/llama-3.1-lumimaid-8b | 33K | $0.090 | $0.600 | budget |
Noromaid 20B neversleep/noromaid-20b | 4K | $1.00 | $1.75 | standard |
NeverSleep: Llama 3 Lumimaid 70B neversleep/llama-3-lumimaid-70b | 8K | $4.00 | $6.00 | premium |
1 model available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Midnight Rose 70B sophosympatheia/midnight-rose-70b | 4K | $0.800 | $0.800 | standard |
1 model available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Goliath 120B alpindale/goliath-120b | 6K | $4.00 | $5.50 | standard |
1 model available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Pygmalion: Mythalion 13B pygmalionai/mythalion-13b | 4K | $0.700 | $1.10 | standard |
1 model available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
Mancer: Weaver (alpha) mancer/weaver | 8K | $1.13 | $1.13 | standard |
1 model available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
ReMM SLERP 13B undi95/remm-slerp-l2-13b | 6K | $0.450 | $0.650 | standard |
1 model available
Model | Context | Input $/M | Output $/M | Tier |
---|---|---|---|---|
MythoMax 13B gryphe/mythomax-l2-13b | 4K | $0.060 | $0.060 | budget |
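
As a rough illustration of how the Input $/M and Output $/M columns translate into the cost of a single request, here is a minimal sketch. It assumes that both prices are dollars per one million tokens and that input and output tokens are billed separately at their listed rates. The function name `estimate_cost` and its parameters are hypothetical, and the sketch ignores anything the tables do not show, such as cached-token discounts or per-request fees.

```python
def estimate_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Estimate the dollar cost of one request from per-million-token prices."""
    tokens_per_million = 1_000_000
    # Each side is billed at its own rate, scaled from "per million" to actual tokens.
    input_cost = input_tokens / tokens_per_million * input_price_per_m
    output_cost = output_tokens / tokens_per_million * output_price_per_m
    return input_cost + output_cost


# Prices taken from the openai/gpt-4o-mini row: $0.150 input, $0.600 output.
cost = estimate_cost(10_000, 2_000, 0.150, 0.600)
print(f"${cost:.4f}")  # prints "$0.0027"
```

Because output is usually priced higher than input, the ratio of generated tokens to prompt tokens can matter as much as the headline rates when comparing two models.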