Provider

GPT

Models

56

Pricing table for GPT models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
OpenAI: gpt-oss-20b openai/gpt-oss-20b	131K	$0.030	$0.140	LOW
OpenAI: gpt-oss-120b openai/gpt-oss-120b	131K	$0.039	$0.190	LOW
OpenAI: gpt-oss-120b (exacto) openai/gpt-oss-120b:exacto	131K	$0.039	$0.190	LOW
OpenAI: gpt-oss-safeguard-20b openai/gpt-oss-safeguard-20b	131K	$0.075	$0.300	LOW
OpenAI: GPT-5 Nano openai/gpt-5-nano	400K	$0.050	$0.400	LOW
OpenAI: GPT-4.1 Nano openai/gpt-4.1-nano	1M	$0.100	$0.400	LOW
OpenAI: GPT-4o-mini Search Preview openai/gpt-4o-mini-search-preview	128K	$0.150	$0.600	LOW
OpenAI: GPT-4o-mini (2024-07-18) openai/gpt-4o-mini-2024-07-18	128K	$0.150	$0.600	LOW
OpenAI: GPT-4o-mini openai/gpt-4o-mini	128K	$0.150	$0.600	LOW
OpenAI: GPT-4.1 Mini openai/gpt-4.1-mini	1M	$0.400	$1.60	MED
OpenAI: GPT-3.5 Turbo openai/gpt-3.5-turbo	16K	$0.500	$1.50	MED
OpenAI: GPT-5.1-Codex-Mini openai/gpt-5.1-codex-mini	400K	$0.250	$2.00	MED
OpenAI: GPT-5 Mini openai/gpt-5-mini	400K	$0.250	$2.00	MED
OpenAI: GPT Audio Mini openai/gpt-audio-mini	128K	$0.600	$2.40	MED
OpenAI: GPT-3.5 Turbo (older v0613) openai/gpt-3.5-turbo-0613	4K	$1.00	$2.00	MED
OpenAI: GPT-3.5 Turbo Instruct openai/gpt-3.5-turbo-instruct	4K	$1.50	$2.00	MED
OpenAI: GPT-5 Image Mini openai/gpt-5-image-mini	400K	$2.50	$2.00	MED
OpenAI: o4 Mini High openai/o4-mini-high	200K	$1.10	$4.40	MED
OpenAI: o4 Mini openai/o4-mini	200K	$1.10	$4.40	MED
OpenAI: o3 Mini High openai/o3-mini-high	200K	$1.10	$4.40	MED
OpenAI: o3 Mini openai/o3-mini	200K	$1.10	$4.40	MED
OpenAI: GPT-3.5 Turbo 16k openai/gpt-3.5-turbo-16k	16K	$3.00	$4.00	MED
OpenAI: o4 Mini Deep Research openai/o4-mini-deep-research	200K	$2.00	$8.00	HIGH
OpenAI: o3 openai/o3	200K	$2.00	$8.00	HIGH
OpenAI: GPT-4.1 openai/gpt-4.1	1M	$2.00	$8.00	HIGH
OpenAI: GPT-5.1-Codex-Max openai/gpt-5.1-codex-max	400K	$1.25	$10	HIGH
OpenAI: GPT-5.1 openai/gpt-5.1	400K	$1.25	$10	HIGH
OpenAI: GPT-5.1 Chat openai/gpt-5.1-chat	128K	$1.25	$10	HIGH
OpenAI: GPT-5.1-Codex openai/gpt-5.1-codex	400K	$1.25	$10	HIGH
OpenAI: GPT-5 Codex openai/gpt-5-codex	400K	$1.25	$10	HIGH
OpenAI: GPT-5 Chat openai/gpt-5-chat	128K	$1.25	$10	HIGH
OpenAI: GPT-5 openai/gpt-5	400K	$1.25	$10	HIGH
OpenAI: GPT Audio openai/gpt-audio	128K	$2.50	$10	HIGH
OpenAI: GPT-4o Audio openai/gpt-4o-audio-preview	128K	$2.50	$10	HIGH
OpenAI: GPT-4o Search Preview openai/gpt-4o-search-preview	128K	$2.50	$10	HIGH
OpenAI: GPT-4o (2024-11-20) openai/gpt-4o-2024-11-20	128K	$2.50	$10	HIGH
OpenAI: GPT-4o (2024-08-06) openai/gpt-4o-2024-08-06	128K	$2.50	$10	HIGH
OpenAI: GPT-4o openai/gpt-4o	128K	$2.50	$10	HIGH
OpenAI: GPT-5.3-Codex openai/gpt-5.3-codex	400K	$1.75	$14	HIGH
OpenAI: GPT-5.2-Codex openai/gpt-5.2-codex	400K	$1.75	$14	HIGH
OpenAI: GPT-5.2 Chat openai/gpt-5.2-chat	128K	$1.75	$14	HIGH
OpenAI: GPT-5.2 openai/gpt-5.2	400K	$1.75	$14	HIGH
OpenAI: GPT-5 Image openai/gpt-5-image	400K	$10	$10	HIGH
OpenAI: GPT-4o (2024-05-13) openai/gpt-4o-2024-05-13	128K	$5.00	$15	HIGH
OpenAI: GPT-4o (extended) openai/gpt-4o:extended	128K	$6.00	$18	HIGH
OpenAI: GPT-4 Turbo openai/gpt-4-turbo	128K	$10	$30	HIGH
OpenAI: GPT-4 Turbo Preview openai/gpt-4-turbo-preview	128K	$10	$30	HIGH
OpenAI: GPT-4 Turbo (older v1106) openai/gpt-4-1106-preview	128K	$10	$30	HIGH
OpenAI: o3 Deep Research openai/o3-deep-research	200K	$10	$40	HIGH
OpenAI: o1 openai/o1	200K	$15	$60	HIGH
OpenAI: GPT-4 (older v0314) openai/gpt-4-0314	8K	$30	$60	HIGH
OpenAI: GPT-4 openai/gpt-4	8K	$30	$60	HIGH
OpenAI: o3 Pro openai/o3-pro	200K	$20	$80	HIGH
OpenAI: GPT-5 Pro openai/gpt-5-pro	400K	$15	$120	HIGH
OpenAI: GPT-5.2 Pro openai/gpt-5.2-pro	400K	$21	$168	HIGH
OpenAI: o1-pro openai/o1-pro	200K	$150	$600	HIGH

Provider

Claude

Models

13

Pricing table for Claude models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Anthropic: Claude 3 Haiku anthropic/claude-3-haiku	200K	$0.250	$1.25	MED
Anthropic: Claude 3.5 Haiku anthropic/claude-3.5-haiku	200K	$0.800	$4.00	MED
Anthropic: Claude Haiku 4.5 anthropic/claude-haiku-4.5	200K	$1.00	$5.00	MED
Anthropic: Claude Sonnet 4.6 anthropic/claude-sonnet-4.6	1M	$3.00	$15	HIGH
Anthropic: Claude Sonnet 4.5 anthropic/claude-sonnet-4.5	1M	$3.00	$15	HIGH
Anthropic: Claude Sonnet 4 anthropic/claude-sonnet-4	1M	$3.00	$15	HIGH
Anthropic: Claude 3.7 Sonnet anthropic/claude-3.7-sonnet	200K	$3.00	$15	HIGH
Anthropic: Claude 3.7 Sonnet (thinking) anthropic/claude-3.7-sonnet:thinking	200K	$3.00	$15	HIGH
Anthropic: Claude Opus 4.6 anthropic/claude-opus-4.6	1M	$5.00	$25	HIGH
Anthropic: Claude Opus 4.5 anthropic/claude-opus-4.5	200K	$5.00	$25	HIGH
Anthropic: Claude 3.5 Sonnet anthropic/claude-3.5-sonnet	200K	$6.00	$30	HIGH
Anthropic: Claude Opus 4.1 anthropic/claude-opus-4.1	200K	$15	$75	HIGH
Anthropic: Claude Opus 4 anthropic/claude-opus-4	200K	$15	$75	HIGH

Provider

Gemini

Models

21

Pricing table for Gemini models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Google: Gemma 3n 4B google/gemma-3n-e4b-it	33K	$0.020	$0.040	LOW
Google: Gemma 3 4B google/gemma-3-4b-it	131K	$0.040	$0.080	LOW
Google: Gemma 2 9B google/gemma-2-9b-it	8K	$0.030	$0.090	LOW
Google: Gemma 3 12B google/gemma-3-12b-it	131K	$0.040	$0.130	LOW
Google: Gemma 3 27B google/gemma-3-27b-it	128K	$0.040	$0.150	LOW
Google: Gemini 2.0 Flash Lite google/gemini-2.0-flash-lite-001	1M	$0.075	$0.300	LOW
Google: Gemini 2.5 Flash Lite Preview 09-2025 google/gemini-2.5-flash-lite-preview-09-2025	1M	$0.100	$0.400	LOW
Google: Gemini 2.5 Flash Lite google/gemini-2.5-flash-lite	1M	$0.100	$0.400	LOW
Google: Gemini 2.0 Flash google/gemini-2.0-flash-001	1M	$0.100	$0.400	LOW
Google: Gemma 2 27B google/gemma-2-27b-it	8K	$0.650	$0.650	MED
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview) google/gemini-3.1-flash-image-preview	66K	$0.250	$1.50	MED
Google: Nano Banana (Gemini 2.5 Flash Image) google/gemini-2.5-flash-image	33K	$0.300	$2.50	MED
Google: Gemini 2.5 Flash google/gemini-2.5-flash	1M	$0.300	$2.50	MED
Google: Gemini 3 Flash Preview google/gemini-3-flash-preview	1M	$0.500	$3.00	MED
Google: Gemini 2.5 Pro google/gemini-2.5-pro	1M	$1.25	$10	HIGH
Google: Gemini 2.5 Pro Preview 06-05 google/gemini-2.5-pro-preview	1M	$1.25	$10	HIGH
Google: Gemini 2.5 Pro Preview 05-06 google/gemini-2.5-pro-preview-05-06	1M	$1.25	$10	HIGH
Google: Gemini 3.1 Pro Preview Custom Tools google/gemini-3.1-pro-preview-customtools	1M	$2.00	$12	HIGH
Google: Gemini 3.1 Pro Preview google/gemini-3.1-pro-preview	1M	$2.00	$12	HIGH
Google: Nano Banana Pro (Gemini 3 Pro Image Preview) google/gemini-3-pro-image-preview	66K	$2.00	$12	HIGH
Google: Gemini 3 Pro Preview google/gemini-3-pro-preview	1M	$2.00	$12	HIGH

Provider

Grok

Models

8

Pricing table for Grok models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
xAI: Grok 4.1 Fast x-ai/grok-4.1-fast	2M	$0.200	$0.500	LOW
xAI: Grok 4 Fast x-ai/grok-4-fast	2M	$0.200	$0.500	LOW
xAI: Grok 3 Mini x-ai/grok-3-mini	131K	$0.300	$0.500	LOW
xAI: Grok 3 Mini Beta x-ai/grok-3-mini-beta	131K	$0.300	$0.500	LOW
xAI: Grok Code Fast 1 x-ai/grok-code-fast-1	256K	$0.200	$1.50	MED
xAI: Grok 4 x-ai/grok-4	256K	$3.00	$15	HIGH
xAI: Grok 3 x-ai/grok-3	131K	$3.00	$15	HIGH
xAI: Grok 3 Beta x-ai/grok-3-beta	131K	$3.00	$15	HIGH

Provider

Qwen

Models

44

Pricing table for Qwen models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Qwen: Qwen2.5 Coder 7B Instruct qwen/qwen2.5-coder-7b-instruct	33K	$0.030	$0.090	LOW
Qwen: Qwen2.5 7B Instruct qwen/qwen-2.5-7b-instruct	33K	$0.040	$0.100	LOW
Qwen: Qwen3 235B A22B Instruct 2507 qwen/qwen3-235b-a22b-2507	262K	$0.071	$0.100	LOW
Qwen: Qwen-Turbo qwen/qwen-turbo	131K	$0.050	$0.200	LOW
Qwen: Qwen3 14B qwen/qwen3-14b	41K	$0.060	$0.240	LOW
Qwen: Qwen3 32B qwen/qwen3-32b	41K	$0.080	$0.240	LOW
Qwen: Qwen3 Coder 30B A3B Instruct qwen/qwen3-coder-30b-a3b-instruct	160K	$0.070	$0.270	LOW
Qwen: Qwen3 30B A3B qwen/qwen3-30b-a3b	41K	$0.080	$0.280	LOW
Qwen: Qwen3 30B A3B Instruct 2507 qwen/qwen3-30b-a3b-instruct-2507	262K	$0.090	$0.300	LOW
Qwen: Qwen3 30B A3B Thinking 2507 qwen/qwen3-30b-a3b-thinking-2507	33K	$0.051	$0.340	LOW
Qwen2.5 Coder 32B Instruct qwen/qwen-2.5-coder-32b-instruct	33K	$0.200	$0.200	LOW
Qwen: Qwen2.5-VL 7B Instruct qwen/qwen-2.5-vl-7b-instruct	33K	$0.200	$0.200	LOW
Qwen: Qwen3 8B qwen/qwen3-8b	41K	$0.050	$0.400	LOW
Qwen: Qwen3.5-Flash qwen/qwen3.5-flash-02-23	1M	$0.100	$0.400	LOW
Qwen2.5 72B Instruct qwen/qwen-2.5-72b-instruct	33K	$0.120	$0.390	LOW
Qwen: Qwen3 VL 32B Instruct qwen/qwen3-vl-32b-instruct	131K	$0.104	$0.416	LOW
Qwen: QwQ 32B qwen/qwq-32b	33K	$0.150	$0.400	LOW
Qwen: Qwen3 VL 8B Instruct qwen/qwen3-vl-8b-instruct	131K	$0.080	$0.500	LOW
Qwen: Qwen3 VL 30B A3B Instruct qwen/qwen3-vl-30b-a3b-instruct	131K	$0.130	$0.520	LOW
Qwen: Qwen2.5 VL 32B Instruct qwen/qwen2.5-vl-32b-instruct	128K	$0.200	$0.600	LOW
Qwen: Qwen VL Plus qwen/qwen-vl-plus	131K	$0.210	$0.630	LOW
Qwen: Qwen3 Coder Next qwen/qwen3-coder-next	262K	$0.120	$0.750	LOW
Qwen: Qwen3 VL 235B A22B Instruct qwen/qwen3-vl-235b-a22b-instruct	262K	$0.200	$0.880	MED
Qwen: Qwen3 Next 80B A3B Instruct qwen/qwen3-next-80b-a3b-instruct	262K	$0.090	$1.10	MED
Qwen: Qwen3 Coder 480B A35B qwen/qwen3-coder	262K	$0.220	$1.00	MED
Qwen: Qwen3 Next 80B A3B Thinking qwen/qwen3-next-80b-a3b-thinking	128K	$0.150	$1.20	MED
Qwen: Qwen3 VL 8B Thinking qwen/qwen3-vl-8b-thinking	131K	$0.117	$1.36	MED
Qwen: Qwen Plus 0728 (thinking) qwen/qwen-plus-2025-07-28:thinking	1M	$0.400	$1.20	MED
Qwen: Qwen Plus 0728 qwen/qwen-plus-2025-07-28	1M	$0.400	$1.20	MED
Qwen: Qwen2.5 VL 72B Instruct qwen/qwen2.5-vl-72b-instruct	33K	$0.800	$0.800	MED
Qwen: Qwen-Plus qwen/qwen-plus	1M	$0.400	$1.20	MED
Qwen: Qwen3 Coder Flash qwen/qwen3-coder-flash	1M	$0.300	$1.50	MED
Qwen: Qwen3 Coder 480B A35B (exacto) qwen/qwen3-coder:exacto	262K	$0.220	$1.80	MED
Qwen: Qwen3.5-35B-A3B qwen/qwen3.5-35b-a3b	262K	$0.250	$2.00	MED
Qwen: Qwen3 235B A22B qwen/qwen3-235b-a22b	131K	$0.455	$1.82	MED
Qwen: Qwen3.5-27B qwen/qwen3.5-27b	262K	$0.300	$2.40	MED
Qwen: Qwen3.5 Plus 2026-02-15 qwen/qwen3.5-plus-02-15	1M	$0.400	$2.40	MED
Qwen: Qwen3.5-122B-A10B qwen/qwen3.5-122b-a10b	262K	$0.400	$3.20	MED
Qwen: Qwen VL Max qwen/qwen-vl-max	131K	$0.800	$3.20	MED
Qwen: Qwen3.5 397B A17B qwen/qwen3.5-397b-a17b	262K	$0.550	$3.50	MED
Qwen: Qwen3 Coder Plus qwen/qwen3-coder-plus	1M	$1.00	$5.00	MED
Qwen: Qwen3 Max Thinking qwen/qwen3-max-thinking	262K	$1.20	$6.00	MED
Qwen: Qwen3 Max qwen/qwen3-max	262K	$1.20	$6.00	MED
Qwen: Qwen-Max qwen/qwen-max	33K	$1.60	$6.40	MED

Provider

DeepSeek

Models

12

Pricing table for DeepSeek models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
DeepSeek: R1 Distill Qwen 32B deepseek/deepseek-r1-distill-qwen-32b	33K	$0.290	$0.290	LOW
DeepSeek: DeepSeek V3.2 deepseek/deepseek-v3.2	164K	$0.250	$0.400	LOW
DeepSeek: DeepSeek V3.2 Exp deepseek/deepseek-v3.2-exp	164K	$0.270	$0.410	LOW
DeepSeek: DeepSeek V3.1 deepseek/deepseek-chat-v3.1	33K	$0.150	$0.750	LOW
DeepSeek: DeepSeek V3 0324 deepseek/deepseek-chat-v3-0324	164K	$0.200	$0.770	LOW
DeepSeek: DeepSeek V3.1 Terminus (exacto) deepseek/deepseek-v3.1-terminus:exacto	164K	$0.210	$0.790	LOW
DeepSeek: DeepSeek V3.1 Terminus deepseek/deepseek-v3.1-terminus	164K	$0.210	$0.790	LOW
DeepSeek: DeepSeek V3 deepseek/deepseek-chat	164K	$0.320	$0.890	MED
DeepSeek: R1 Distill Llama 70B deepseek/deepseek-r1-distill-llama-70b	131K	$0.700	$0.800	MED
DeepSeek: DeepSeek V3.2 Speciale deepseek/deepseek-v3.2-speciale	164K	$0.400	$1.20	MED
DeepSeek: R1 0528 deepseek/deepseek-r1-0528	164K	$0.450	$2.15	MED
DeepSeek: R1 deepseek/deepseek-r1	64K	$0.700	$2.50	MED

Provider

Mistral

Models

26

Pricing table for Mistral models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Mistral: Mistral Nemo mistralai/mistral-nemo	131K	$0.020	$0.040	LOW
Mistral: Mistral Small 3 mistralai/mistral-small-24b-instruct-2501	33K	$0.050	$0.080	LOW
Mistral: Ministral 3 3B 2512 mistralai/ministral-3b-2512	131K	$0.100	$0.100	LOW
Mistral: Mistral Small 3.2 24B mistralai/mistral-small-3.2-24b-instruct	131K	$0.060	$0.180	LOW
Mistral: Ministral 3 8B 2512 mistralai/ministral-8b-2512	262K	$0.150	$0.150	LOW
Mistral: Mistral 7B Instruct v0.1 mistralai/mistral-7b-instruct-v0.1	3K	$0.110	$0.190	LOW
Mistral: Mistral Small Creative mistralai/mistral-small-creative	33K	$0.100	$0.300	LOW
Mistral: Ministral 3 14B 2512 mistralai/ministral-14b-2512	262K	$0.200	$0.200	LOW
Mistral: Voxtral Small 24B 2507 mistralai/voxtral-small-24b-2507	32K	$0.100	$0.300	LOW
Mistral: Devstral Small 1.1 mistralai/devstral-small	131K	$0.100	$0.300	LOW
Mistral: Mistral 7B Instruct mistralai/mistral-7b-instruct	33K	$0.200	$0.200	LOW
Mistral: Mistral 7B Instruct v0.3 mistralai/mistral-7b-instruct-v0.3	33K	$0.200	$0.200	LOW
Mistral: Saba mistralai/mistral-saba	33K	$0.200	$0.600	LOW
Mistral: Mistral Small 3.1 24B mistralai/mistral-small-3.1-24b-instruct	128K	$0.350	$0.560	LOW
Mistral: Mixtral 8x7B Instruct mistralai/mixtral-8x7b-instruct	33K	$0.540	$0.540	MED
Mistral: Codestral 2508 mistralai/codestral-2508	256K	$0.300	$0.900	MED
Mistral: Mistral Large 3 2512 mistralai/mistral-large-2512	262K	$0.500	$1.50	MED
Mistral: Devstral 2 2512 mistralai/devstral-2512	262K	$0.400	$2.00	MED
Mistral: Mistral Medium 3.1 mistralai/mistral-medium-3.1	131K	$0.400	$2.00	MED
Mistral: Devstral Medium mistralai/devstral-medium	131K	$0.400	$2.00	MED
Mistral: Mistral Medium 3 mistralai/mistral-medium-3	131K	$0.400	$2.00	MED
Mistral Large 2411 mistralai/mistral-large-2411	131K	$2.00	$6.00	MED
Mistral Large 2407 mistralai/mistral-large-2407	131K	$2.00	$6.00	MED
Mistral: Pixtral Large 2411 mistralai/pixtral-large-2411	131K	$2.00	$6.00	MED
Mistral: Mixtral 8x22B Instruct mistralai/mixtral-8x22b-instruct	66K	$2.00	$6.00	MED
Mistral Large mistralai/mistral-large	128K	$2.00	$6.00	MED

Provider

Cohere

Models

4

Pricing table for Cohere models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Cohere: Command R7B (12-2024) cohere/command-r7b-12-2024	128K	$0.037	$0.150	LOW
Cohere: Command R (08-2024) cohere/command-r-08-2024	128K	$0.150	$0.600	LOW
Cohere: Command A cohere/command-a	256K	$2.50	$10	HIGH
Cohere: Command R+ (08-2024) cohere/command-r-plus-08-2024	128K	$2.50	$10	HIGH

Provider

MoonshotAI

Models

5

Pricing table for MoonshotAI models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
MoonshotAI: Kimi K2 0905 moonshotai/kimi-k2-0905	131K	$0.400	$2.00	MED
MoonshotAI: Kimi K2 Thinking moonshotai/kimi-k2-thinking	131K	$0.470	$2.00	MED
MoonshotAI: Kimi K2.5 moonshotai/kimi-k2.5	262K	$0.450	$2.20	MED
MoonshotAI: Kimi K2 0711 moonshotai/kimi-k2	131K	$0.550	$2.20	MED
MoonshotAI: Kimi K2 0905 (exacto) moonshotai/kimi-k2-0905:exacto	262K	$0.600	$2.50	MED

Provider

ByteDance

Models

1

Pricing table for ByteDance models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
ByteDance: UI-TARS 7B bytedance/ui-tars-1.5-7b	128K	$0.100	$0.200	LOW

Provider

DeepCogito

Models

1

Pricing table for DeepCogito models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Deep Cogito: Cogito v2.1 671B deepcogito/cogito-v2.1-671b	128K	$1.25	$1.25	MED

Provider

Baidu

Models

5

Pricing table for Baidu models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Baidu: ERNIE 4.5 21B A3B Thinking baidu/ernie-4.5-21b-a3b-thinking	131K	$0.070	$0.280	LOW
Baidu: ERNIE 4.5 21B A3B baidu/ernie-4.5-21b-a3b	120K	$0.070	$0.280	LOW
Baidu: ERNIE 4.5 VL 28B A3B baidu/ernie-4.5-vl-28b-a3b	30K	$0.140	$0.560	LOW
Baidu: ERNIE 4.5 300B A47B baidu/ernie-4.5-300b-a47b	123K	$0.280	$1.10	MED
Baidu: ERNIE 4.5 VL 424B A47B baidu/ernie-4.5-vl-424b-a47b	123K	$0.420	$1.25	MED

Provider

Z-AI

Models

10

Pricing table for Z-AI models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Z.ai: GLM 4 32B z-ai/glm-4-32b	128K	$0.100	$0.100	LOW
Z.ai: GLM 4.7 Flash z-ai/glm-4.7-flash	203K	$0.060	$0.400	LOW
Z.ai: GLM 4.5 Air z-ai/glm-4.5-air	131K	$0.130	$0.850	LOW
Z.ai: GLM 4.6V z-ai/glm-4.6v	131K	$0.300	$0.900	MED
Z.ai: GLM 4.7 z-ai/glm-4.7	203K	$0.300	$1.40	MED
Z.ai: GLM 4.6 z-ai/glm-4.6	203K	$0.350	$1.71	MED
Z.ai: GLM 4.6 (exacto) z-ai/glm-4.6:exacto	205K	$0.440	$1.76	MED
Z.ai: GLM 4.5V z-ai/glm-4.5v	66K	$0.600	$1.80	MED
Z.ai: GLM 4.5 z-ai/glm-4.5	131K	$0.550	$2.00	MED
Z.ai: GLM 5 z-ai/glm-5	205K	$0.950	$2.55	MED

Provider

Tencent

Models

1

Pricing table for Tencent models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Tencent: Hunyuan A13B Instruct tencent/hunyuan-a13b-instruct	131K	$0.140	$0.570	LOW

Provider

MiniMax

Models

6

Pricing table for MiniMax models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
MiniMax: MiniMax M2.1 minimax/minimax-m2.1	197K	$0.270	$0.950	MED
MiniMax: MiniMax M2 minimax/minimax-m2	197K	$0.255	$1.00	MED
MiniMax: MiniMax-01 minimax/minimax-01	1M	$0.200	$1.10	MED
MiniMax: MiniMax M2.5 minimax/minimax-m2.5	197K	$0.295	$1.20	MED
MiniMax: MiniMax M2-her minimax/minimax-m2-her	66K	$0.300	$1.20	MED
MiniMax: MiniMax M1 minimax/minimax-m1	1M	$0.400	$2.20	MED

Provider

Meta-Llama

Models

15

Pricing table for Meta-Llama models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Meta: Llama 3.2 3B Instruct meta-llama/llama-3.2-3b-instruct	131K	$0.020	$0.020	LOW
Meta: Llama 3.1 8B Instruct meta-llama/llama-3.1-8b-instruct	16K	$0.020	$0.050	LOW
Meta: Llama 3 8B Instruct meta-llama/llama-3-8b-instruct	8K	$0.030	$0.040	LOW
Llama Guard 3 8B meta-llama/llama-guard-3-8b	131K	$0.020	$0.060	LOW
Meta: Llama 3.2 11B Vision Instruct meta-llama/llama-3.2-11b-vision-instruct	131K	$0.049	$0.049	LOW
Meta: Llama 3.2 1B Instruct meta-llama/llama-3.2-1b-instruct	60K	$0.027	$0.200	LOW
Meta: Llama Guard 4 12B meta-llama/llama-guard-4-12b	164K	$0.180	$0.180	LOW
Meta: Llama 4 Scout meta-llama/llama-4-scout	328K	$0.080	$0.300	LOW
Meta: LlamaGuard 2 8B meta-llama/llama-guard-2-8b	8K	$0.200	$0.200	LOW
Meta: Llama 3.3 70B Instruct meta-llama/llama-3.3-70b-instruct	131K	$0.100	$0.320	LOW
Meta: Llama 4 Maverick meta-llama/llama-4-maverick	1M	$0.150	$0.600	LOW
Meta: Llama 3.1 70B Instruct meta-llama/llama-3.1-70b-instruct	131K	$0.400	$0.400	LOW
Meta: Llama 3 70B Instruct meta-llama/llama-3-70b-instruct	8K	$0.510	$0.740	MED
Meta: Llama 3.1 405B (base) meta-llama/llama-3.1-405b	33K	$4.00	$4.00	MED
Meta: Llama 3.1 405B Instruct meta-llama/llama-3.1-405b-instruct	131K	$4.00	$4.00	MED

Provider

Microsoft

Models

2

Pricing table for Microsoft models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Microsoft: Phi 4 microsoft/phi-4	16K	$0.060	$0.140	LOW
WizardLM-2 8x22B microsoft/wizardlm-2-8x22b	66K	$0.620	$0.620	MED

Provider

NVIDIA

Models

5

Pricing table for NVIDIA models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
NVIDIA: Nemotron Nano 9B V2 nvidia/nemotron-nano-9b-v2	131K	$0.040	$0.160	LOW
NVIDIA: Nemotron 3 Nano 30B A3B nvidia/nemotron-3-nano-30b-a3b	262K	$0.050	$0.200	LOW
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 nvidia/llama-3.3-nemotron-super-49b-v1.5	131K	$0.100	$0.400	LOW
NVIDIA: Nemotron Nano 12B 2 VL nvidia/nemotron-nano-12b-v2-vl	131K	$0.200	$0.600	LOW
NVIDIA: Llama 3.1 Nemotron 70B Instruct nvidia/llama-3.1-nemotron-70b-instruct	131K	$1.20	$1.20	MED

Provider

Perplexity

Models

5

Pricing table for Perplexity models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Perplexity: Sonar perplexity/sonar	127K	$1.00	$1.00	MED
Perplexity: Sonar Reasoning Pro perplexity/sonar-reasoning-pro	128K	$2.00	$8.00	HIGH
Perplexity: Sonar Deep Research perplexity/sonar-deep-research	128K	$2.00	$8.00	HIGH
Perplexity: Sonar Pro Search perplexity/sonar-pro-search	200K	$3.00	$15	HIGH
Perplexity: Sonar Pro perplexity/sonar-pro	200K	$3.00	$15	HIGH

Provider

Amazon

Models

5

Pricing table for Amazon models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Amazon: Nova Micro 1.0 amazon/nova-micro-v1	128K	$0.035	$0.140	LOW
Amazon: Nova Lite 1.0 amazon/nova-lite-v1	300K	$0.060	$0.240	LOW
Amazon: Nova 2 Lite amazon/nova-2-lite-v1	1M	$0.300	$2.50	MED
Amazon: Nova Pro 1.0 amazon/nova-pro-v1	300K	$0.800	$3.20	MED
Amazon: Nova Premier 1.0 amazon/nova-premier-v1	1M	$2.50	$13	HIGH

Provider

bytedance-seed

Models

3

Pricing table for bytedance-seed models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
ByteDance Seed: Seed 1.6 Flash bytedance-seed/seed-1.6-flash	262K	$0.075	$0.300	LOW
ByteDance Seed: Seed-2.0-Mini bytedance-seed/seed-2.0-mini	262K	$0.100	$0.400	LOW
ByteDance Seed: Seed 1.6 bytedance-seed/seed-1.6	262K	$0.250	$2.00	MED

Provider

liquid

Models

3

Pricing table for liquid models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
LiquidAI: LFM2-8B-A1B liquid/lfm2-8b-a1b	33K	$0.010	$0.020	LOW
LiquidAI: LFM2-2.6B liquid/lfm-2.2-6b	33K	$0.010	$0.020	LOW
LiquidAI: LFM2-24B-A2B liquid/lfm-2-24b-a2b	33K	$0.030	$0.120	LOW

Provider

aion-labs

Models

4

Pricing table for aion-labs models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
AionLabs: Aion-1.0-Mini aion-labs/aion-1.0-mini	131K	$0.700	$1.40	MED
AionLabs: Aion-2.0 aion-labs/aion-2.0	131K	$0.800	$1.60	MED
AionLabs: Aion-RP 1.0 (8B) aion-labs/aion-rp-llama-3.1-8b	33K	$0.800	$1.60	MED
AionLabs: Aion-1.0 aion-labs/aion-1.0	131K	$4.00	$8.00	HIGH

Provider

stepfun

Models

1

Pricing table for stepfun models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
StepFun: Step 3.5 Flash stepfun/step-3.5-flash	256K	$0.100	$0.300	LOW

Provider

arcee-ai

Models

5

Pricing table for arcee-ai models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Arcee AI: Trinity Mini arcee-ai/trinity-mini	131K	$0.045	$0.150	LOW
Arcee AI: Spotlight arcee-ai/spotlight	131K	$0.180	$0.180	LOW
Arcee AI: Coder Large arcee-ai/coder-large	33K	$0.500	$0.800	MED
Arcee AI: Virtuoso Large arcee-ai/virtuoso-large	131K	$0.750	$1.20	MED
Arcee AI: Maestro Reasoning arcee-ai/maestro-reasoning	131K	$0.900	$3.30	MED

Provider

writer

Models

1

Pricing table for writer models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Writer: Palmyra X5 writer/palmyra-x5	1M	$0.600	$6.00	MED

Provider

allenai

Models

6

Pricing table for allenai models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
AllenAI: Olmo 2 32B Instruct allenai/olmo-2-0325-32b-instruct	128K	$0.050	$0.200	LOW
AllenAI: Olmo 3 7B Instruct allenai/olmo-3-7b-instruct	66K	$0.100	$0.200	LOW
AllenAI: Olmo 3 7B Think allenai/olmo-3-7b-think	66K	$0.120	$0.200	LOW
AllenAI: Molmo2 8B allenai/molmo-2-8b	37K	$0.200	$0.200	LOW
AllenAI: Olmo 3 32B Think allenai/olmo-3-32b-think	66K	$0.150	$0.500	LOW
AllenAI: Olmo 3.1 32B Instruct allenai/olmo-3.1-32b-instruct	66K	$0.200	$0.600	LOW

Provider

xiaomi

Models

1

Pricing table for xiaomi models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Xiaomi: MiMo-V2-Flash xiaomi/mimo-v2-flash	262K	$0.090	$0.290	LOW

Provider

relace

Models

2

Pricing table for relace models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Relace: Relace Apply 3 relace/relace-apply-3	256K	$0.850	$1.25	MED
Relace: Relace Search relace/relace-search	256K	$1.00	$3.00	MED

Provider

nex-agi

Models

1

Pricing table for nex-agi models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Nex AGI: DeepSeek V3.1 Nex N1 nex-agi/deepseek-v3.1-nex-n1	131K	$0.270	$1.00	MED

Provider

essentialai

Models

1

Pricing table for essentialai models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
EssentialAI: Rnj 1 Instruct essentialai/rnj-1-instruct	33K	$0.150	$0.150	LOW

Provider

prime-intellect

Models

1

Pricing table for prime-intellect models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Prime Intellect: INTELLECT-3 prime-intellect/intellect-3	131K	$0.200	$1.10	MED

Provider

kwaipilot

Models

1

Pricing table for kwaipilot models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Kwaipilot: KAT-Coder-Pro V1 kwaipilot/kat-coder-pro	256K	$0.207	$0.828	MED

Provider

ibm-granite

Models

1

Pricing table for ibm-granite models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
IBM: Granite 4.0 Micro ibm-granite/granite-4.0-h-micro	131K	$0.017	$0.110	LOW

Provider

thedrummer

Models

4

Pricing table for thedrummer models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
TheDrummer: Rocinante 12B thedrummer/rocinante-12b	33K	$0.170	$0.430	LOW
TheDrummer: UnslopNemo 12B thedrummer/unslopnemo-12b	33K	$0.400	$0.400	LOW
TheDrummer: Cydonia 24B V4.1 thedrummer/cydonia-24b-v4.1	131K	$0.300	$0.500	LOW
TheDrummer: Skyfall 36B V2 thedrummer/skyfall-36b-v2	33K	$0.550	$0.800	MED

Provider

alibaba

Models

1

Pricing table for alibaba models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Tongyi DeepResearch 30B A3B alibaba/tongyi-deepresearch-30b-a3b	131K	$0.090	$0.450	LOW

Provider

meituan

Models

1

Pricing table for meituan models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Meituan: LongCat Flash Chat meituan/longcat-flash-chat	131K	$0.200	$0.800	LOW

Provider

nousresearch

Models

5

Pricing table for nousresearch models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
NousResearch: Hermes 2 Pro - Llama-3 8B nousresearch/hermes-2-pro-llama-3-8b	8K	$0.140	$0.140	LOW
Nous: Hermes 4 70B nousresearch/hermes-4-70b	131K	$0.130	$0.400	LOW
Nous: Hermes 3 70B Instruct nousresearch/hermes-3-llama-3.1-70b	66K	$0.300	$0.300	LOW
Nous: Hermes 3 405B Instruct nousresearch/hermes-3-llama-3.1-405b	131K	$1.00	$1.00	MED
Nous: Hermes 4 405B nousresearch/hermes-4-405b	131K	$1.00	$3.00	MED

Provider

ai21

Models

1

Pricing table for ai21 models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
AI21: Jamba Large 1.7 ai21/jamba-large-1.7	256K	$2.00	$8.00	HIGH

Provider

switchpoint

Models

1

Pricing table for switchpoint models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Switchpoint Router switchpoint/router	131K	$0.850	$3.40	MED

Provider

tngtech

Models

1

Pricing table for tngtech models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
TNG: DeepSeek R1T2 Chimera tngtech/deepseek-r1t2-chimera	164K	$0.250	$0.850	MED

Provider

morph

Models

2

Pricing table for morph models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Morph: Morph V3 Fast morph/morph-v3-fast	82K	$0.800	$1.20	MED
Morph: Morph V3 Large morph/morph-v3-large	262K	$0.900	$1.90	MED

Provider

inception

Models

2

Pricing table for inception models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Inception: Mercury inception/mercury	128K	$0.250	$1.00	MED
Inception: Mercury Coder inception/mercury-coder	128K	$0.250	$1.00	MED

Provider

eleutherai

Models

1

Pricing table for eleutherai models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
EleutherAI: Llemma 7b eleutherai/llemma_7b	4K	$0.800	$1.20	MED

Provider

alfredpros

Models

1

Pricing table for alfredpros models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
AlfredPros: CodeLLaMa 7B Instruct Solidity alfredpros/codellama-7b-instruct-solidity	4K	$0.800	$1.20	MED

Provider

sao10k

Models

5

Pricing table for sao10k models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Sao10K: Llama 3 8B Lunaris sao10k/l3-lunaris-8b	8K	$0.040	$0.050	LOW
Sao10K: Llama 3.3 Euryale 70B sao10k/l3.3-euryale-70b	131K	$0.650	$0.750	MED
Sao10K: Llama 3.1 Euryale 70B v2.2 sao10k/l3.1-euryale-70b	33K	$0.650	$0.750	MED
Sao10k: Llama 3 Euryale 70B v2.1 sao10k/l3-euryale-70b	8K	$1.48	$1.48	MED
Sao10K: Llama 3.1 70B Hanami x1 sao10k/l3.1-70b-hanami-x1	16K	$3.00	$3.00	MED

Provider

raifle

Models

1

Pricing table for raifle models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
SorcererLM 8x22B raifle/sorcererlm-8x22b	16K	$4.50	$4.50	MED

Provider

anthracite-org

Models

1

Pricing table for anthracite-org models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Magnum v4 72B anthracite-org/magnum-v4-72b	16K	$3.00	$5.00	MED

Provider

inflection

Models

2

Pricing table for inflection models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Inflection: Inflection 3 Pi inflection/inflection-3-pi	8K	$2.50	$10	HIGH
Inflection: Inflection 3 Productivity inflection/inflection-3-productivity	8K	$2.50	$10	HIGH

Provider

neversleep

Models

2

Pricing table for neversleep models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
NeverSleep: Lumimaid v0.2 8B neversleep/llama-3.1-lumimaid-8b	33K	$0.090	$0.600	LOW
Noromaid 20B neversleep/noromaid-20b	4K	$1.00	$1.75	MED

Provider

alpindale

Models

1

Pricing table for alpindale models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Goliath 120B alpindale/goliath-120b	6K	$3.75	$7.50	HIGH

Provider

mancer

Models

1

Pricing table for mancer models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Mancer: Weaver (alpha) mancer/weaver	8K	$0.750	$1.00	MED

Provider

undi95

Models

1

Pricing table for undi95 models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
ReMM SLERP 13B undi95/remm-slerp-l2-13b	6K	$0.450	$0.650	MED

Provider

gryphe

Models

1

Pricing table for gryphe models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
MythoMax 13B gryphe/mythomax-l2-13b	4K	$0.060	$0.060	LOW

LLM TOKENCOST CALC

GPT

Claude

Gemini

Grok

Qwen

DeepSeek

Mistral

Cohere

MoonshotAI

ByteDance

DeepCogito

Baidu

Z-AI

Tencent

MiniMax

Meta-Llama

Microsoft

NVIDIA

Perplexity

Amazon

bytedance-seed

liquid

aion-labs

stepfun

arcee-ai

writer

allenai

xiaomi

relace

nex-agi

essentialai

prime-intellect

kwaipilot

ibm-granite

thedrummer

alibaba

meituan

nousresearch

ai21

switchpoint

tngtech

morph

inception

eleutherai

alfredpros

sao10k

raifle

anthracite-org

inflection

neversleep

alpindale

mancer

undi95

gryphe

Provider Calculators

LLM TOKEN
COST CALC