Provider

GPT

Models

66

Pricing table for GPT models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
OpenAI: gpt-oss-20b openai/gpt-oss-20b	131K	$0.030	$0.130	LOW
OpenAI: gpt-oss-120b openai/gpt-oss-120b	131K	$0.037	$0.170	LOW
OpenAI: gpt-oss-safeguard-20b openai/gpt-oss-safeguard-20b	131K	$0.075	$0.300	LOW
OpenAI: GPT-5 Nano openai/gpt-5-nano	400K	$0.050	$0.400	LOW
OpenAI: GPT-4.1 Nano openai/gpt-4.1-nano	1.0M	$0.100	$0.400	LOW
OpenAI: GPT-4o-mini Search Preview openai/gpt-4o-mini-search-preview	128K	$0.150	$0.600	LOW
OpenAI: GPT-4o-mini openai/gpt-4o-mini	128K	$0.150	$0.600	LOW
OpenAI: GPT-4o-mini (2024-07-18) openai/gpt-4o-mini-2024-07-18	128K	$0.150	$0.600	LOW
OpenAI: GPT-5.4 Nano openai/gpt-5.4-nano	400K	$0.200	$1.25	MED
OpenAI: GPT-4.1 Mini openai/gpt-4.1-mini	1.0M	$0.400	$1.60	MED
OpenAI: GPT-3.5 Turbo openai/gpt-3.5-turbo	16K	$0.500	$1.50	MED
OpenAI: GPT-5.1-Codex-Mini openai/gpt-5.1-codex-mini	400K	$0.250	$2.00	MED
OpenAI: GPT-5 Mini openai/gpt-5-mini	400K	$0.250	$2.00	MED
OpenAI: GPT Audio Mini openai/gpt-audio-mini	128K	$0.600	$2.40	MED
OpenAI: GPT-3.5 Turbo (older v0613) openai/gpt-3.5-turbo-0613	4K	$1.00	$2.00	MED
OpenAI: GPT-3.5 Turbo Instruct openai/gpt-3.5-turbo-instruct	4K	$1.50	$2.00	MED
OpenAI: GPT-5 Image Mini openai/gpt-5-image-mini	400K	$2.50	$2.00	MED
OpenAI: GPT-5.4 Mini openai/gpt-5.4-mini	400K	$0.750	$4.50	MED
OpenAI: o4 Mini High openai/o4-mini-high	200K	$1.10	$4.40	MED
OpenAI: o4 Mini openai/o4-mini	200K	$1.10	$4.40	MED
OpenAI: o3 Mini High openai/o3-mini-high	200K	$1.10	$4.40	MED
OpenAI: o3 Mini openai/o3-mini	200K	$1.10	$4.40	MED
OpenAI: GPT-5.6 Luna Pro openai/gpt-5.6-luna-pro	1.1M	$1.00	$6.00	MED
OpenAI: GPT-5.6 Luna openai/gpt-5.6-luna	1.1M	$1.00	$6.00	MED
OpenAI: GPT-3.5 Turbo 16k openai/gpt-3.5-turbo-16k	16K	$3.00	$4.00	MED
OpenAI: o4 Mini Deep Research openai/o4-mini-deep-research	200K	$2.00	$8.00	HIGH
OpenAI: o3 openai/o3	200K	$2.00	$8.00	HIGH
OpenAI: GPT-4.1 openai/gpt-4.1	1.0M	$2.00	$8.00	HIGH
OpenAI: GPT-5.1-Codex-Max openai/gpt-5.1-codex-max	400K	$1.25	$10	HIGH
OpenAI: GPT-5.1 openai/gpt-5.1	400K	$1.25	$10	HIGH
OpenAI: GPT-5.1 Chat openai/gpt-5.1-chat	128K	$1.25	$10	HIGH
OpenAI: GPT-5.1-Codex openai/gpt-5.1-codex	400K	$1.25	$10	HIGH
OpenAI: GPT-5 Codex openai/gpt-5-codex	400K	$1.25	$10	HIGH
OpenAI: GPT-5 Chat openai/gpt-5-chat	128K	$1.25	$10	HIGH
OpenAI: GPT-5 openai/gpt-5	400K	$1.25	$10	HIGH
OpenAI: GPT Audio openai/gpt-audio	128K	$2.50	$10	HIGH
OpenAI: GPT-4o Search Preview openai/gpt-4o-search-preview	128K	$2.50	$10	HIGH
OpenAI: GPT-4o (2024-11-20) openai/gpt-4o-2024-11-20	128K	$2.50	$10	HIGH
OpenAI: GPT-4o (2024-08-06) openai/gpt-4o-2024-08-06	128K	$2.50	$10	HIGH
OpenAI: GPT-4o openai/gpt-4o	128K	$2.50	$10	HIGH
OpenAI: GPT-5.3 Chat openai/gpt-5.3-chat	128K	$1.75	$14	HIGH
OpenAI: GPT-5.3-Codex openai/gpt-5.3-codex	400K	$1.75	$14	HIGH
OpenAI: GPT-5.2-Codex openai/gpt-5.2-codex	400K	$1.75	$14	HIGH
OpenAI: GPT-5.2 Chat openai/gpt-5.2-chat	128K	$1.75	$14	HIGH
OpenAI: GPT-5.2 openai/gpt-5.2	400K	$1.75	$14	HIGH
OpenAI: GPT-5.6 Terra Pro openai/gpt-5.6-terra-pro	1.1M	$2.50	$15	HIGH
OpenAI: GPT-5.6 Terra openai/gpt-5.6-terra	1.1M	$2.50	$15	HIGH
OpenAI: GPT-5.4 openai/gpt-5.4	1.1M	$2.50	$15	HIGH
OpenAI: GPT-5 Image openai/gpt-5-image	400K	$10	$10	HIGH
OpenAI: GPT-4o (2024-05-13) openai/gpt-4o-2024-05-13	128K	$5.00	$15	HIGH
OpenAI: GPT-5.4 Image 2 openai/gpt-5.4-image-2	272K	$8.00	$15	HIGH
OpenAI: GPT-5.6 Sol Pro openai/gpt-5.6-sol-pro	1.1M	$5.00	$30	HIGH
OpenAI: GPT-5.6 Sol openai/gpt-5.6-sol	1.1M	$5.00	$30	HIGH
OpenAI: GPT Chat Latest openai/gpt-chat-latest	400K	$5.00	$30	HIGH
OpenAI: GPT-5.5 openai/gpt-5.5	1.1M	$5.00	$30	HIGH
OpenAI: GPT-4 Turbo openai/gpt-4-turbo	128K	$10	$30	HIGH
OpenAI: GPT-4 Turbo Preview openai/gpt-4-turbo-preview	128K	$10	$30	HIGH
OpenAI: o3 Deep Research openai/o3-deep-research	200K	$10	$40	HIGH
OpenAI: o1 openai/o1	200K	$15	$60	HIGH
OpenAI: GPT-4 openai/gpt-4	8K	$30	$60	HIGH
OpenAI: o3 Pro openai/o3-pro	200K	$20	$80	HIGH
OpenAI: GPT-5 Pro openai/gpt-5-pro	400K	$15	$120	HIGH
OpenAI: GPT-5.2 Pro openai/gpt-5.2-pro	400K	$21	$168	HIGH
OpenAI: GPT-5.5 Pro openai/gpt-5.5-pro	1.1M	$30	$180	HIGH
OpenAI: GPT-5.4 Pro openai/gpt-5.4-pro	1.1M	$30	$180	HIGH
OpenAI: o1-pro openai/o1-pro	200K	$150	$600	HIGH
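
Each row above packs five tab-separated fields, and the first field holds both the display name and the model slug. A minimal sketch of reading one row into typed fields, assuming the slug is always the last space-separated token of the first column (true of the rows listed here); `PricingRow` and `parse_row` are hypothetical names, not part of any published API.

```python
from typing import NamedTuple


class PricingRow(NamedTuple):
    name: str            # display name, e.g. "OpenAI: GPT-4o-mini"
    slug: str            # provider/model identifier
    context: str         # context label as printed, e.g. "128K"
    input_per_m: float   # USD per million input tokens
    output_per_m: float  # USD per million output tokens
    tier: str            # LOW, MED, or HIGH


def parse_row(line: str) -> PricingRow:
    """Split one tab-separated table row into its fields."""
    model, context, input_price, output_price, tier = line.rstrip("\n").split("\t")
    # Assumption: the slug contains no spaces, so it is the final token.
    name, slug = model.rsplit(" ", 1)
    return PricingRow(
        name,
        slug,
        context,
        float(input_price.lstrip("$")),
        float(output_price.lstrip("$")),
        tier,
    )


row = parse_row("OpenAI: GPT-4o-mini openai/gpt-4o-mini\t128K\t$0.150\t$0.600\tLOW")
print(row.slug, row.input_per_m, row.output_per_m, row.tier)
# prints: openai/gpt-4o-mini 0.15 0.6 LOW
```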

Provider

Claude

Models

15

Pricing table for Claude models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Anthropic: Claude 3 Haiku anthropic/claude-3-haiku	200K	$0.250	$1.25	MED
Anthropic: Claude Haiku 4.5 anthropic/claude-haiku-4.5	200K	$1.00	$5.00	MED
Anthropic: Claude Sonnet 5 anthropic/claude-sonnet-5	1.0M	$2.00	$10	HIGH
Anthropic: Claude Sonnet 4.6 anthropic/claude-sonnet-4.6	1.0M	$3.00	$15	HIGH
Anthropic: Claude Sonnet 4.5 anthropic/claude-sonnet-4.5	1.0M	$3.00	$15	HIGH
Anthropic: Claude Sonnet 4 anthropic/claude-sonnet-4	1.0M	$3.00	$15	HIGH
Anthropic: Claude Opus 4.8 anthropic/claude-opus-4.8	1.0M	$5.00	$25	HIGH
Anthropic: Claude Opus 4.7 anthropic/claude-opus-4.7	1.0M	$5.00	$25	HIGH
Anthropic: Claude Opus 4.6 anthropic/claude-opus-4.6	1.0M	$5.00	$25	HIGH
Anthropic: Claude Opus 4.5 anthropic/claude-opus-4.5	200K	$5.00	$25	HIGH
Anthropic: Claude Fable 5 anthropic/claude-fable-5	1.0M	$10	$50	HIGH
Anthropic: Claude Opus 4.8 (Fast) anthropic/claude-opus-4.8-fast	1.0M	$10	$50	HIGH
Anthropic: Claude Opus 4.1 anthropic/claude-opus-4.1	200K	$15	$75	HIGH
Anthropic: Claude Opus 4 anthropic/claude-opus-4	200K	$15	$75	HIGH
Anthropic: Claude Opus 4.7 (Fast) anthropic/claude-opus-4.7-fast	1.0M	$30	$150	HIGH
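
The Input $/M and Output $/M columns are prices per one million tokens, so the cost of a single request follows from its token counts. A minimal sketch, assuming input and output tokens are billed separately at the listed rates with no caching discounts or extra fees; `request_cost` is a hypothetical helper, and the prices are copied from the anthropic/claude-haiku-4.5 row above.

```python
def request_cost(input_tokens: int, output_tokens: int,
                 input_per_m: float, output_per_m: float) -> float:
    """Estimate the USD cost of one request from per-million-token prices."""
    tokens_per_unit = 1_000_000
    input_cost = input_tokens * input_per_m / tokens_per_unit
    output_cost = output_tokens * output_per_m / tokens_per_unit
    return input_cost + output_cost


# 10,000 prompt tokens and 2,000 completion tokens at $1.00 / $5.00 per million.
cost = request_cost(10_000, 2_000, input_per_m=1.00, output_per_m=5.00)
print(f"${cost:.4f}")
# prints: $0.0200
```

Because output is priced at five times input in this row, the 2,000 completion tokens cost as much as the 10,000 prompt tokens.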

Provider

Gemini

Models

24

Pricing table for Gemini models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Google: Gemma 3 4B google/gemma-3-4b-it	131K	$0.050	$0.100	LOW
Google: Gemma 3n 4B google/gemma-3n-e4b-it	33K	$0.060	$0.120	LOW
Google: Gemma 3 12B google/gemma-3-12b-it	131K	$0.050	$0.150	LOW
Google: Gemma 3 27B google/gemma-3-27b-it	131K	$0.100	$0.300	LOW
Google: Gemma 4 26B A4B google/gemma-4-26b-a4b-it	262K	$0.070	$0.340	LOW
Google: Gemini 2.5 Flash Lite google/gemini-2.5-flash-lite	1.0M	$0.100	$0.400	LOW
Google: Gemma 4 31B google/gemma-4-31b-it	262K	$0.220	$0.550	LOW
Google: Gemma 2 27B google/gemma-2-27b-it	8K	$0.650	$0.650	MED
Google: Nano Banana 2 Lite (Gemini 3.1 Flash Lite Image) google/gemini-3.1-flash-lite-image	66K	$0.250	$1.50	MED
Google: Gemini 3.1 Flash Lite google/gemini-3.1-flash-lite	1.0M	$0.250	$1.50	MED
Google: Gemini 3.1 Flash Lite Preview google/gemini-3.1-flash-lite-preview	1.0M	$0.250	$1.50	MED
Google: Nano Banana (Gemini 2.5 Flash Image) google/gemini-2.5-flash-image	33K	$0.300	$2.50	MED
Google: Gemini 2.5 Flash google/gemini-2.5-flash	1.0M	$0.300	$2.50	MED
Google: Nano Banana 2 (Gemini 3.1 Flash Image) google/gemini-3.1-flash-image	131K	$0.500	$3.00	MED
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview) google/gemini-3.1-flash-image-preview	131K	$0.500	$3.00	MED
Google: Gemini 3 Flash Preview google/gemini-3-flash-preview	1.0M	$0.500	$3.00	MED
Google: Gemini 3.5 Flash google/gemini-3.5-flash	1.0M	$1.50	$9.00	HIGH
Google: Gemini 2.5 Pro google/gemini-2.5-pro	1.0M	$1.25	$10	HIGH
Google: Gemini 2.5 Pro Preview 06-05 google/gemini-2.5-pro-preview	1.0M	$1.25	$10	HIGH
Google: Gemini 2.5 Pro Preview 05-06 google/gemini-2.5-pro-preview-05-06	1.0M	$1.25	$10	HIGH
Google: Nano Banana Pro (Gemini 3 Pro Image) google/gemini-3-pro-image	66K	$2.00	$12	HIGH
Google: Gemini 3.1 Pro Preview Custom Tools google/gemini-3.1-pro-preview-customtools	1.0M	$2.00	$12	HIGH
Google: Gemini 3.1 Pro Preview google/gemini-3.1-pro-preview	1.0M	$2.00	$12	HIGH
Google: Nano Banana Pro (Gemini 3 Pro Image Preview) google/gemini-3-pro-image-preview	66K	$2.00	$12	HIGH
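
The Context column uses rounded labels such as 131K or 1.0M. A minimal sketch of turning those labels into token counts and checking whether a request fits, assuming K means 1,000 and M means 1,000,000, and assuming prompt and completion share one window (some models cap output separately). Since the labels are rounded, the results are approximate; `parse_context` and `fits` are hypothetical names.

```python
def parse_context(label: str) -> int:
    """Convert a Context label such as '131K' or '1.0M' to a token count."""
    multipliers = {"K": 1_000, "M": 1_000_000}
    suffix = label[-1].upper()
    if suffix not in multipliers:
        raise ValueError(f"unrecognized context label: {label!r}")
    return round(float(label[:-1]) * multipliers[suffix])


def fits(prompt_tokens: int, max_output_tokens: int, context_label: str) -> bool:
    """Check whether prompt plus requested output stays within the window."""
    return prompt_tokens + max_output_tokens <= parse_context(context_label)


print(parse_context("131K"), parse_context("1.0M"))
# prints: 131000 1000000
print(fits(120_000, 8_000, "131K"), fits(130_000, 8_000, "131K"))
# prints: True False
```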

Provider

Grok

Models

5

Pricing table for Grok models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
xAI: Grok Build 0.1 x-ai/grok-build-0.1	256K	$1.00	$2.00	MED
xAI: Grok 4.3 x-ai/grok-4.3	1.0M	$1.25	$2.50	MED
xAI: Grok 4.20 Multi-Agent x-ai/grok-4.20-multi-agent	2.0M	$1.25	$2.50	MED
xAI: Grok 4.20 x-ai/grok-4.20	2.0M	$1.25	$2.50	MED
xAI: Grok 4.5 x-ai/grok-4.5	500K	$2.00	$6.00	MED

Provider

Qwen

Models

47

Pricing table for Qwen models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Qwen: Qwen2.5 7B Instruct qwen/qwen-2.5-7b-instruct	131K	$0.040	$0.100	LOW
Qwen: Qwen3.5-9B qwen/qwen3.5-9b	262K	$0.100	$0.150	LOW
Qwen: Qwen3.5-Flash qwen/qwen3.5-flash-02-23	1.0M	$0.065	$0.260	LOW
Qwen: Qwen3 Coder 30B A3B Instruct qwen/qwen3-coder-30b-a3b-instruct	160K	$0.070	$0.270	LOW
Qwen: Qwen3 32B qwen/qwen3-32b	131K	$0.080	$0.280	LOW
Qwen: Qwen3 30B A3B Instruct 2507 qwen/qwen3-30b-a3b-instruct-2507	262K	$0.100	$0.300	LOW
Qwen: Qwen3 VL 32B Instruct qwen/qwen3-vl-32b-instruct	262K	$0.104	$0.416	LOW
Qwen: Qwen3 VL 8B Instruct qwen/qwen3-vl-8b-instruct	256K	$0.117	$0.455	LOW
Qwen: Qwen3 8B qwen/qwen3-8b	131K	$0.117	$0.455	LOW
Qwen: Qwen3 235B A22B Instruct 2507 qwen/qwen3-235b-a22b-2507	262K	$0.090	$0.550	LOW
Qwen: Qwen3 VL 30B A3B Instruct qwen/qwen3-vl-30b-a3b-instruct	262K	$0.130	$0.520	LOW
Qwen: Qwen3 30B A3B qwen/qwen3-30b-a3b	131K	$0.130	$0.520	LOW
Qwen2.5 72B Instruct qwen/qwen-2.5-72b-instruct	131K	$0.360	$0.400	LOW
Qwen: Qwen3 Next 80B A3B Thinking qwen/qwen3-next-80b-a3b-thinking	262K	$0.098	$0.780	LOW
Qwen: Qwen3 Coder Next qwen/qwen3-coder-next	262K	$0.110	$0.800	LOW
Qwen: Qwen Plus 0728 qwen/qwen-plus-2025-07-28	1.0M	$0.260	$0.780	MED
Qwen: Qwen Plus 0728 (thinking) qwen/qwen-plus-2025-07-28:thinking	1.0M	$0.260	$0.780	MED
Qwen: Qwen-Plus qwen/qwen-plus	1.0M	$0.260	$0.780	MED
Qwen: Qwen3 14B qwen/qwen3-14b	132K	$0.227	$0.910	MED
Qwen: Qwen3.6 35B A3B qwen/qwen3.6-35b-a3b	262K	$0.140	$1.00	MED
Qwen: Qwen3.5-35B-A3B qwen/qwen3.5-35b-a3b	262K	$0.140	$1.00	MED
Qwen: Qwen3 Coder Flash qwen/qwen3-coder-flash	1.0M	$0.195	$0.975	MED
Qwen: Qwen3 Next 80B A3B Instruct qwen/qwen3-next-80b-a3b-instruct	262K	$0.100	$1.10	MED
Qwen: Qwen3 Coder 480B A35B qwen/qwen3-coder	1.0M	$0.300	$1.00	MED
Qwen: Qwen3.6 Flash qwen/qwen3.6-flash	1.0M	$0.188	$1.13	MED
Qwen: Qwen3 VL 8B Thinking qwen/qwen3-vl-8b-thinking	256K	$0.117	$1.36	MED
Qwen: Qwen3.7 Plus qwen/qwen3.7-plus	1.0M	$0.320	$1.28	MED
Qwen: Qwen3 235B A22B Thinking 2507 qwen/qwen3-235b-a22b-thinking-2507	262K	$0.150	$1.50	MED
Qwen2.5 Coder 32B Instruct qwen/qwen-2.5-coder-32b-instruct	128K	$0.660	$1.00	MED
Qwen: Qwen3 VL 30B A3B Thinking qwen/qwen3-vl-30b-a3b-thinking	131K	$0.130	$1.56	MED
Qwen: Qwen3 30B A3B Thinking 2507 qwen/qwen3-30b-a3b-thinking-2507	131K	$0.130	$1.56	MED
Qwen: Qwen2.5 VL 72B Instruct qwen/qwen2.5-vl-72b-instruct	131K	$0.800	$1.00	MED
Qwen: Qwen3.5 Plus 2026-02-15 qwen/qwen3.5-plus-02-15	1.0M	$0.260	$1.56	MED
Qwen: Qwen3.5 Plus 2026-04-20 qwen/qwen3.5-plus-20260420	1.0M	$0.300	$1.80	MED
Qwen: Qwen3 VL 235B A22B Instruct qwen/qwen3-vl-235b-a22b-instruct	131K	$0.210	$1.90	MED
Qwen: Qwen3.6 Plus qwen/qwen3.6-plus	1.0M	$0.325	$1.95	MED
Qwen: Qwen3 235B A22B qwen/qwen3-235b-a22b	131K	$0.455	$1.82	MED
Qwen: Qwen3.5-122B-A10B qwen/qwen3.5-122b-a10b	262K	$0.260	$2.08	MED
Qwen: Qwen3.5 397B A17B qwen/qwen3.5-397b-a17b	262K	$0.390	$2.34	MED
Qwen: Qwen3.5-27B qwen/qwen3.5-27b	262K	$0.260	$2.60	MED
Qwen: Qwen3 VL 235B A22B Thinking qwen/qwen3-vl-235b-a22b-thinking	131K	$0.260	$2.60	MED
Qwen: Qwen3.6 27B qwen/qwen3.6-27b	262K	$0.450	$2.70	MED
Qwen: Qwen3 Coder Plus qwen/qwen3-coder-plus	1.0M	$0.650	$3.25	MED
Qwen: Qwen3 Max Thinking qwen/qwen3-max-thinking	262K	$0.780	$3.90	MED
Qwen: Qwen3 Max qwen/qwen3-max	262K	$0.780	$3.90	MED
Qwen: Qwen3.7 Max qwen/qwen3.7-max	1.0M	$1.48	$4.42	MED
Qwen: Qwen3.6 Max Preview qwen/qwen3.6-max-preview	262K	$1.04	$6.24	MED

Provider

DeepSeek

Models

11

Pricing table for DeepSeek models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
DeepSeek: DeepSeek V4 Flash deepseek/deepseek-v4-flash	1.0M	$0.098	$0.196	LOW
DeepSeek: DeepSeek V3.2 deepseek/deepseek-v3.2	164K	$0.269	$0.400	LOW
DeepSeek: DeepSeek V3.2 Exp deepseek/deepseek-v3.2-exp	164K	$0.270	$0.410	LOW
DeepSeek: DeepSeek V3 deepseek/deepseek-chat	131K	$0.200	$0.800	MED
DeepSeek: DeepSeek V3.1 deepseek/deepseek-chat-v3.1	164K	$0.250	$0.950	MED
DeepSeek: DeepSeek V3.1 Terminus deepseek/deepseek-v3.1-terminus	131K	$0.270	$1.00	MED
DeepSeek: DeepSeek V4 Pro deepseek/deepseek-v4-pro	1.0M	$0.435	$0.870	MED
DeepSeek: DeepSeek V3 0324 deepseek/deepseek-chat-v3-0324	164K	$0.270	$1.12	MED
DeepSeek: R1 Distill Llama 70B deepseek/deepseek-r1-distill-llama-70b	128K	$0.800	$0.800	MED
DeepSeek: R1 0528 deepseek/deepseek-r1-0528	164K	$0.500	$2.15	MED
DeepSeek: R1 deepseek/deepseek-r1	164K	$0.700	$2.50	MED

Provider

Mistral

Models

19

Pricing table for Mistral models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Mistral: Mistral Nemo mistralai/mistral-nemo	131K	$0.019	$0.030	LOW
Mistral: Mistral Small 3 mistralai/mistral-small-24b-instruct-2501	33K	$0.050	$0.080	LOW
Mistral: Ministral 3 3B 2512 mistralai/ministral-3b-2512	131K	$0.100	$0.100	LOW
Mistral: Ministral 3 8B 2512 mistralai/ministral-8b-2512	262K	$0.150	$0.150	LOW
Mistral: Ministral 3 14B 2512 mistralai/ministral-14b-2512	262K	$0.200	$0.200	LOW
Mistral: Voxtral Small 24B 2507 mistralai/voxtral-small-24b-2507	32K	$0.100	$0.300	LOW
Mistral: Mistral Small 3.2 24B mistralai/mistral-small-3.2-24b-instruct	131K	$0.100	$0.300	LOW
Mistral: Mistral Small 4 mistralai/mistral-small-2603	262K	$0.150	$0.600	LOW
Mistral: Saba mistralai/mistral-saba	33K	$0.200	$0.600	LOW
Mistral: Mistral Small 3.1 24B mistralai/mistral-small-3.1-24b-instruct	128K	$0.351	$0.555	LOW
Mistral: Codestral 2508 mistralai/codestral-2508	256K	$0.300	$0.900	MED
Mistral: Mistral Large 3 2512 mistralai/mistral-large-2512	262K	$0.500	$1.50	MED
Mistral: Devstral 2 2512 mistralai/devstral-2512	262K	$0.400	$2.00	MED
Mistral: Mistral Medium 3.1 mistralai/mistral-medium-3.1	131K	$0.400	$2.00	MED
Mistral: Mistral Medium 3 mistralai/mistral-medium-3	131K	$0.400	$2.00	MED
Mistral Large 2407 mistralai/mistral-large-2407	131K	$2.00	$6.00	MED
Mistral: Mixtral 8x22B Instruct mistralai/mixtral-8x22b-instruct	66K	$2.00	$6.00	MED
Mistral Large mistralai/mistral-large	128K	$2.00	$6.00	MED
Mistral: Mistral Medium 3.5 mistralai/mistral-medium-3-5	262K	$1.50	$7.50	MED

Provider

Cohere

Models

4

Pricing table for Cohere models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Cohere: Command R7B (12-2024) cohere/command-r7b-12-2024	128K	$0.037	$0.150	LOW
Cohere: Command R (08-2024) cohere/command-r-08-2024	128K	$0.150	$0.600	LOW
Cohere: Command A cohere/command-a	256K	$2.50	$10	HIGH
Cohere: Command R+ (08-2024) cohere/command-r-plus-08-2024	128K	$2.50	$10	HIGH

Provider

MoonshotAI

Models

7

Pricing table for MoonshotAI models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
MoonshotAI: Kimi K2 0711 moonshotai/kimi-k2	131K	$0.570	$2.30	MED
MoonshotAI: Kimi K2 Thinking moonshotai/kimi-k2-thinking	262K	$0.600	$2.50	MED
MoonshotAI: Kimi K2 0905 moonshotai/kimi-k2-0905	262K	$0.600	$2.50	MED
MoonshotAI: Kimi K2.5 moonshotai/kimi-k2.5	262K	$0.570	$2.85	MED
MoonshotAI: Kimi K2.6 moonshotai/kimi-k2.6	262K	$0.684	$3.42	MED
MoonshotAI: Kimi K2.7 Code moonshotai/kimi-k2.7-code	262K	$0.850	$3.80	MED
MoonshotAI: Kimi K3 moonshotai/kimi-k3	1.0M	$3.00	$15	HIGH

Provider

ByteDance

Models

1

Pricing table for ByteDance models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
ByteDance: UI-TARS 7B bytedance/ui-tars-1.5-7b	128K	$0.100	$0.200	LOW

Provider

DeepCogito

Models

1

Pricing table for DeepCogito models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Deep Cogito: Cogito v2.1 671B deepcogito/cogito-v2.1-671b	128K	$1.25	$1.25	MED

Provider

Baidu

Models

1

Pricing table for Baidu models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Baidu: ERNIE 4.5 VL 424B A47B baidu/ernie-4.5-vl-424b-a47b	131K	$0.420	$1.25	MED

Provider

Z-AI

Models

12

Pricing table for Z-AI models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Z.ai: GLM 4.7 Flash z-ai/glm-4.7-flash	200K	$0.061	$0.400	LOW
Z.ai: GLM 4.5 Air z-ai/glm-4.5-air	131K	$0.130	$0.850	LOW
Z.ai: GLM 5.2 z-ai/glm-5.2	1.0M	$0.259	$0.814	MED
Z.ai: GLM 4.6V z-ai/glm-4.6v	131K	$0.300	$0.900	MED
Z.ai: GLM 4.7 z-ai/glm-4.7	203K	$0.400	$1.75	MED
Z.ai: GLM 4.5V z-ai/glm-4.5v	66K	$0.600	$1.80	MED
Z.ai: GLM 4.6 z-ai/glm-4.6	203K	$0.500	$2.00	MED
Z.ai: GLM 4.5 z-ai/glm-4.5	131K	$0.600	$2.20	MED
Z.ai: GLM 5.1 z-ai/glm-5.1	203K	$0.966	$3.04	MED
Z.ai: GLM 5 z-ai/glm-5	203K	$0.950	$3.15	MED
Z.ai: GLM 5V Turbo z-ai/glm-5v-turbo	203K	$1.20	$4.00	MED
Z.ai: GLM 5 Turbo z-ai/glm-5-turbo	203K	$1.20	$4.00	MED

Provider

Tencent

Models

3

Pricing table for Tencent models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Tencent: Hy3 preview tencent/hy3-preview	262K	$0.063	$0.210	LOW
Tencent: Hunyuan A13B Instruct tencent/hunyuan-a13b-instruct	131K	$0.140	$0.570	LOW
Tencent: Hy3 tencent/hy3	262K	$0.200	$0.800	LOW

Provider

MiniMax

Models

8

Pricing table for MiniMax models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
MiniMax: MiniMax M2.5 minimax/minimax-m2.5	205K	$0.150	$0.900	MED
MiniMax: MiniMax M2.7 minimax/minimax-m2.7	205K	$0.250	$1.00	MED
MiniMax: MiniMax-01 minimax/minimax-01	1.0M	$0.200	$1.10	MED
MiniMax: MiniMax M3 minimax/minimax-m3	1.0M	$0.300	$1.20	MED
MiniMax: MiniMax M2-her minimax/minimax-m2-her	66K	$0.300	$1.20	MED
MiniMax: MiniMax M2.1 minimax/minimax-m2.1	205K	$0.300	$1.20	MED
MiniMax: MiniMax M2 minimax/minimax-m2	205K	$0.300	$1.20	MED
MiniMax: MiniMax M1 minimax/minimax-m1	1.0M	$0.550	$2.20	MED

Provider

Meta-Llama

Models

8

Pricing table for Meta-Llama models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Meta: Llama 3.1 8B Instruct meta-llama/llama-3.1-8b-instruct	131K	$0.050	$0.080	LOW
Meta: Llama 3.2 1B Instruct meta-llama/llama-3.2-1b-instruct	131K	$0.027	$0.201	LOW
Meta: Llama Guard 4 12B meta-llama/llama-guard-4-12b	164K	$0.180	$0.180	LOW
Meta: Llama 3.2 3B Instruct meta-llama/llama-3.2-3b-instruct	131K	$0.051	$0.335	LOW
Meta: Llama 4 Scout meta-llama/llama-4-scout	10.0M	$0.100	$0.300	LOW
Meta: Llama 3.3 70B Instruct meta-llama/llama-3.3-70b-instruct	131K	$0.130	$0.400	LOW
Meta: Llama 3.1 70B Instruct meta-llama/llama-3.1-70b-instruct	131K	$0.400	$0.400	LOW
Meta: Llama 4 Maverick meta-llama/llama-4-maverick	1.0M	$0.200	$0.800	LOW

Provider

Microsoft

Models

2

Pricing table for Microsoft models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Microsoft: Phi 4 microsoft/phi-4	16K	$0.070	$0.140	LOW
WizardLM-2 8x22B microsoft/wizardlm-2-8x22b	66K	$0.620	$0.620	MED

Provider

NVIDIA

Models

3

Pricing table for NVIDIA models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
NVIDIA: Nemotron 3 Nano 30B A3B nvidia/nemotron-3-nano-30b-a3b	262K	$0.050	$0.200	LOW
NVIDIA: Nemotron 3 Super nvidia/nemotron-3-super-120b-a12b	1.0M	$0.210	$0.455	LOW
NVIDIA: Nemotron 3 Ultra nvidia/nemotron-3-ultra-550b-a55b	1.0M	$0.600	$3.60	MED

Provider

Perplexity

Models

5

Pricing table for Perplexity models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Perplexity: Sonar perplexity/sonar	127K	$1.00	$1.00	MED
Perplexity: Sonar Reasoning Pro perplexity/sonar-reasoning-pro	128K	$2.00	$8.00	HIGH
Perplexity: Sonar Deep Research perplexity/sonar-deep-research	128K	$2.00	$8.00	HIGH
Perplexity: Sonar Pro Search perplexity/sonar-pro-search	200K	$3.00	$15	HIGH
Perplexity: Sonar Pro perplexity/sonar-pro	200K	$3.00	$15	HIGH

Provider

Amazon

Models

5

Pricing table for Amazon models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Amazon: Nova Micro 1.0 amazon/nova-micro-v1	128K	$0.035	$0.140	LOW
Amazon: Nova Lite 1.0 amazon/nova-lite-v1	300K	$0.060	$0.240	LOW
Amazon: Nova 2 Lite amazon/nova-2-lite-v1	1.0M	$0.300	$2.50	MED
Amazon: Nova Pro 1.0 amazon/nova-pro-v1	300K	$0.800	$3.20	MED
Amazon: Nova Premier 1.0 amazon/nova-premier-v1	1.0M	$2.50	$13	HIGH

Provider

thinkingmachines

Models

1

Pricing table for thinkingmachines models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Thinking Machines: Inkling thinkingmachines/inkling	1.0M	$1.00	$4.05	MED

Provider

kwaipilot

Models

3

Pricing table for kwaipilot models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Kwaipilot: KAT-Coder-Air V2.5 kwaipilot/kat-coder-air-v2.5	256K	$0.150	$0.600	LOW
Kwaipilot: KAT-Coder-Pro V2 kwaipilot/kat-coder-pro-v2	256K	$0.300	$1.20	MED
Kwaipilot: KAT-Coder-Pro V2.5 kwaipilot/kat-coder-pro-v2.5	256K	$0.740	$2.96	MED

Provider

~x-ai

Models

1

Pricing table for ~x-ai models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
xAI: Grok Latest ~x-ai/grok-latest	500K	$2.00	$6.00	MED

Provider

aion-labs

Models

4

Pricing table for aion-labs models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
AionLabs: Aion-3.0-Mini aion-labs/aion-3.0-mini	131K	$0.700	$1.40	MED
AionLabs: Aion-2.0 aion-labs/aion-2.0	131K	$0.800	$1.60	MED
AionLabs: Aion-RP 1.0 (8B) aion-labs/aion-rp-llama-3.1-8b	33K	$0.800	$1.60	MED
AionLabs: Aion-3.0 aion-labs/aion-3.0	131K	$3.00	$6.00	MED

Provider

poolside

Models

2

Pricing table for poolside models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Poolside: Laguna XS 2.1 poolside/laguna-xs-2.1	262K	$0.060	$0.120	LOW
Poolside: Laguna M.1 poolside/laguna-m.1	262K	$0.200	$0.400	LOW

Provider

nex-agi

Models

2

Pricing table for nex-agi models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Nex AGI: Nex-N2-Mini nex-agi/nex-n2-mini	262K	$0.025	$0.100	LOW
Nex AGI: Nex-N2-Pro nex-agi/nex-n2-pro	262K	$0.250	$1.00	MED

Provider

sakana

Models

1

Pricing table for sakana models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Sakana: Fugu Ultra sakana/fugu-ultra	1.0M	$5.00	$30	HIGH

Provider

~anthropic

Models

4

Pricing table for ~anthropic models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Anthropic: Claude Haiku Latest ~anthropic/claude-haiku-latest	200K	$1.00	$5.00	MED
Anthropic: Claude Sonnet Latest ~anthropic/claude-sonnet-latest	1.0M	$2.00	$10	HIGH
Anthropic: Claude Opus Latest ~anthropic/claude-opus-latest	1.0M	$5.00	$25	HIGH
Anthropic: Claude Fable Latest ~anthropic/claude-fable-latest	1.0M	$10	$50	HIGH

Provider

stepfun

Models

2

Pricing table for stepfun models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
StepFun: Step 3.5 Flash stepfun/step-3.5-flash	262K	$0.100	$0.300	LOW
StepFun: Step 3.7 Flash stepfun/step-3.7-flash	256K	$0.200	$1.15	MED

Provider

perceptron

Models

1

Pricing table for perceptron models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Perceptron: Perceptron Mk1 perceptron/perceptron-mk1	33K	$0.150	$1.50	MED

Provider

inclusionai

Models

3

Pricing table for inclusionai models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
inclusionAI: Ling-2.6-flash inclusionai/ling-2.6-flash	262K	$0.010	$0.030	LOW
inclusionAI: Ring-2.6-1T inclusionai/ring-2.6-1t	262K	$0.075	$0.625	LOW
inclusionAI: Ling-2.6-1T inclusionai/ling-2.6-1t	262K	$0.075	$0.625	LOW

Provider

ibm-granite

Models

2

Pricing table for ibm-granite models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
IBM: Granite 4.0 Micro ibm-granite/granite-4.0-h-micro	131K	$0.017	$0.112	LOW
IBM: Granite 4.1 8B ibm-granite/granite-4.1-8b	131K	$0.050	$0.100	LOW

Provider

~openai

Models

2

Pricing table for ~openai models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
OpenAI GPT Mini Latest ~openai/gpt-mini-latest	400K	$0.750	$4.50	MED
OpenAI GPT Latest ~openai/gpt-latest	1.1M	$5.00	$30	HIGH

Provider

~google

Models

2

Pricing table for ~google models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Google Gemini Flash Latest ~google/gemini-flash-latest	1.0M	$1.50	$9.00	HIGH
Google Gemini Pro Latest ~google/gemini-pro-latest	1.0M	$2.00	$12	HIGH

Provider

~moonshotai

Models

1

Pricing table for ~moonshotai models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
MoonshotAI Kimi Latest ~moonshotai/kimi-latest	1.0M	$3.00	$15	HIGH

Provider

xiaomi

Models

2

Pricing table for xiaomi models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Xiaomi: MiMo-V2.5 xiaomi/mimo-v2.5	1.0M	$0.140	$0.280	LOW
Xiaomi: MiMo-V2.5-Pro xiaomi/mimo-v2.5-pro	1.0M	$0.435	$0.870	MED

Provider

arcee-ai

Models

2

Pricing table for arcee-ai models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Arcee AI: Trinity Large Thinking arcee-ai/trinity-large-thinking	262K	$0.250	$0.800	MED
Arcee AI: Virtuoso Large arcee-ai/virtuoso-large	131K	$0.750	$1.20	MED

Provider

rekaai

Models

2

Pricing table for rekaai models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Reka Edge rekaai/reka-edge	16K	$0.100	$0.100	LOW
Reka Flash 3 rekaai/reka-flash-3	66K	$0.100	$0.200	LOW

Provider

bytedance-seed

Models

4

Pricing table for bytedance-seed models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
ByteDance Seed: Seed 1.6 Flash bytedance-seed/seed-1.6-flash	262K	$0.075	$0.300	LOW
ByteDance Seed: Seed-2.0-Mini bytedance-seed/seed-2.0-mini	262K	$0.100	$0.400	LOW
ByteDance Seed: Seed-2.0-Lite bytedance-seed/seed-2.0-lite	262K	$0.250	$2.00	MED
ByteDance Seed: Seed 1.6 bytedance-seed/seed-1.6	262K	$0.250	$2.00	MED

Provider

inception

Models

1

Pricing table for inception models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Inception: Mercury 2 inception/mercury-2	128K	$0.250	$0.750	MED

Provider

upstage

Models

1

Pricing table for upstage models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Upstage: Solar Pro 3 upstage/solar-pro-3	128K	$0.150	$0.600	LOW

Provider

writer

Models

1

Pricing table for writer models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Writer: Palmyra X5 writer/palmyra-x5	1.0M	$0.600	$6.00	MED

Provider

relace

Models

2

Pricing table for relace models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Relace: Relace Apply 3 relace/relace-apply-3	256K	$0.850	$1.25	MED
Relace: Relace Search relace/relace-search	256K	$1.00	$3.00	MED

Provider

allenai

Models

1

Pricing table for allenai models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
AllenAI: Olmo 3 32B Think allenai/olmo-3-32b-think	66K	$0.150	$0.500	LOW

Provider

thedrummer

Models

4

Pricing table for thedrummer models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
TheDrummer: Rocinante 12B thedrummer/rocinante-12b	66K	$0.250	$0.500	LOW
TheDrummer: UnslopNemo 12B thedrummer/unslopnemo-12b	33K	$0.400	$0.400	LOW
TheDrummer: Cydonia 24B V4.1 thedrummer/cydonia-24b-v4.1	131K	$0.300	$0.500	LOW
TheDrummer: Skyfall 36B V2 thedrummer/skyfall-36b-v2	33K	$0.550	$0.800	MED

Provider

nousresearch

Models

4

Pricing table for nousresearch models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Nous: Hermes 4 70B nousresearch/hermes-4-70b	131K	$0.130	$0.400	LOW
Nous: Hermes 3 70B Instruct nousresearch/hermes-3-llama-3.1-70b	131K	$0.700	$0.700	MED
Nous: Hermes 3 405B Instruct nousresearch/hermes-3-llama-3.1-405b	131K	$1.00	$1.00	MED
Nous: Hermes 4 405B nousresearch/hermes-4-405b	131K	$1.00	$3.00	MED

Provider

ai21

Models

1

Pricing table for ai21 models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
AI21: Jamba Large 1.7 ai21/jamba-large-1.7	256K	$2.00	$8.00	HIGH

Provider

cognitivecomputations

Models

1

Pricing table for cognitivecomputations models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Venice: Uncensored cognitivecomputations/dolphin-mistral-24b-venice-edition	128K	$0.200	$0.900	MED

Provider

morph

Models

2

Pricing table for morph models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Morph: Morph V3 Fast morph/morph-v3-fast	82K	$0.800	$1.20	MED
Morph: Morph V3 Large morph/morph-v3-large	262K	$0.900	$1.90	MED

Provider

sao10k

Models

3

Pricing table for sao10k models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Sao10K: Llama 3 8B Lunaris sao10k/l3-lunaris-8b	8K	$0.040	$0.050	LOW
Sao10K: Llama 3.3 Euryale 70B sao10k/l3.3-euryale-70b	131K	$0.650	$0.750	MED
Sao10K: Llama 3.1 Euryale 70B v2.2 sao10k/l3.1-euryale-70b	131K	$0.850	$0.850	MED

Provider

anthracite-org

Models

1

Pricing table for anthracite-org models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Magnum v4 72B anthracite-org/magnum-v4-72b	33K	$3.00	$5.00	MED

Provider

inflection

Models

2

Pricing table for inflection models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Inflection: Inflection 3 Pi inflection/inflection-3-pi	8K	$2.50	$10	HIGH
Inflection: Inflection 3 Productivity inflection/inflection-3-productivity	8K	$2.50	$10	HIGH

Provider

mancer

Models

1

Pricing table for mancer models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
Mancer: Weaver (alpha) mancer/weaver	8K	$0.500	$0.750	MED

Provider

undi95

Models

1

Pricing table for undi95 models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
ReMM SLERP 13B undi95/remm-slerp-l2-13b	6K	$0.450	$0.650	MED

Provider

gryphe

Models

1

Pricing table for gryphe models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
Model	Context	Input $/M	Output $/M	Tier
MythoMax 13B gryphe/mythomax-l2-13b	4K	$0.060	$0.060	LOW

LLM TOKEN COST CALC

Model Pricing by Provider
