Skip to main content

LLM TOKEN
COST CALC

Compare token pricing across 314 LLM models from 52 AI providers

Total Models
314
Providers
52
Last Updated
27 minutes ago
Price data loaded successfully. Showing 314 models from 52 providers.

Model Pricing by Provider

All prices shown are per million tokens in USD. Click the copy button on any row to copy the model ID to clipboard.
Provider

GPT

Models
60
Pricing table for GPT models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
OpenAI: gpt-oss-20b
openai/gpt-oss-20b
131K$0.030$0.140LOW
OpenAI: gpt-oss-120b
openai/gpt-oss-120b
131K$0.039$0.190LOW
OpenAI: gpt-oss-safeguard-20b
openai/gpt-oss-safeguard-20b
131K$0.075$0.300LOW
OpenAI: GPT-5 Nano
openai/gpt-5-nano
400K$0.050$0.400LOW
OpenAI: GPT-4.1 Nano
openai/gpt-4.1-nano
1.0M$0.100$0.400LOW
OpenAI: GPT-4o-mini Search Preview
openai/gpt-4o-mini-search-preview
128K$0.150$0.600LOW
OpenAI: GPT-4o-mini
openai/gpt-4o-mini
128K$0.150$0.600LOW
OpenAI: GPT-4o-mini (2024-07-18)
openai/gpt-4o-mini-2024-07-18
128K$0.150$0.600LOW
OpenAI: GPT-5.4 Nano
openai/gpt-5.4-nano
400K$0.200$1.25MED
OpenAI: GPT-4.1 Mini
openai/gpt-4.1-mini
1.0M$0.400$1.60MED
OpenAI: GPT-3.5 Turbo
openai/gpt-3.5-turbo
16K$0.500$1.50MED
OpenAI: GPT-5.1-Codex-Mini
openai/gpt-5.1-codex-mini
400K$0.250$2.00MED
OpenAI: GPT-5 Mini
openai/gpt-5-mini
400K$0.250$2.00MED
OpenAI: GPT Audio Mini
openai/gpt-audio-mini
128K$0.600$2.40MED
OpenAI: GPT-3.5 Turbo (older v0613)
openai/gpt-3.5-turbo-0613
4K$1.00$2.00MED
OpenAI: GPT-3.5 Turbo Instruct
openai/gpt-3.5-turbo-instruct
4K$1.50$2.00MED
OpenAI: GPT-5 Image Mini
openai/gpt-5-image-mini
400K$2.50$2.00MED
OpenAI: GPT-5.4 Mini
openai/gpt-5.4-mini
400K$0.750$4.50MED
OpenAI: o4 Mini High
openai/o4-mini-high
200K$1.10$4.40MED
OpenAI: o4 Mini
openai/o4-mini
200K$1.10$4.40MED
OpenAI: o3 Mini High
openai/o3-mini-high
200K$1.10$4.40MED
OpenAI: o3 Mini
openai/o3-mini
200K$1.10$4.40MED
OpenAI: GPT-3.5 Turbo 16k
openai/gpt-3.5-turbo-16k
16K$3.00$4.00MED
OpenAI: o4 Mini Deep Research
openai/o4-mini-deep-research
200K$2.00$8.00HIGH
OpenAI: o3
openai/o3
200K$2.00$8.00HIGH
OpenAI: GPT-4.1
openai/gpt-4.1
1.0M$2.00$8.00HIGH
OpenAI: GPT-5.1-Codex-Max
openai/gpt-5.1-codex-max
400K$1.25$10HIGH
OpenAI: GPT-5.1
openai/gpt-5.1
400K$1.25$10HIGH
OpenAI: GPT-5.1 Chat
openai/gpt-5.1-chat
128K$1.25$10HIGH
OpenAI: GPT-5.1-Codex
openai/gpt-5.1-codex
400K$1.25$10HIGH
OpenAI: GPT-5 Codex
openai/gpt-5-codex
400K$1.25$10HIGH
OpenAI: GPT-5 Chat
openai/gpt-5-chat
128K$1.25$10HIGH
OpenAI: GPT-5
openai/gpt-5
400K$1.25$10HIGH
OpenAI: GPT Audio
openai/gpt-audio
128K$2.50$10HIGH
OpenAI: GPT-4o Audio
openai/gpt-4o-audio-preview
128K$2.50$10HIGH
OpenAI: GPT-4o Search Preview
openai/gpt-4o-search-preview
128K$2.50$10HIGH
OpenAI: GPT-4o (2024-11-20)
openai/gpt-4o-2024-11-20
128K$2.50$10HIGH
OpenAI: GPT-4o (2024-08-06)
openai/gpt-4o-2024-08-06
128K$2.50$10HIGH
OpenAI: GPT-4o
openai/gpt-4o
128K$2.50$10HIGH
OpenAI: GPT-5.3 Chat
openai/gpt-5.3-chat
128K$1.75$14HIGH
OpenAI: GPT-5.3-Codex
openai/gpt-5.3-codex
400K$1.75$14HIGH
OpenAI: GPT-5.2-Codex
openai/gpt-5.2-codex
400K$1.75$14HIGH
OpenAI: GPT-5.2 Chat
openai/gpt-5.2-chat
128K$1.75$14HIGH
OpenAI: GPT-5.2
openai/gpt-5.2
400K$1.75$14HIGH
OpenAI: GPT-5.4
openai/gpt-5.4
1.1M$2.50$15HIGH
OpenAI: GPT-5 Image
openai/gpt-5-image
400K$10$10HIGH
OpenAI: GPT-4o (2024-05-13)
openai/gpt-4o-2024-05-13
128K$5.00$15HIGH
OpenAI: GPT-4o (extended)
openai/gpt-4o:extended
128K$6.00$18HIGH
OpenAI: GPT-4 Turbo
openai/gpt-4-turbo
128K$10$30HIGH
OpenAI: GPT-4 Turbo Preview
openai/gpt-4-turbo-preview
128K$10$30HIGH
OpenAI: GPT-4 Turbo (older v1106)
openai/gpt-4-1106-preview
128K$10$30HIGH
OpenAI: o3 Deep Research
openai/o3-deep-research
200K$10$40HIGH
OpenAI: o1
openai/o1
200K$15$60HIGH
OpenAI: GPT-4
openai/gpt-4
8K$30$60HIGH
OpenAI: GPT-4 (older v0314)
openai/gpt-4-0314
8K$30$60HIGH
OpenAI: o3 Pro
openai/o3-pro
200K$20$80HIGH
OpenAI: GPT-5 Pro
openai/gpt-5-pro
400K$15$120HIGH
OpenAI: GPT-5.2 Pro
openai/gpt-5.2-pro
400K$21$168HIGH
OpenAI: GPT-5.4 Pro
openai/gpt-5.4-pro
1.1M$30$180HIGH
OpenAI: o1-pro
openai/o1-pro
200K$150$600HIGH
Provider

Claude

Models
13
Pricing table for Claude models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Anthropic: Claude 3 Haiku
anthropic/claude-3-haiku
200K$0.250$1.25MED
Anthropic: Claude 3.5 Haiku
anthropic/claude-3.5-haiku
200K$0.800$4.00MED
Anthropic: Claude Haiku 4.5
anthropic/claude-haiku-4.5
200K$1.00$5.00MED
Anthropic: Claude Sonnet 4.6
anthropic/claude-sonnet-4.6
1.0M$3.00$15HIGH
Anthropic: Claude Sonnet 4.5
anthropic/claude-sonnet-4.5
1.0M$3.00$15HIGH
Anthropic: Claude Sonnet 4
anthropic/claude-sonnet-4
1.0M$3.00$15HIGH
Anthropic: Claude 3.7 Sonnet
anthropic/claude-3.7-sonnet
200K$3.00$15HIGH
Anthropic: Claude 3.7 Sonnet (thinking)
anthropic/claude-3.7-sonnet:thinking
200K$3.00$15HIGH
Anthropic: Claude Opus 4.6
anthropic/claude-opus-4.6
1.0M$5.00$25HIGH
Anthropic: Claude Opus 4.5
anthropic/claude-opus-4.5
200K$5.00$25HIGH
Anthropic: Claude Opus 4.1
anthropic/claude-opus-4.1
200K$15$75HIGH
Anthropic: Claude Opus 4
anthropic/claude-opus-4
200K$15$75HIGH
Anthropic: Claude Opus 4.6 (Fast)
anthropic/claude-opus-4.6-fast
1.0M$30$150HIGH
Provider

Gemini

Models
22
Pricing table for Gemini models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Google: Gemma 3n 4B
google/gemma-3n-e4b-it
33K$0.020$0.040LOW
Google: Gemma 3 4B
google/gemma-3-4b-it
131K$0.040$0.080LOW
Google: Gemma 3 12B
google/gemma-3-12b-it
131K$0.040$0.130LOW
Google: Gemma 3 27B
google/gemma-3-27b-it
131K$0.080$0.160LOW
Google: Gemini 2.0 Flash Lite
google/gemini-2.0-flash-lite-001
1.0M$0.075$0.300LOW
Google: Gemma 4 26B A4B
google/gemma-4-26b-a4b-it
262K$0.080$0.350LOW
Google: Gemini 2.5 Flash Lite Preview 09-2025
google/gemini-2.5-flash-lite-preview-09-2025
1.0M$0.100$0.400LOW
Google: Gemini 2.5 Flash Lite
google/gemini-2.5-flash-lite
1.0M$0.100$0.400LOW
Google: Gemini 2.0 Flash
google/gemini-2.0-flash-001
1.0M$0.100$0.400LOW
Google: Gemma 4 31B
google/gemma-4-31b-it
262K$0.130$0.380LOW
Google: Gemma 2 27B
google/gemma-2-27b-it
8K$0.650$0.650MED
Google: Gemini 3.1 Flash Lite Preview
google/gemini-3.1-flash-lite-preview
1.0M$0.250$1.50MED
Google: Nano Banana (Gemini 2.5 Flash Image)
google/gemini-2.5-flash-image
33K$0.300$2.50MED
Google: Gemini 2.5 Flash
google/gemini-2.5-flash
1.0M$0.300$2.50MED
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)
google/gemini-3.1-flash-image-preview
66K$0.500$3.00MED
Google: Gemini 3 Flash Preview
google/gemini-3-flash-preview
1.0M$0.500$3.00MED
Google: Gemini 2.5 Pro
google/gemini-2.5-pro
1.0M$1.25$10HIGH
Google: Gemini 2.5 Pro Preview 06-05
google/gemini-2.5-pro-preview
1.0M$1.25$10HIGH
Google: Gemini 2.5 Pro Preview 05-06
google/gemini-2.5-pro-preview-05-06
1.0M$1.25$10HIGH
Google: Gemini 3.1 Pro Preview Custom Tools
google/gemini-3.1-pro-preview-customtools
1.0M$2.00$12HIGH
Google: Gemini 3.1 Pro Preview
google/gemini-3.1-pro-preview
1.0M$2.00$12HIGH
Google: Nano Banana Pro (Gemini 3 Pro Image Preview)
google/gemini-3-pro-image-preview
66K$2.00$12HIGH
Provider

Grok

Models
10
Pricing table for Grok models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
xAI: Grok 4.1 Fast
x-ai/grok-4.1-fast
2.0M$0.200$0.500LOW
xAI: Grok 4 Fast
x-ai/grok-4-fast
2.0M$0.200$0.500LOW
xAI: Grok 3 Mini
x-ai/grok-3-mini
131K$0.300$0.500LOW
xAI: Grok 3 Mini Beta
x-ai/grok-3-mini-beta
131K$0.300$0.500LOW
xAI: Grok Code Fast 1
x-ai/grok-code-fast-1
256K$0.200$1.50MED
xAI: Grok 4.20 Multi-Agent
x-ai/grok-4.20-multi-agent
2.0M$2.00$6.00MED
xAI: Grok 4.20
x-ai/grok-4.20
2.0M$2.00$6.00MED
xAI: Grok 4
x-ai/grok-4
256K$3.00$15HIGH
xAI: Grok 3
x-ai/grok-3
131K$3.00$15HIGH
xAI: Grok 3 Beta
x-ai/grok-3-beta
131K$3.00$15HIGH
Provider

Qwen

Models
46
Pricing table for Qwen models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Qwen: Qwen2.5 7B Instruct
qwen/qwen-2.5-7b-instruct
33K$0.040$0.100LOW
Qwen: Qwen-Turbo
qwen/qwen-turbo
131K$0.033$0.130LOW
Qwen: Qwen3 235B A22B Instruct 2507
qwen/qwen3-235b-a22b-2507
262K$0.071$0.100LOW
Qwen: Qwen3.5-9B
qwen/qwen3.5-9b
256K$0.050$0.150LOW
Qwen: Qwen3 14B
qwen/qwen3-14b
41K$0.060$0.240LOW
Qwen: Qwen3 32B
qwen/qwen3-32b
41K$0.080$0.240LOW
Qwen: Qwen3.5-Flash
qwen/qwen3.5-flash-02-23
1.0M$0.065$0.260LOW
Qwen: Qwen3 Coder 30B A3B Instruct
qwen/qwen3-coder-30b-a3b-instruct
160K$0.070$0.270LOW
Qwen: Qwen3 30B A3B
qwen/qwen3-30b-a3b
41K$0.080$0.280LOW
Qwen: Qwen3 30B A3B Instruct 2507
qwen/qwen3-30b-a3b-instruct-2507
262K$0.090$0.300LOW
Qwen: Qwen3 8B
qwen/qwen3-8b
41K$0.050$0.400LOW
Qwen: Qwen3 30B A3B Thinking 2507
qwen/qwen3-30b-a3b-thinking-2507
131K$0.080$0.400LOW
Qwen2.5 72B Instruct
qwen/qwen-2.5-72b-instruct
33K$0.120$0.390LOW
Qwen: Qwen3 VL 32B Instruct
qwen/qwen3-vl-32b-instruct
131K$0.104$0.416LOW
Qwen: Qwen VL Plus
qwen/qwen-vl-plus
131K$0.137$0.410LOW
Qwen: Qwen3 VL 8B Instruct
qwen/qwen3-vl-8b-instruct
131K$0.080$0.500LOW
Qwen: Qwen3 VL 30B A3B Instruct
qwen/qwen3-vl-30b-a3b-instruct
131K$0.130$0.520LOW
Qwen: QwQ 32B
qwen/qwq-32b
131K$0.150$0.580LOW
Qwen: Qwen2.5 VL 32B Instruct
qwen/qwen2.5-vl-32b-instruct
128K$0.200$0.600LOW
Qwen: Qwen3 Next 80B A3B Thinking
qwen/qwen3-next-80b-a3b-thinking
131K$0.098$0.780LOW
Qwen: Qwen3 Coder Next
qwen/qwen3-coder-next
262K$0.150$0.800LOW
Qwen: Qwen Plus 0728 (thinking)
qwen/qwen-plus-2025-07-28:thinking
1.0M$0.260$0.780MED
Qwen: Qwen Plus 0728
qwen/qwen-plus-2025-07-28
1.0M$0.260$0.780MED
Qwen: Qwen-Plus
qwen/qwen-plus
1.0M$0.260$0.780MED
Qwen: Qwen3 VL 235B A22B Instruct
qwen/qwen3-vl-235b-a22b-instruct
262K$0.200$0.880MED
Qwen: Qwen3 Coder Flash
qwen/qwen3-coder-flash
1.0M$0.195$0.975MED
Qwen: Qwen3 Next 80B A3B Instruct
qwen/qwen3-next-80b-a3b-instruct
262K$0.090$1.10MED
Qwen: Qwen3 Coder 480B A35B
qwen/qwen3-coder
262K$0.220$1.00MED
Qwen: Qwen3.5-35B-A3B
qwen/qwen3.5-35b-a3b
262K$0.163$1.30MED
Qwen: Qwen3 VL 8B Thinking
qwen/qwen3-vl-8b-thinking
131K$0.117$1.36MED
Qwen: Qwen2.5 VL 72B Instruct
qwen/qwen2.5-vl-72b-instruct
33K$0.800$0.800MED
Qwen: Qwen3 235B A22B Thinking 2507
qwen/qwen3-235b-a22b-thinking-2507
131K$0.150$1.50MED
Qwen2.5 Coder 32B Instruct
qwen/qwen-2.5-coder-32b-instruct
33K$0.660$1.00MED
Qwen: Qwen3 VL 30B A3B Thinking
qwen/qwen3-vl-30b-a3b-thinking
131K$0.130$1.56MED
Qwen: Qwen3.5-27B
qwen/qwen3.5-27b
262K$0.195$1.56MED
Qwen: Qwen3.5 Plus 2026-02-15
qwen/qwen3.5-plus-02-15
1.0M$0.260$1.56MED
Qwen: Qwen3.6 Plus
qwen/qwen3.6-plus
1.0M$0.325$1.95MED
Qwen: Qwen3 235B A22B
qwen/qwen3-235b-a22b
131K$0.455$1.82MED
Qwen: Qwen3.5-122B-A10B
qwen/qwen3.5-122b-a10b
262K$0.260$2.08MED
Qwen: Qwen VL Max
qwen/qwen-vl-max
131K$0.520$2.08MED
Qwen: Qwen3.5 397B A17B
qwen/qwen3.5-397b-a17b
262K$0.390$2.34MED
Qwen: Qwen3 VL 235B A22B Thinking
qwen/qwen3-vl-235b-a22b-thinking
131K$0.260$2.60MED
Qwen: Qwen3 Coder Plus
qwen/qwen3-coder-plus
1.0M$0.650$3.25MED
Qwen: Qwen3 Max Thinking
qwen/qwen3-max-thinking
262K$0.780$3.90MED
Qwen: Qwen3 Max
qwen/qwen3-max
262K$0.780$3.90MED
Qwen: Qwen-Max
qwen/qwen-max
33K$1.04$4.16MED
Provider

DeepSeek

Models
11
Pricing table for DeepSeek models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
DeepSeek: R1 Distill Qwen 32B
deepseek/deepseek-r1-distill-qwen-32b
33K$0.290$0.290LOW
DeepSeek: DeepSeek V3.2
deepseek/deepseek-v3.2
164K$0.260$0.380LOW
DeepSeek: DeepSeek V3.2 Exp
deepseek/deepseek-v3.2-exp
164K$0.270$0.410LOW
DeepSeek: DeepSeek V3.1
deepseek/deepseek-chat-v3.1
33K$0.150$0.750LOW
DeepSeek: DeepSeek V3 0324
deepseek/deepseek-chat-v3-0324
164K$0.200$0.770LOW
DeepSeek: DeepSeek V3.1 Terminus
deepseek/deepseek-v3.1-terminus
164K$0.210$0.790LOW
DeepSeek: DeepSeek V3
deepseek/deepseek-chat
164K$0.320$0.890MED
DeepSeek: R1 Distill Llama 70B
deepseek/deepseek-r1-distill-llama-70b
131K$0.700$0.800MED
DeepSeek: DeepSeek V3.2 Speciale
deepseek/deepseek-v3.2-speciale
164K$0.400$1.20MED
DeepSeek: R1 0528
deepseek/deepseek-r1-0528
164K$0.500$2.15MED
DeepSeek: R1
deepseek/deepseek-r1
64K$0.700$2.50MED
Provider

Mistral

Models
25
Pricing table for Mistral models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Mistral: Mistral Nemo
mistralai/mistral-nemo
131K$0.020$0.040LOW
Mistral: Mistral Small 3
mistralai/mistral-small-24b-instruct-2501
33K$0.050$0.080LOW
Mistral: Ministral 3 3B 2512
mistralai/ministral-3b-2512
131K$0.100$0.100LOW
Mistral: Mistral Small 3.2 24B
mistralai/mistral-small-3.2-24b-instruct
128K$0.075$0.200LOW
Mistral: Ministral 3 8B 2512
mistralai/ministral-8b-2512
262K$0.150$0.150LOW
Mistral: Mistral 7B Instruct v0.1
mistralai/mistral-7b-instruct-v0.1
3K$0.110$0.190LOW
Mistral: Mistral Small Creative
mistralai/mistral-small-creative
33K$0.100$0.300LOW
Mistral: Ministral 3 14B 2512
mistralai/ministral-14b-2512
262K$0.200$0.200LOW
Mistral: Voxtral Small 24B 2507
mistralai/voxtral-small-24b-2507
32K$0.100$0.300LOW
Mistral: Devstral Small 1.1
mistralai/devstral-small
131K$0.100$0.300LOW
Mistral: Mistral Small 4
mistralai/mistral-small-2603
262K$0.150$0.600LOW
Mistral: Saba
mistralai/mistral-saba
33K$0.200$0.600LOW
Mistral: Mistral Small 3.1 24B
mistralai/mistral-small-3.1-24b-instruct
128K$0.350$0.560LOW
Mistral: Mixtral 8x7B Instruct
mistralai/mixtral-8x7b-instruct
33K$0.540$0.540MED
Mistral: Codestral 2508
mistralai/codestral-2508
256K$0.300$0.900MED
Mistral: Mistral Large 3 2512
mistralai/mistral-large-2512
262K$0.500$1.50MED
Mistral: Devstral 2 2512
mistralai/devstral-2512
262K$0.400$2.00MED
Mistral: Mistral Medium 3.1
mistralai/mistral-medium-3.1
131K$0.400$2.00MED
Mistral: Devstral Medium
mistralai/devstral-medium
131K$0.400$2.00MED
Mistral: Mistral Medium 3
mistralai/mistral-medium-3
131K$0.400$2.00MED
Mistral Large 2411
mistralai/mistral-large-2411
131K$2.00$6.00MED
Mistral Large 2407
mistralai/mistral-large-2407
131K$2.00$6.00MED
Mistral: Pixtral Large 2411
mistralai/pixtral-large-2411
131K$2.00$6.00MED
Mistral: Mixtral 8x22B Instruct
mistralai/mixtral-8x22b-instruct
66K$2.00$6.00MED
Mistral Large
mistralai/mistral-large
128K$2.00$6.00MED
Provider

Cohere

Models
4
Pricing table for Cohere models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Cohere: Command R7B (12-2024)
cohere/command-r7b-12-2024
128K$0.037$0.150LOW
Cohere: Command R (08-2024)
cohere/command-r-08-2024
128K$0.150$0.600LOW
Cohere: Command A
cohere/command-a
256K$2.50$10HIGH
Cohere: Command R+ (08-2024)
cohere/command-r-plus-08-2024
128K$2.50$10HIGH
Provider

MoonshotAI

Models
4
Pricing table for MoonshotAI models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
MoonshotAI: Kimi K2.5
moonshotai/kimi-k2.5
262K$0.383$1.72MED
MoonshotAI: Kimi K2 0905
moonshotai/kimi-k2-0905
262K$0.400$2.00MED
MoonshotAI: Kimi K2 0711
moonshotai/kimi-k2
131K$0.570$2.30MED
MoonshotAI: Kimi K2 Thinking
moonshotai/kimi-k2-thinking
262K$0.600$2.50MED
Provider

ByteDance

Models
1
Pricing table for ByteDance models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
ByteDance: UI-TARS 7B
bytedance/ui-tars-1.5-7b
128K$0.100$0.200LOW
Provider

DeepCogito

Models
1
Pricing table for DeepCogito models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Deep Cogito: Cogito v2.1 671B
deepcogito/cogito-v2.1-671b
128K$1.25$1.25MED
Provider

Baidu

Models
5
Pricing table for Baidu models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Baidu: ERNIE 4.5 21B A3B Thinking
baidu/ernie-4.5-21b-a3b-thinking
131K$0.070$0.280LOW
Baidu: ERNIE 4.5 21B A3B
baidu/ernie-4.5-21b-a3b
120K$0.070$0.280LOW
Baidu: ERNIE 4.5 VL 28B A3B
baidu/ernie-4.5-vl-28b-a3b
30K$0.140$0.560LOW
Baidu: ERNIE 4.5 300B A47B
baidu/ernie-4.5-300b-a47b
123K$0.280$1.10MED
Baidu: ERNIE 4.5 VL 424B A47B
baidu/ernie-4.5-vl-424b-a47b
123K$0.420$1.25MED
Provider

Z-AI

Models
12
Pricing table for Z-AI models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Z.ai: GLM 4 32B
z-ai/glm-4-32b
128K$0.100$0.100LOW
Z.ai: GLM 4.7 Flash
z-ai/glm-4.7-flash
203K$0.060$0.400LOW
Z.ai: GLM 4.5 Air
z-ai/glm-4.5-air
131K$0.130$0.850LOW
Z.ai: GLM 4.6V
z-ai/glm-4.6v
131K$0.300$0.900MED
Z.ai: GLM 4.7
z-ai/glm-4.7
203K$0.390$1.75MED
Z.ai: GLM 4.6
z-ai/glm-4.6
205K$0.390$1.90MED
Z.ai: GLM 4.5V
z-ai/glm-4.5v
66K$0.600$1.80MED
Z.ai: GLM 4.5
z-ai/glm-4.5
131K$0.600$2.20MED
Z.ai: GLM 5
z-ai/glm-5
80K$0.720$2.30MED
Z.ai: GLM 5.1
z-ai/glm-5.1
203K$0.950$3.15MED
Z.ai: GLM 5V Turbo
z-ai/glm-5v-turbo
203K$1.20$4.00MED
Z.ai: GLM 5 Turbo
z-ai/glm-5-turbo
203K$1.20$4.00MED
Provider

Tencent

Models
1
Pricing table for Tencent models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Tencent: Hunyuan A13B Instruct
tencent/hunyuan-a13b-instruct
131K$0.140$0.570LOW
Provider

MiniMax

Models
7
Pricing table for MiniMax models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
MiniMax: MiniMax M2.5
minimax/minimax-m2.5
197K$0.118$0.990MED
MiniMax: MiniMax M2.1
minimax/minimax-m2.1
197K$0.290$0.950MED
MiniMax: MiniMax M2
minimax/minimax-m2
197K$0.255$1.00MED
MiniMax: MiniMax-01
minimax/minimax-01
1.0M$0.200$1.10MED
MiniMax: MiniMax M2.7
minimax/minimax-m2.7
197K$0.300$1.20MED
MiniMax: MiniMax M2-her
minimax/minimax-m2-her
66K$0.300$1.20MED
MiniMax: MiniMax M1
minimax/minimax-m1
1.0M$0.400$2.20MED
Provider

Meta-Llama

Models
12
Pricing table for Meta-Llama models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Meta: Llama 3.1 8B Instruct
meta-llama/llama-3.1-8b-instruct
16K$0.020$0.050LOW
Meta: Llama 3 8B Instruct
meta-llama/llama-3-8b-instruct
8K$0.030$0.040LOW
Meta: Llama 3.2 1B Instruct
meta-llama/llama-3.2-1b-instruct
60K$0.027$0.200LOW
Meta: Llama Guard 4 12B
meta-llama/llama-guard-4-12b
164K$0.180$0.180LOW
Meta: Llama 4 Scout
meta-llama/llama-4-scout
328K$0.080$0.300LOW
Meta: Llama 3.2 3B Instruct
meta-llama/llama-3.2-3b-instruct
80K$0.051$0.340LOW
Meta: Llama 3.3 70B Instruct
meta-llama/llama-3.3-70b-instruct
131K$0.100$0.320LOW
Meta: Llama 3.2 11B Vision Instruct
meta-llama/llama-3.2-11b-vision-instruct
131K$0.245$0.245LOW
Llama Guard 3 8B
meta-llama/llama-guard-3-8b
131K$0.480$0.030LOW
Meta: Llama 4 Maverick
meta-llama/llama-4-maverick
1.0M$0.150$0.600LOW
Meta: Llama 3.1 70B Instruct
meta-llama/llama-3.1-70b-instruct
131K$0.400$0.400LOW
Meta: Llama 3 70B Instruct
meta-llama/llama-3-70b-instruct
8K$0.510$0.740MED
Provider

Microsoft

Models
2
Pricing table for Microsoft models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Microsoft: Phi 4
microsoft/phi-4
16K$0.065$0.140LOW
WizardLM-2 8x22B
microsoft/wizardlm-2-8x22b
66K$0.620$0.620MED
Provider

NVIDIA

Models
6
Pricing table for NVIDIA models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
NVIDIA: Nemotron Nano 9B V2
nvidia/nemotron-nano-9b-v2
131K$0.040$0.160LOW
NVIDIA: Nemotron 3 Nano 30B A3B
nvidia/nemotron-3-nano-30b-a3b
262K$0.050$0.200LOW
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5
nvidia/llama-3.3-nemotron-super-49b-v1.5
131K$0.100$0.400LOW
NVIDIA: Nemotron 3 Super
nvidia/nemotron-3-super-120b-a12b
262K$0.100$0.500LOW
NVIDIA: Nemotron Nano 12B 2 VL
nvidia/nemotron-nano-12b-v2-vl
131K$0.200$0.600LOW
NVIDIA: Llama 3.1 Nemotron 70B Instruct
nvidia/llama-3.1-nemotron-70b-instruct
131K$1.20$1.20MED
Provider

Perplexity

Models
5
Pricing table for Perplexity models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Perplexity: Sonar
perplexity/sonar
127K$1.00$1.00MED
Perplexity: Sonar Reasoning Pro
perplexity/sonar-reasoning-pro
128K$2.00$8.00HIGH
Perplexity: Sonar Deep Research
perplexity/sonar-deep-research
128K$2.00$8.00HIGH
Perplexity: Sonar Pro Search
perplexity/sonar-pro-search
200K$3.00$15HIGH
Perplexity: Sonar Pro
perplexity/sonar-pro
200K$3.00$15HIGH
Provider

Amazon

Models
5
Pricing table for Amazon models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Amazon: Nova Micro 1.0
amazon/nova-micro-v1
128K$0.035$0.140LOW
Amazon: Nova Lite 1.0
amazon/nova-lite-v1
300K$0.060$0.240LOW
Amazon: Nova 2 Lite
amazon/nova-2-lite-v1
1.0M$0.300$2.50MED
Amazon: Nova Pro 1.0
amazon/nova-pro-v1
300K$0.800$3.20MED
Amazon: Nova Premier 1.0
amazon/nova-premier-v1
1.0M$2.50$13HIGH
Provider

arcee-ai

Models
6
Pricing table for arcee-ai models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Arcee AI: Trinity Mini
arcee-ai/trinity-mini
131K$0.045$0.150LOW
Arcee AI: Spotlight
arcee-ai/spotlight
131K$0.180$0.180LOW
Arcee AI: Trinity Large Thinking
arcee-ai/trinity-large-thinking
262K$0.220$0.850MED
Arcee AI: Coder Large
arcee-ai/coder-large
33K$0.500$0.800MED
Arcee AI: Virtuoso Large
arcee-ai/virtuoso-large
131K$0.750$1.20MED
Arcee AI: Maestro Reasoning
arcee-ai/maestro-reasoning
131K$0.900$3.30MED
Provider

kwaipilot

Models
1
Pricing table for kwaipilot models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Kwaipilot: KAT-Coder-Pro V2
kwaipilot/kat-coder-pro-v2
256K$0.300$1.20MED
Provider

rekaai

Models
2
Pricing table for rekaai models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Reka Edge
rekaai/reka-edge
16K$0.100$0.100LOW
Reka Flash 3
rekaai/reka-flash-3
66K$0.100$0.200LOW
Provider

xiaomi

Models
3
Pricing table for xiaomi models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Xiaomi: MiMo-V2-Flash
xiaomi/mimo-v2-flash
262K$0.090$0.290LOW
Xiaomi: MiMo-V2-Omni
xiaomi/mimo-v2-omni
262K$0.400$2.00MED
Xiaomi: MiMo-V2-Pro
xiaomi/mimo-v2-pro
1.0M$1.00$3.00MED
Provider

bytedance-seed

Models
4
Pricing table for bytedance-seed models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
ByteDance Seed: Seed 1.6 Flash
bytedance-seed/seed-1.6-flash
262K$0.075$0.300LOW
ByteDance Seed: Seed-2.0-Mini
bytedance-seed/seed-2.0-mini
262K$0.100$0.400LOW
ByteDance Seed: Seed-2.0-Lite
bytedance-seed/seed-2.0-lite
262K$0.250$2.00MED
ByteDance Seed: Seed 1.6
bytedance-seed/seed-1.6
262K$0.250$2.00MED
Provider

inception

Models
1
Pricing table for inception models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Inception: Mercury 2
inception/mercury-2
128K$0.250$0.750MED
Provider

liquid

Models
1
Pricing table for liquid models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
LiquidAI: LFM2-24B-A2B
liquid/lfm-2-24b-a2b
33K$0.030$0.120LOW
Provider

aion-labs

Models
4
Pricing table for aion-labs models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
AionLabs: Aion-1.0-Mini
aion-labs/aion-1.0-mini
131K$0.700$1.40MED
AionLabs: Aion-2.0
aion-labs/aion-2.0
131K$0.800$1.60MED
AionLabs: Aion-RP 1.0 (8B)
aion-labs/aion-rp-llama-3.1-8b
33K$0.800$1.60MED
AionLabs: Aion-1.0
aion-labs/aion-1.0
131K$4.00$8.00HIGH
Provider

stepfun

Models
1
Pricing table for stepfun models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
StepFun: Step 3.5 Flash
stepfun/step-3.5-flash
262K$0.100$0.300LOW
Provider

upstage

Models
1
Pricing table for upstage models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Upstage: Solar Pro 3
upstage/solar-pro-3
128K$0.150$0.600LOW
Provider

writer

Models
1
Pricing table for writer models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Writer: Palmyra X5
writer/palmyra-x5
1.0M$0.600$6.00MED
Provider

allenai

Models
3
Pricing table for allenai models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
AllenAI: Olmo 2 32B Instruct
allenai/olmo-2-0325-32b-instruct
128K$0.050$0.200LOW
AllenAI: Olmo 3 32B Think
allenai/olmo-3-32b-think
66K$0.150$0.500LOW
AllenAI: Olmo 3.1 32B Instruct
allenai/olmo-3.1-32b-instruct
66K$0.200$0.600LOW
Provider

relace

Models
2
Pricing table for relace models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Relace: Relace Apply 3
relace/relace-apply-3
256K$0.850$1.25MED
Relace: Relace Search
relace/relace-search
256K$1.00$3.00MED
Provider

nex-agi

Models
1
Pricing table for nex-agi models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Nex AGI: DeepSeek V3.1 Nex N1
nex-agi/deepseek-v3.1-nex-n1
131K$0.135$0.500LOW
Provider

essentialai

Models
1
Pricing table for essentialai models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
EssentialAI: Rnj 1 Instruct
essentialai/rnj-1-instruct
33K$0.150$0.150LOW
Provider

prime-intellect

Models
1
Pricing table for prime-intellect models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Prime Intellect: INTELLECT-3
prime-intellect/intellect-3
131K$0.200$1.10MED
Provider

ibm-granite

Models
1
Pricing table for ibm-granite models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
IBM: Granite 4.0 Micro
ibm-granite/granite-4.0-h-micro
131K$0.017$0.110LOW
Provider

thedrummer

Models
4
Pricing table for thedrummer models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
TheDrummer: Rocinante 12B
thedrummer/rocinante-12b
33K$0.170$0.430LOW
TheDrummer: UnslopNemo 12B
thedrummer/unslopnemo-12b
33K$0.400$0.400LOW
TheDrummer: Cydonia 24B V4.1
thedrummer/cydonia-24b-v4.1
131K$0.300$0.500LOW
TheDrummer: Skyfall 36B V2
thedrummer/skyfall-36b-v2
33K$0.550$0.800MED
Provider

alibaba

Models
1
Pricing table for alibaba models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Tongyi DeepResearch 30B A3B
alibaba/tongyi-deepresearch-30b-a3b
131K$0.090$0.450LOW
Provider

nousresearch

Models
5
Pricing table for nousresearch models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
NousResearch: Hermes 2 Pro - Llama-3 8B
nousresearch/hermes-2-pro-llama-3-8b
8K$0.140$0.140LOW
Nous: Hermes 4 70B
nousresearch/hermes-4-70b
131K$0.130$0.400LOW
Nous: Hermes 3 70B Instruct
nousresearch/hermes-3-llama-3.1-70b
131K$0.300$0.300LOW
Nous: Hermes 3 405B Instruct
nousresearch/hermes-3-llama-3.1-405b
131K$1.00$1.00MED
Nous: Hermes 4 405B
nousresearch/hermes-4-405b
131K$1.00$3.00MED
Provider

ai21

Models
1
Pricing table for ai21 models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
AI21: Jamba Large 1.7
ai21/jamba-large-1.7
256K$2.00$8.00HIGH
Provider

switchpoint

Models
1
Pricing table for switchpoint models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Switchpoint Router
switchpoint/router
131K$0.850$3.40MED
Provider

tngtech

Models
1
Pricing table for tngtech models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
TNG: DeepSeek R1T2 Chimera
tngtech/deepseek-r1t2-chimera
164K$0.300$1.10MED
Provider

morph

Models
2
Pricing table for morph models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Morph: Morph V3 Fast
morph/morph-v3-fast
82K$0.800$1.20MED
Morph: Morph V3 Large
morph/morph-v3-large
262K$0.900$1.90MED
Provider

alfredpros

Models
1
Pricing table for alfredpros models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
AlfredPros: CodeLLaMa 7B Instruct Solidity
alfredpros/codellama-7b-instruct-solidity
4K$0.800$1.20MED
Provider

sao10k

Models
5
Pricing table for sao10k models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Sao10K: Llama 3 8B Lunaris
sao10k/l3-lunaris-8b
8K$0.040$0.050LOW
Sao10K: Llama 3.3 Euryale 70B
sao10k/l3.3-euryale-70b
131K$0.650$0.750MED
Sao10K: Llama 3.1 Euryale 70B v2.2
sao10k/l3.1-euryale-70b
131K$0.850$0.850MED
Sao10k: Llama 3 Euryale 70B v2.1
sao10k/l3-euryale-70b
8K$1.48$1.48MED
Sao10K: Llama 3.1 70B Hanami x1
sao10k/l3.1-70b-hanami-x1
16K$3.00$3.00MED
Provider

anthracite-org

Models
1
Pricing table for anthracite-org models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Magnum v4 72B
anthracite-org/magnum-v4-72b
16K$3.00$5.00MED
Provider

inflection

Models
2
Pricing table for inflection models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Inflection: Inflection 3 Pi
inflection/inflection-3-pi
8K$2.50$10HIGH
Inflection: Inflection 3 Productivity
inflection/inflection-3-productivity
8K$2.50$10HIGH
Provider

alpindale

Models
1
Pricing table for alpindale models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Goliath 120B
alpindale/goliath-120b
6K$3.75$7.50HIGH
Provider

mancer

Models
1
Pricing table for mancer models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Mancer: Weaver (alpha)
mancer/weaver
8K$0.750$1.00MED
Provider

undi95

Models
1
Pricing table for undi95 models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
ReMM SLERP 13B
undi95/remm-slerp-l2-13b
6K$0.450$0.650MED
Provider

gryphe

Models
1
Pricing table for gryphe models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
MythoMax 13B
gryphe/mythomax-l2-13b
4K$0.060$0.060LOW