Skip to main content

LLM TOKEN
COST CALC

Compare token pricing across 314 LLM models from 54 AI providers

Total Models
314
Providers
54
Last Updated
8 minutes ago
Price data loaded successfully. Showing 314 models from 54 providers.

Model Pricing by Provider

All prices shown are per million tokens in USD. Click the copy button on any row to copy the model ID to clipboard.
Provider

GPT

Models
62
Pricing table for GPT models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
OpenAI: gpt-oss-20b
openai/gpt-oss-20b
131K$0.029$0.140LOW
OpenAI: gpt-oss-120b
openai/gpt-oss-120b
131K$0.039$0.180LOW
OpenAI: gpt-oss-safeguard-20b
openai/gpt-oss-safeguard-20b
131K$0.075$0.300LOW
OpenAI: GPT-5 Nano
openai/gpt-5-nano
400K$0.050$0.400LOW
OpenAI: GPT-4.1 Nano
openai/gpt-4.1-nano
1.0M$0.100$0.400LOW
OpenAI: GPT-4o-mini Search Preview
openai/gpt-4o-mini-search-preview
128K$0.150$0.600LOW
OpenAI: GPT-4o-mini (2024-07-18)
openai/gpt-4o-mini-2024-07-18
128K$0.150$0.600LOW
OpenAI: GPT-4o-mini
openai/gpt-4o-mini
128K$0.150$0.600LOW
OpenAI: GPT-5.4 Nano
openai/gpt-5.4-nano
400K$0.200$1.25MED
OpenAI: GPT-4.1 Mini
openai/gpt-4.1-mini
1.0M$0.400$1.60MED
OpenAI: GPT-3.5 Turbo
openai/gpt-3.5-turbo
16K$0.500$1.50MED
OpenAI: GPT-5.1-Codex-Mini
openai/gpt-5.1-codex-mini
400K$0.250$2.00MED
OpenAI: GPT-5 Mini
openai/gpt-5-mini
400K$0.250$2.00MED
OpenAI: GPT Audio Mini
openai/gpt-audio-mini
128K$0.600$2.40MED
OpenAI: GPT-3.5 Turbo (older v0613)
openai/gpt-3.5-turbo-0613
4K$1.00$2.00MED
OpenAI: GPT-3.5 Turbo Instruct
openai/gpt-3.5-turbo-instruct
4K$1.50$2.00MED
OpenAI: GPT-5 Image Mini
openai/gpt-5-image-mini
400K$2.50$2.00MED
OpenAI: GPT-5.4 Mini
openai/gpt-5.4-mini
400K$0.750$4.50MED
OpenAI: o4 Mini High
openai/o4-mini-high
200K$1.10$4.40MED
OpenAI: o4 Mini
openai/o4-mini
200K$1.10$4.40MED
OpenAI: o3 Mini High
openai/o3-mini-high
200K$1.10$4.40MED
OpenAI: o3 Mini
openai/o3-mini
200K$1.10$4.40MED
OpenAI: GPT-3.5 Turbo 16k
openai/gpt-3.5-turbo-16k
16K$3.00$4.00MED
OpenAI: o4 Mini Deep Research
openai/o4-mini-deep-research
200K$2.00$8.00HIGH
OpenAI: o3
openai/o3
200K$2.00$8.00HIGH
OpenAI: GPT-4.1
openai/gpt-4.1
1.0M$2.00$8.00HIGH
OpenAI: GPT-5.1-Codex-Max
openai/gpt-5.1-codex-max
400K$1.25$10HIGH
OpenAI: GPT-5.1
openai/gpt-5.1
400K$1.25$10HIGH
OpenAI: GPT-5.1 Chat
openai/gpt-5.1-chat
128K$1.25$10HIGH
OpenAI: GPT-5.1-Codex
openai/gpt-5.1-codex
400K$1.25$10HIGH
OpenAI: GPT-5 Codex
openai/gpt-5-codex
400K$1.25$10HIGH
OpenAI: GPT-5 Chat
openai/gpt-5-chat
128K$1.25$10HIGH
OpenAI: GPT-5
openai/gpt-5
400K$1.25$10HIGH
OpenAI: GPT Audio
openai/gpt-audio
128K$2.50$10HIGH
OpenAI: GPT-4o Search Preview
openai/gpt-4o-search-preview
128K$2.50$10HIGH
OpenAI: GPT-4o (2024-11-20)
openai/gpt-4o-2024-11-20
128K$2.50$10HIGH
OpenAI: GPT-4o (2024-08-06)
openai/gpt-4o-2024-08-06
128K$2.50$10HIGH
OpenAI: GPT-4o
openai/gpt-4o
128K$2.50$10HIGH
OpenAI: GPT-5.3 Chat
openai/gpt-5.3-chat
128K$1.75$14HIGH
OpenAI: GPT-5.3-Codex
openai/gpt-5.3-codex
400K$1.75$14HIGH
OpenAI: GPT-5.2-Codex
openai/gpt-5.2-codex
400K$1.75$14HIGH
OpenAI: GPT-5.2 Chat
openai/gpt-5.2-chat
128K$1.75$14HIGH
OpenAI: GPT-5.2
openai/gpt-5.2
400K$1.75$14HIGH
OpenAI: GPT-5.4
openai/gpt-5.4
1.1M$2.50$15HIGH
OpenAI: GPT-5 Image
openai/gpt-5-image
400K$10$10HIGH
OpenAI: GPT-4o (2024-05-13)
openai/gpt-4o-2024-05-13
128K$5.00$15HIGH
OpenAI: GPT-5.4 Image 2
openai/gpt-5.4-image-2
272K$8.00$15HIGH
OpenAI: GPT Chat Latest
openai/gpt-chat-latest
400K$5.00$30HIGH
OpenAI: GPT-5.5
openai/gpt-5.5
1.1M$5.00$30HIGH
OpenAI: GPT-4 Turbo
openai/gpt-4-turbo
128K$10$30HIGH
OpenAI: GPT-4 Turbo Preview
openai/gpt-4-turbo-preview
128K$10$30HIGH
OpenAI: GPT-4 Turbo (older v1106)
openai/gpt-4-1106-preview
128K$10$30HIGH
OpenAI: o3 Deep Research
openai/o3-deep-research
200K$10$40HIGH
OpenAI: o1
openai/o1
200K$15$60HIGH
OpenAI: GPT-4 (older v0314)
openai/gpt-4-0314
8K$30$60HIGH
OpenAI: GPT-4
openai/gpt-4
8K$30$60HIGH
OpenAI: o3 Pro
openai/o3-pro
200K$20$80HIGH
OpenAI: GPT-5 Pro
openai/gpt-5-pro
400K$15$120HIGH
OpenAI: GPT-5.2 Pro
openai/gpt-5.2-pro
400K$21$168HIGH
OpenAI: GPT-5.5 Pro
openai/gpt-5.5-pro
1.1M$30$180HIGH
OpenAI: GPT-5.4 Pro
openai/gpt-5.4-pro
1.1M$30$180HIGH
OpenAI: o1-pro
openai/o1-pro
200K$150$600HIGH
Provider

Claude

Models
15
Pricing table for Claude models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Anthropic: Claude 3 Haiku
anthropic/claude-3-haiku
200K$0.250$1.25MED
Anthropic: Claude 3.5 Haiku
anthropic/claude-3.5-haiku
200K$0.800$4.00MED
Anthropic: Claude Haiku 4.5
anthropic/claude-haiku-4.5
200K$1.00$5.00MED
Anthropic: Claude Sonnet 4.6
anthropic/claude-sonnet-4.6
1.0M$3.00$15HIGH
Anthropic: Claude Sonnet 4.5
anthropic/claude-sonnet-4.5
1.0M$3.00$15HIGH
Anthropic: Claude Sonnet 4
anthropic/claude-sonnet-4
1.0M$3.00$15HIGH
Anthropic: Claude Opus 4.8
anthropic/claude-opus-4.8
1.0M$5.00$25HIGH
Anthropic: Claude Opus 4.7
anthropic/claude-opus-4.7
1.0M$5.00$25HIGH
Anthropic: Claude Opus 4.6
anthropic/claude-opus-4.6
1.0M$5.00$25HIGH
Anthropic: Claude Opus 4.5
anthropic/claude-opus-4.5
200K$5.00$25HIGH
Anthropic: Claude Opus 4.8 (Fast)
anthropic/claude-opus-4.8-fast
1.0M$10$50HIGH
Anthropic: Claude Opus 4.1
anthropic/claude-opus-4.1
200K$15$75HIGH
Anthropic: Claude Opus 4
anthropic/claude-opus-4
200K$15$75HIGH
Anthropic: Claude Opus 4.7 (Fast)
anthropic/claude-opus-4.7-fast
1.0M$30$150HIGH
Anthropic: Claude Opus 4.6 (Fast)
anthropic/claude-opus-4.6-fast
1.0M$30$150HIGH
Provider

Gemini

Models
22
Pricing table for Gemini models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Google: Gemma 3 4B
google/gemma-3-4b-it
131K$0.040$0.080LOW
Google: Gemma 3 12B
google/gemma-3-12b-it
131K$0.040$0.130LOW
Google: Gemma 3n 4B
google/gemma-3n-e4b-it
33K$0.060$0.120LOW
Google: Gemma 3 27B
google/gemma-3-27b-it
131K$0.080$0.160LOW
Google: Gemma 4 26B A4B
google/gemma-4-26b-a4b-it
262K$0.060$0.330LOW
Google: Gemma 4 31B
google/gemma-4-31b-it
262K$0.120$0.370LOW
Google: Gemini 2.5 Flash Lite Preview 09-2025
google/gemini-2.5-flash-lite-preview-09-2025
1.0M$0.100$0.400LOW
Google: Gemini 2.5 Flash Lite
google/gemini-2.5-flash-lite
1.0M$0.100$0.400LOW
Google: Gemma 2 27B
google/gemma-2-27b-it
8K$0.650$0.650MED
Google: Gemini 3.1 Flash Lite
google/gemini-3.1-flash-lite
1.0M$0.250$1.50MED
Google: Gemini 3.1 Flash Lite Preview
google/gemini-3.1-flash-lite-preview
1.0M$0.250$1.50MED
Google: Nano Banana (Gemini 2.5 Flash Image)
google/gemini-2.5-flash-image
33K$0.300$2.50MED
Google: Gemini 2.5 Flash
google/gemini-2.5-flash
1.0M$0.300$2.50MED
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)
google/gemini-3.1-flash-image-preview
131K$0.500$3.00MED
Google: Gemini 3 Flash Preview
google/gemini-3-flash-preview
1.0M$0.500$3.00MED
Google: Gemini 3.5 Flash
google/gemini-3.5-flash
1.0M$1.50$9.00HIGH
Google: Gemini 2.5 Pro
google/gemini-2.5-pro
1.0M$1.25$10HIGH
Google: Gemini 2.5 Pro Preview 06-05
google/gemini-2.5-pro-preview
1.0M$1.25$10HIGH
Google: Gemini 2.5 Pro Preview 05-06
google/gemini-2.5-pro-preview-05-06
1.0M$1.25$10HIGH
Google: Gemini 3.1 Pro Preview Custom Tools
google/gemini-3.1-pro-preview-customtools
1.0M$2.00$12HIGH
Google: Gemini 3.1 Pro Preview
google/gemini-3.1-pro-preview
1.0M$2.00$12HIGH
Google: Nano Banana Pro (Gemini 3 Pro Image Preview)
google/gemini-3-pro-image-preview
66K$2.00$12HIGH
Provider

Grok

Models
4
Pricing table for Grok models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
xAI: Grok Build 0.1
x-ai/grok-build-0.1
256K$1.00$2.00MED
xAI: Grok 4.3
x-ai/grok-4.3
1.0M$1.25$2.50MED
xAI: Grok 4.20
x-ai/grok-4.20
2.0M$1.25$2.50MED
xAI: Grok 4.20 Multi-Agent
x-ai/grok-4.20-multi-agent
2.0M$2.00$6.00MED
Provider

Qwen

Models
46
Pricing table for Qwen models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Qwen: Qwen2.5 7B Instruct
qwen/qwen-2.5-7b-instruct
131K$0.040$0.100LOW
Qwen: Qwen3 235B A22B Instruct 2507
qwen/qwen3-235b-a22b-2507
262K$0.071$0.100LOW
Qwen: Qwen3.5-9B
qwen/qwen3.5-9b
262K$0.040$0.150LOW
Qwen: Qwen3 235B A22B Thinking 2507
qwen/qwen3-235b-a22b-thinking-2507
262K$0.100$0.100LOW
Qwen: Qwen3 30B A3B Instruct 2507
qwen/qwen3-30b-a3b-instruct-2507
131K$0.043$0.172LOW
Qwen: Qwen3.5-Flash
qwen/qwen3.5-flash-02-23
1.0M$0.065$0.260LOW
Qwen: Qwen3 14B
qwen/qwen3-14b
132K$0.100$0.240LOW
Qwen: Qwen3 Coder 30B A3B Instruct
qwen/qwen3-coder-30b-a3b-instruct
160K$0.070$0.270LOW
Qwen: Qwen3 32B
qwen/qwen3-32b
131K$0.080$0.280LOW
Qwen: Qwen3 8B
qwen/qwen3-8b
131K$0.050$0.400LOW
Qwen: Qwen3 30B A3B Thinking 2507
qwen/qwen3-30b-a3b-thinking-2507
131K$0.080$0.400LOW
Qwen: Qwen3 VL 32B Instruct
qwen/qwen3-vl-32b-instruct
262K$0.104$0.416LOW
Qwen: Qwen3 30B A3B
qwen/qwen3-30b-a3b
131K$0.090$0.450LOW
Qwen: Qwen3 VL 8B Instruct
qwen/qwen3-vl-8b-instruct
256K$0.080$0.500LOW
Qwen: Qwen3 VL 30B A3B Instruct
qwen/qwen3-vl-30b-a3b-instruct
262K$0.130$0.520LOW
Qwen2.5 72B Instruct
qwen/qwen-2.5-72b-instruct
131K$0.360$0.400LOW
Qwen: Qwen3 Next 80B A3B Thinking
qwen/qwen3-next-80b-a3b-thinking
262K$0.098$0.780LOW
Qwen: Qwen3 Coder Next
qwen/qwen3-coder-next
262K$0.110$0.800LOW
Qwen: Qwen2.5 VL 72B Instruct
qwen/qwen2.5-vl-72b-instruct
131K$0.250$0.750MED
Qwen: Qwen Plus 0728 (thinking)
qwen/qwen-plus-2025-07-28:thinking
1.0M$0.260$0.780MED
Qwen: Qwen Plus 0728
qwen/qwen-plus-2025-07-28
1.0M$0.260$0.780MED
Qwen: Qwen-Plus
qwen/qwen-plus
1.0M$0.260$0.780MED
Qwen: Qwen3 VL 235B A22B Instruct
qwen/qwen3-vl-235b-a22b-instruct
262K$0.200$0.880MED
Qwen: Qwen3.6 35B A3B
qwen/qwen3.6-35b-a3b
262K$0.140$1.00MED
Qwen: Qwen3.5-35B-A3B
qwen/qwen3.5-35b-a3b
262K$0.140$1.00MED
Qwen: Qwen3 Coder Flash
qwen/qwen3-coder-flash
1.0M$0.195$0.975MED
Qwen: Qwen3 Next 80B A3B Instruct
qwen/qwen3-next-80b-a3b-instruct
262K$0.090$1.10MED
Qwen: Qwen3.6 Flash
qwen/qwen3.6-flash
1.0M$0.188$1.13MED
Qwen: Qwen3 VL 8B Thinking
qwen/qwen3-vl-8b-thinking
256K$0.117$1.36MED
Qwen2.5 Coder 32B Instruct
qwen/qwen-2.5-coder-32b-instruct
128K$0.660$1.00MED
Qwen: Qwen3 VL 30B A3B Thinking
qwen/qwen3-vl-30b-a3b-thinking
131K$0.130$1.56MED
Qwen: Qwen3.5-27B
qwen/qwen3.5-27b
262K$0.195$1.56MED
Qwen: Qwen3.5 Plus 2026-02-15
qwen/qwen3.5-plus-02-15
1.0M$0.260$1.56MED
Qwen: Qwen3 Coder 480B A35B
qwen/qwen3-coder
1.0M$0.220$1.80MED
Qwen: Qwen3.5 Plus 2026-04-20
qwen/qwen3.5-plus-20260420
1.0M$0.300$1.80MED
Qwen: Qwen3.6 Plus
qwen/qwen3.6-plus
1.0M$0.325$1.95MED
Qwen: Qwen3 235B A22B
qwen/qwen3-235b-a22b
131K$0.455$1.82MED
Qwen: Qwen3.5-122B-A10B
qwen/qwen3.5-122b-a10b
262K$0.260$2.08MED
Qwen: Qwen3.5 397B A17B
qwen/qwen3.5-397b-a17b
262K$0.390$2.34MED
Qwen: Qwen3 VL 235B A22B Thinking
qwen/qwen3-vl-235b-a22b-thinking
131K$0.260$2.60MED
Qwen: Qwen3.6 27B
qwen/qwen3.6-27b
262K$0.290$3.20MED
Qwen: Qwen3 Coder Plus
qwen/qwen3-coder-plus
1.0M$0.650$3.25MED
Qwen: Qwen3 Max Thinking
qwen/qwen3-max-thinking
262K$0.780$3.90MED
Qwen: Qwen3 Max
qwen/qwen3-max
262K$0.780$3.90MED
Qwen: Qwen3.7 Max
qwen/qwen3.7-max
1.0M$1.25$3.75MED
Qwen: Qwen3.6 Max Preview
qwen/qwen3.6-max-preview
262K$1.04$6.24MED
Provider

DeepSeek

Models
12
Pricing table for DeepSeek models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
DeepSeek: DeepSeek V4 Flash
deepseek/deepseek-v4-flash
1.0M$0.098$0.197LOW
DeepSeek: DeepSeek V3.2
deepseek/deepseek-v3.2
131K$0.229$0.343LOW
DeepSeek: R1 Distill Qwen 32B
deepseek/deepseek-r1-distill-qwen-32b
128K$0.290$0.290LOW
DeepSeek: DeepSeek V3.2 Exp
deepseek/deepseek-v3.2-exp
164K$0.270$0.410LOW
DeepSeek: DeepSeek V3 0324
deepseek/deepseek-chat-v3-0324
164K$0.200$0.770LOW
DeepSeek: DeepSeek V3.1
deepseek/deepseek-chat-v3.1
164K$0.210$0.790LOW
DeepSeek: DeepSeek V3
deepseek/deepseek-chat
131K$0.200$0.800MED
DeepSeek: DeepSeek V3.1 Terminus
deepseek/deepseek-v3.1-terminus
164K$0.270$0.950MED
DeepSeek: DeepSeek V4 Pro
deepseek/deepseek-v4-pro
1.0M$0.435$0.870MED
DeepSeek: R1 Distill Llama 70B
deepseek/deepseek-r1-distill-llama-70b
131K$0.700$0.800MED
DeepSeek: R1 0528
deepseek/deepseek-r1-0528
164K$0.500$2.15MED
DeepSeek: R1
deepseek/deepseek-r1
164K$0.700$2.50MED
Provider

Mistral

Models
19
Pricing table for Mistral models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Mistral: Mistral Nemo
mistralai/mistral-nemo
131K$0.020$0.030LOW
Mistral: Mistral Small 3
mistralai/mistral-small-24b-instruct-2501
33K$0.050$0.080LOW
Mistral: Ministral 3 3B 2512
mistralai/ministral-3b-2512
131K$0.100$0.100LOW
Mistral: Mistral Small 3.2 24B
mistralai/mistral-small-3.2-24b-instruct
128K$0.075$0.200LOW
Mistral: Ministral 3 8B 2512
mistralai/ministral-8b-2512
262K$0.150$0.150LOW
Mistral: Ministral 3 14B 2512
mistralai/ministral-14b-2512
262K$0.200$0.200LOW
Mistral: Voxtral Small 24B 2507
mistralai/voxtral-small-24b-2507
32K$0.100$0.300LOW
Mistral: Mistral Small 4
mistralai/mistral-small-2603
262K$0.150$0.600LOW
Mistral: Saba
mistralai/mistral-saba
33K$0.200$0.600LOW
Mistral: Mistral Small 3.1 24B
mistralai/mistral-small-3.1-24b-instruct
128K$0.351$0.555LOW
Mistral: Codestral 2508
mistralai/codestral-2508
256K$0.300$0.900MED
Mistral: Mistral Large 3 2512
mistralai/mistral-large-2512
262K$0.500$1.50MED
Mistral: Devstral 2 2512
mistralai/devstral-2512
262K$0.400$2.00MED
Mistral: Mistral Medium 3.1
mistralai/mistral-medium-3.1
131K$0.400$2.00MED
Mistral: Mistral Medium 3
mistralai/mistral-medium-3
131K$0.400$2.00MED
Mistral Large 2407
mistralai/mistral-large-2407
131K$2.00$6.00MED
Mistral: Mixtral 8x22B Instruct
mistralai/mixtral-8x22b-instruct
66K$2.00$6.00MED
Mistral Large
mistralai/mistral-large
128K$2.00$6.00MED
Mistral: Mistral Medium 3.5
mistralai/mistral-medium-3-5
262K$1.50$7.50MED
Provider

Cohere

Models
4
Pricing table for Cohere models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Cohere: Command R7B (12-2024)
cohere/command-r7b-12-2024
128K$0.037$0.150LOW
Cohere: Command R (08-2024)
cohere/command-r-08-2024
128K$0.150$0.600LOW
Cohere: Command A
cohere/command-a
256K$2.50$10HIGH
Cohere: Command R+ (08-2024)
cohere/command-r-plus-08-2024
128K$2.50$10HIGH
Provider

MoonshotAI

Models
5
Pricing table for MoonshotAI models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
MoonshotAI: Kimi K2.5
moonshotai/kimi-k2.5
262K$0.400$1.90MED
MoonshotAI: Kimi K2 0711
moonshotai/kimi-k2
131K$0.570$2.30MED
MoonshotAI: Kimi K2 Thinking
moonshotai/kimi-k2-thinking
262K$0.600$2.50MED
MoonshotAI: Kimi K2 0905
moonshotai/kimi-k2-0905
262K$0.600$2.50MED
MoonshotAI: Kimi K2.6
moonshotai/kimi-k2.6
262K$0.684$3.42MED
Provider

ByteDance

Models
1
Pricing table for ByteDance models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
ByteDance: UI-TARS 7B
bytedance/ui-tars-1.5-7b
128K$0.100$0.200LOW
Provider

DeepCogito

Models
1
Pricing table for DeepCogito models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Deep Cogito: Cogito v2.1 671B
deepcogito/cogito-v2.1-671b
128K$1.25$1.25MED
Provider

Baidu

Models
2
Pricing table for Baidu models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Baidu: ERNIE 4.5 VL 28B A3B
baidu/ernie-4.5-vl-28b-a3b
131K$0.140$0.560LOW
Baidu: ERNIE 4.5 VL 424B A47B
baidu/ernie-4.5-vl-424b-a47b
131K$0.420$1.25MED
Provider

Z-AI

Models
12
Pricing table for Z-AI models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Z.ai: GLM 4 32B
z-ai/glm-4-32b
128K$0.100$0.100LOW
Z.ai: GLM 4.7 Flash
z-ai/glm-4.7-flash
203K$0.060$0.400LOW
Z.ai: GLM 4.5 Air
z-ai/glm-4.5-air
131K$0.125$0.850LOW
Z.ai: GLM 4.6V
z-ai/glm-4.6v
131K$0.300$0.900MED
Z.ai: GLM 4.7
z-ai/glm-4.7
203K$0.400$1.75MED
Z.ai: GLM 4.6
z-ai/glm-4.6
203K$0.430$1.74MED
Z.ai: GLM 4.5V
z-ai/glm-4.5v
66K$0.600$1.80MED
Z.ai: GLM 5
z-ai/glm-5
203K$0.600$1.92MED
Z.ai: GLM 4.5
z-ai/glm-4.5
131K$0.600$2.20MED
Z.ai: GLM 5.1
z-ai/glm-5.1
203K$0.980$3.08MED
Z.ai: GLM 5V Turbo
z-ai/glm-5v-turbo
203K$1.20$4.00MED
Z.ai: GLM 5 Turbo
z-ai/glm-5-turbo
203K$1.20$4.00MED
Provider

Tencent

Models
2
Pricing table for Tencent models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Tencent: Hy3 preview
tencent/hy3-preview
262K$0.063$0.210LOW
Tencent: Hunyuan A13B Instruct
tencent/hunyuan-a13b-instruct
131K$0.140$0.570LOW
Provider

MiniMax

Models
8
Pricing table for MiniMax models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
MiniMax: MiniMax M2.1
minimax/minimax-m2.1
205K$0.290$0.950MED
MiniMax: MiniMax M2
minimax/minimax-m2
205K$0.255$1.00MED
MiniMax: MiniMax M2.5
minimax/minimax-m2.5
205K$0.150$1.15MED
MiniMax: MiniMax-01
minimax/minimax-01
1.0M$0.200$1.10MED
MiniMax: MiniMax M2.7
minimax/minimax-m2.7
205K$0.279$1.20MED
MiniMax: MiniMax M3
minimax/minimax-m3
1.0M$0.300$1.20MED
MiniMax: MiniMax M2-her
minimax/minimax-m2-her
66K$0.300$1.20MED
MiniMax: MiniMax M1
minimax/minimax-m1
1.0M$0.400$2.20MED
Provider

Meta-Llama

Models
12
Pricing table for Meta-Llama models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Meta: Llama 3.1 8B Instruct
meta-llama/llama-3.1-8b-instruct
131K$0.020$0.050LOW
Meta: Llama 3 8B Instruct
meta-llama/llama-3-8b-instruct
8K$0.040$0.040LOW
Meta: Llama 3.2 1B Instruct
meta-llama/llama-3.2-1b-instruct
131K$0.027$0.201LOW
Meta: Llama Guard 4 12B
meta-llama/llama-guard-4-12b
164K$0.180$0.180LOW
Meta: Llama 4 Scout
meta-llama/llama-4-scout
10.0M$0.080$0.300LOW
Meta: Llama 3.2 3B Instruct
meta-llama/llama-3.2-3b-instruct
131K$0.051$0.335LOW
Meta: Llama 3.3 70B Instruct
meta-llama/llama-3.3-70b-instruct
131K$0.100$0.320LOW
Meta: Llama 3.2 11B Vision Instruct
meta-llama/llama-3.2-11b-vision-instruct
131K$0.245$0.245LOW
Llama Guard 3 8B
meta-llama/llama-guard-3-8b
131K$0.484$0.030LOW
Meta: Llama 4 Maverick
meta-llama/llama-4-maverick
1.0M$0.150$0.600LOW
Meta: Llama 3.1 70B Instruct
meta-llama/llama-3.1-70b-instruct
131K$0.400$0.400LOW
Meta: Llama 3 70B Instruct
meta-llama/llama-3-70b-instruct
8K$0.510$0.740MED
Provider

Microsoft

Models
3
Pricing table for Microsoft models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Microsoft: Phi 4
microsoft/phi-4
16K$0.065$0.140LOW
Microsoft: Phi 4 Mini Instruct
microsoft/phi-4-mini-instruct
131K$0.080$0.350LOW
WizardLM-2 8x22B
microsoft/wizardlm-2-8x22b
66K$0.620$0.620MED
Provider

NVIDIA

Models
4
Pricing table for NVIDIA models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
NVIDIA: Nemotron Nano 9B V2
nvidia/nemotron-nano-9b-v2
131K$0.040$0.160LOW
NVIDIA: Nemotron 3 Nano 30B A3B
nvidia/nemotron-3-nano-30b-a3b
262K$0.050$0.200LOW
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5
nvidia/llama-3.3-nemotron-super-49b-v1.5
131K$0.100$0.400LOW
NVIDIA: Nemotron 3 Super
nvidia/nemotron-3-super-120b-a12b
1.0M$0.090$0.450LOW
Provider

Perplexity

Models
5
Pricing table for Perplexity models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Perplexity: Sonar
perplexity/sonar
127K$1.00$1.00MED
Perplexity: Sonar Reasoning Pro
perplexity/sonar-reasoning-pro
128K$2.00$8.00HIGH
Perplexity: Sonar Deep Research
perplexity/sonar-deep-research
128K$2.00$8.00HIGH
Perplexity: Sonar Pro Search
perplexity/sonar-pro-search
200K$3.00$15HIGH
Perplexity: Sonar Pro
perplexity/sonar-pro
200K$3.00$15HIGH
Provider

Amazon

Models
5
Pricing table for Amazon models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Amazon: Nova Micro 1.0
amazon/nova-micro-v1
128K$0.035$0.140LOW
Amazon: Nova Lite 1.0
amazon/nova-lite-v1
300K$0.060$0.240LOW
Amazon: Nova 2 Lite
amazon/nova-2-lite-v1
1.0M$0.300$2.50MED
Amazon: Nova Pro 1.0
amazon/nova-pro-v1
300K$0.800$3.20MED
Amazon: Nova Premier 1.0
amazon/nova-premier-v1
1.0M$2.50$13HIGH
Provider

stepfun

Models
2
Pricing table for stepfun models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
StepFun: Step 3.5 Flash
stepfun/step-3.5-flash
262K$0.090$0.300LOW
StepFun: Step 3.7 Flash
stepfun/step-3.7-flash
256K$0.200$1.15MED
Provider

perceptron

Models
1
Pricing table for perceptron models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Perceptron: Perceptron Mk1
perceptron/perceptron-mk1
33K$0.150$1.50MED
Provider

inclusionai

Models
3
Pricing table for inclusionai models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
inclusionAI: Ling-2.6-flash
inclusionai/ling-2.6-flash
262K$0.010$0.030LOW
inclusionAI: Ring-2.6-1T
inclusionai/ring-2.6-1t
262K$0.075$0.625LOW
inclusionAI: Ling-2.6-1T
inclusionai/ling-2.6-1t
262K$0.075$0.625LOW
Provider

ibm-granite

Models
2
Pricing table for ibm-granite models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
IBM: Granite 4.0 Micro
ibm-granite/granite-4.0-h-micro
131K$0.017$0.112LOW
IBM: Granite 4.1 8B
ibm-granite/granite-4.1-8b
131K$0.050$0.100LOW
Provider

~anthropic

Models
3
Pricing table for ~anthropic models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Anthropic Claude Haiku Latest
~anthropic/claude-haiku-latest
200K$1.00$5.00MED
Anthropic Claude Sonnet Latest
~anthropic/claude-sonnet-latest
1.0M$3.00$15HIGH
Anthropic: Claude Opus Latest
~anthropic/claude-opus-latest
1.0M$5.00$25HIGH
Provider

~openai

Models
2
Pricing table for ~openai models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
OpenAI GPT Mini Latest
~openai/gpt-mini-latest
400K$0.750$4.50MED
OpenAI GPT Latest
~openai/gpt-latest
1.1M$5.00$30HIGH
Provider

~google

Models
2
Pricing table for ~google models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Google Gemini Flash Latest
~google/gemini-flash-latest
1.0M$1.50$9.00HIGH
Google Gemini Pro Latest
~google/gemini-pro-latest
1.0M$2.00$12HIGH
Provider

~moonshotai

Models
1
Pricing table for ~moonshotai models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
MoonshotAI Kimi Latest
~moonshotai/kimi-latest
262K$0.684$3.42MED
Provider

xiaomi

Models
3
Pricing table for xiaomi models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Xiaomi: MiMo-V2-Flash
xiaomi/mimo-v2-flash
262K$0.100$0.300LOW
Xiaomi: MiMo-V2.5
xiaomi/mimo-v2.5
1.0M$0.140$0.280LOW
Xiaomi: MiMo-V2.5-Pro
xiaomi/mimo-v2.5-pro
1.0M$0.435$0.870MED
Provider

arcee-ai

Models
6
Pricing table for arcee-ai models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Arcee AI: Trinity Mini
arcee-ai/trinity-mini
131K$0.045$0.150LOW
Arcee AI: Spotlight
arcee-ai/spotlight
131K$0.180$0.180LOW
Arcee AI: Trinity Large Thinking
arcee-ai/trinity-large-thinking
262K$0.220$0.850MED
Arcee AI: Coder Large
arcee-ai/coder-large
33K$0.500$0.800MED
Arcee AI: Virtuoso Large
arcee-ai/virtuoso-large
131K$0.750$1.20MED
Arcee AI: Maestro Reasoning
arcee-ai/maestro-reasoning
131K$0.900$3.30MED
Provider

kwaipilot

Models
1
Pricing table for kwaipilot models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Kwaipilot: KAT-Coder-Pro V2
kwaipilot/kat-coder-pro-v2
256K$0.300$1.20MED
Provider

rekaai

Models
2
Pricing table for rekaai models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Reka Edge
rekaai/reka-edge
16K$0.100$0.100LOW
Reka Flash 3
rekaai/reka-flash-3
66K$0.100$0.200LOW
Provider

bytedance-seed

Models
4
Pricing table for bytedance-seed models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
ByteDance Seed: Seed 1.6 Flash
bytedance-seed/seed-1.6-flash
262K$0.075$0.300LOW
ByteDance Seed: Seed-2.0-Mini
bytedance-seed/seed-2.0-mini
262K$0.100$0.400LOW
ByteDance Seed: Seed-2.0-Lite
bytedance-seed/seed-2.0-lite
262K$0.250$2.00MED
ByteDance Seed: Seed 1.6
bytedance-seed/seed-1.6
262K$0.250$2.00MED
Provider

inception

Models
1
Pricing table for inception models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Inception: Mercury 2
inception/mercury-2
128K$0.250$0.750MED
Provider

liquid

Models
1
Pricing table for liquid models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
LiquidAI: LFM2-24B-A2B
liquid/lfm-2-24b-a2b
128K$0.030$0.120LOW
Provider

aion-labs

Models
4
Pricing table for aion-labs models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
AionLabs: Aion-1.0-Mini
aion-labs/aion-1.0-mini
131K$0.700$1.40MED
AionLabs: Aion-2.0
aion-labs/aion-2.0
131K$0.800$1.60MED
AionLabs: Aion-RP 1.0 (8B)
aion-labs/aion-rp-llama-3.1-8b
33K$0.800$1.60MED
AionLabs: Aion-1.0
aion-labs/aion-1.0
131K$4.00$8.00HIGH
Provider

upstage

Models
1
Pricing table for upstage models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Upstage: Solar Pro 3
upstage/solar-pro-3
128K$0.150$0.600LOW
Provider

writer

Models
1
Pricing table for writer models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Writer: Palmyra X5
writer/palmyra-x5
1.0M$0.600$6.00MED
Provider

relace

Models
2
Pricing table for relace models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Relace: Relace Apply 3
relace/relace-apply-3
256K$0.850$1.25MED
Relace: Relace Search
relace/relace-search
256K$1.00$3.00MED
Provider

nex-agi

Models
1
Pricing table for nex-agi models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Nex AGI: DeepSeek V3.1 Nex N1
nex-agi/deepseek-v3.1-nex-n1
131K$0.135$0.500LOW
Provider

essentialai

Models
1
Pricing table for essentialai models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
EssentialAI: Rnj 1 Instruct
essentialai/rnj-1-instruct
33K$0.150$0.150LOW
Provider

prime-intellect

Models
1
Pricing table for prime-intellect models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Prime Intellect: INTELLECT-3
prime-intellect/intellect-3
131K$0.200$1.10MED
Provider

allenai

Models
1
Pricing table for allenai models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
AllenAI: Olmo 3 32B Think
allenai/olmo-3-32b-think
66K$0.150$0.500LOW
Provider

thedrummer

Models
4
Pricing table for thedrummer models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
TheDrummer: Rocinante 12B
thedrummer/rocinante-12b
33K$0.170$0.430LOW
TheDrummer: UnslopNemo 12B
thedrummer/unslopnemo-12b
33K$0.400$0.400LOW
TheDrummer: Cydonia 24B V4.1
thedrummer/cydonia-24b-v4.1
131K$0.300$0.500LOW
TheDrummer: Skyfall 36B V2
thedrummer/skyfall-36b-v2
33K$0.550$0.800MED
Provider

nousresearch

Models
5
Pricing table for nousresearch models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
NousResearch: Hermes 2 Pro - Llama-3 8B
nousresearch/hermes-2-pro-llama-3-8b
8K$0.140$0.140LOW
Nous: Hermes 4 70B
nousresearch/hermes-4-70b
131K$0.130$0.400LOW
Nous: Hermes 3 70B Instruct
nousresearch/hermes-3-llama-3.1-70b
131K$0.300$0.300LOW
Nous: Hermes 3 405B Instruct
nousresearch/hermes-3-llama-3.1-405b
131K$1.00$1.00MED
Nous: Hermes 4 405B
nousresearch/hermes-4-405b
131K$1.00$3.00MED
Provider

ai21

Models
1
Pricing table for ai21 models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
AI21: Jamba Large 1.7
ai21/jamba-large-1.7
256K$2.00$8.00HIGH
Provider

switchpoint

Models
1
Pricing table for switchpoint models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Switchpoint Router
switchpoint/router
131K$0.850$3.40MED
Provider

morph

Models
2
Pricing table for morph models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Morph: Morph V3 Fast
morph/morph-v3-fast
82K$0.800$1.20MED
Morph: Morph V3 Large
morph/morph-v3-large
262K$0.900$1.90MED
Provider

sao10k

Models
5
Pricing table for sao10k models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Sao10K: Llama 3 8B Lunaris
sao10k/l3-lunaris-8b
8K$0.040$0.050LOW
Sao10K: Llama 3.3 Euryale 70B
sao10k/l3.3-euryale-70b
131K$0.650$0.750MED
Sao10K: Llama 3.1 Euryale 70B v2.2
sao10k/l3.1-euryale-70b
131K$0.850$0.850MED
Sao10k: Llama 3 Euryale 70B v2.1
sao10k/l3-euryale-70b
8K$1.48$1.48MED
Sao10K: Llama 3.1 70B Hanami x1
sao10k/l3.1-70b-hanami-x1
16K$3.00$3.00MED
Provider

anthracite-org

Models
1
Pricing table for anthracite-org models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Magnum v4 72B
anthracite-org/magnum-v4-72b
33K$3.00$5.00MED
Provider

inflection

Models
2
Pricing table for inflection models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Inflection: Inflection 3 Pi
inflection/inflection-3-pi
8K$2.50$10HIGH
Inflection: Inflection 3 Productivity
inflection/inflection-3-productivity
8K$2.50$10HIGH
Provider

mancer

Models
1
Pricing table for mancer models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
Mancer: Weaver (alpha)
mancer/weaver
8K$0.750$1.00MED
Provider

undi95

Models
1
Pricing table for undi95 models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
ReMM SLERP 13B
undi95/remm-slerp-l2-13b
6K$0.450$0.650MED
Provider

gryphe

Models
1
Pricing table for gryphe models showing context length, input cost per million tokens, output cost per million tokens, and pricing tier
ModelContextInput $/MOutput $/MTier
MythoMax 13B
gryphe/mythomax-l2-13b
4K$0.060$0.060LOW