Free LLM Models Token API
These models offer completely free usage with no token costs for both input and output via OpenRouter API. Perfect for experimentation, development, and testing.
Available at no cost
Offering free models
Auto-refreshes every 12 hours
All models listed below have zero cost for both input and output tokens
| Provider | Model | Context | Pricing |
|---|---|---|---|
Google: Gemini 2.0 Flash Experimental (free) google/gemini-2.0-flash-exp:free | 1.0M | FREE | |
Qwen: Qwen3 Coder 480B A35B (free) qwen/qwen3-coder:free | 262K | FREE | |
MiniMax | MiniMax: MiniMax M2 (free) minimax/minimax-m2:free | 205K | FREE |
DeepSeek: R1 0528 (free) deepseek/deepseek-r1-0528:free | 164K | FREE | |
DeepSeek: DeepSeek V3 0324 (free) deepseek/deepseek-chat-v3-0324:free | 164K | FREE | |
DeepSeek: R1 (free) deepseek/deepseek-r1:free | 164K | FREE | |
tngtech | TNG: DeepSeek R1T2 Chimera (free) tngtech/deepseek-r1t2-chimera:free | 164K | FREE |
tngtech | TNG: DeepSeek R1T Chimera (free) tngtech/deepseek-r1t-chimera:free | 164K | FREE |
Microsoft: MAI DS R1 (free) microsoft/mai-ds-r1:free | 164K | FREE | |
DeepSeek: DeepSeek V3.1 (free) deepseek/deepseek-chat-v3.1:free | 164K | FREE | |
OpenAI: gpt-oss-20b (free) openai/gpt-oss-20b:free | 131K | FREE | |
Google: Gemma 3 27B (free) google/gemma-3-27b-it:free | 131K | FREE | |
Z.AI: GLM 4.5 Air (free) z-ai/glm-4.5-air:free | 131K | FREE | |
DeepSeek: DeepSeek R1 0528 Qwen3 8B (free) deepseek/deepseek-r1-0528-qwen3-8b:free | 131K | FREE | |
Tongyi DeepResearch 30B A3B (free) alibaba/tongyi-deepresearch-30b-a3b:free | 131K | FREE | |
meituan | Meituan: LongCat Flash Chat (free) meituan/longcat-flash-chat:free | 131K | FREE |
MoonshotAI: Kimi Dev 72B (free) moonshotai/kimi-dev-72b:free | 131K | FREE | |
nousresearch | Nous: DeepHermes 3 Llama 3 8B Preview (free) nousresearch/deephermes-3-llama-3-8b-preview:free | 131K | FREE |
nousresearch | Nous: Hermes 3 405B Instruct (free) nousresearch/hermes-3-llama-3.1-405b:free | 131K | FREE |
Mistral: Mistral Small 3.2 24B (free) mistralai/mistral-small-3.2-24b-instruct:free | 131K | FREE | |
Mistral: Mistral Nemo (free) mistralai/mistral-nemo:free | 131K | FREE | |
Meta: Llama 3.3 70B Instruct (free) meta-llama/llama-3.3-70b-instruct:free | 131K | FREE | |
Meta: Llama 3.2 3B Instruct (free) meta-llama/llama-3.2-3b-instruct:free | 131K | FREE | |
Andromeda Alpha openrouter/andromeda-alpha | 128K | FREE | |
NVIDIA: Nemotron Nano 9B V2 (free) nvidia/nemotron-nano-9b-v2:free | 128K | FREE | |
Meta: Llama 3.3 8B Instruct (free) meta-llama/llama-3.3-8b-instruct:free | 128K | FREE | |
Meta: Llama 4 Maverick (free) meta-llama/llama-4-maverick:free | 128K | FREE | |
Meta: Llama 4 Scout (free) meta-llama/llama-4-scout:free | 128K | FREE | |
Mistral: Mistral Small 3.1 24B (free) mistralai/mistral-small-3.1-24b-instruct:free | 96K | FREE | |
agentica-org | Agentica: Deepcoder 14B Preview (free) agentica-org/deepcoder-14b-preview:free | 96K | FREE |
Qwen: Qwen3 4B (free) qwen/qwen3-4b:free | 41K | FREE | |
Qwen: Qwen3 30B A3B (free) qwen/qwen3-30b-a3b:free | 41K | FREE | |
Qwen: Qwen3 8B (free) qwen/qwen3-8b:free | 41K | FREE | |
Qwen: Qwen3 14B (free) qwen/qwen3-14b:free | 41K | FREE | |
Qwen: Qwen3 235B A22B (free) qwen/qwen3-235b-a22b:free | 41K | FREE | |
Qwen2.5 Coder 32B Instruct (free) qwen/qwen-2.5-coder-32b-instruct:free | 33K | FREE | |
Qwen2.5 72B Instruct (free) qwen/qwen-2.5-72b-instruct:free | 33K | FREE | |
Google: Gemma 3 4B (free) google/gemma-3-4b-it:free | 33K | FREE | |
Google: Gemma 3 12B (free) google/gemma-3-12b-it:free | 33K | FREE | |
MoonshotAI: Kimi K2 0711 (free) moonshotai/kimi-k2:free | 33K | FREE | |
Mistral: Devstral Small 2505 (free) mistralai/devstral-small-2505:free | 33K | FREE | |
Mistral: Mistral Small 3 (free) mistralai/mistral-small-24b-instruct-2501:free | 33K | FREE | |
Mistral: Mistral 7B Instruct (free) mistralai/mistral-7b-instruct:free | 33K | FREE | |
cognitivecomputations | Venice: Uncensored (free) cognitivecomputations/dolphin-mistral-24b-venice-edition:free | 33K | FREE |
cognitivecomputations | Dolphin3.0 Mistral 24B (free) cognitivecomputations/dolphin3.0-mistral-24b:free | 33K | FREE |
Tencent | Tencent: Hunyuan A13B Instruct (free) tencent/hunyuan-a13b-instruct:free | 33K | FREE |
shisa-ai | Shisa AI: Shisa V2 Llama 3.3 70B (free) shisa-ai/shisa-v2-llama3.3-70b:free | 33K | FREE |
arliai | ArliAI: QwQ 32B RpR v1 (free) arliai/qwq-32b-arliai-rpr-v1:free | 33K | FREE |
Qwen: Qwen2.5 VL 32B Instruct (free) qwen/qwen2.5-vl-32b-instruct:free | 16K | FREE | |
Google: Gemma 3n 2B (free) google/gemma-3n-e2b-it:free | 8K | FREE | |
Google: Gemma 3n 4B (free) google/gemma-3n-e4b-it:free | 8K | FREE | |
Google: Gemma 2 9B (free) google/gemma-2-9b-it:free | 8K | FREE | |
DeepSeek: R1 Distill Llama 70B (free) deepseek/deepseek-r1-distill-llama-70b:free | 8K | FREE |
• Free Tier Limitations: While these models have zero token costs, they may have rate limits, usage quotas, or require API keys from the respective providers.
• Availability: Free pricing may change at any time. Always verify current pricing with the provider before production use.
• Performance: Free models may have different performance characteristics compared to paid alternatives.
• Updates: This list is automatically updated every 12 hours from the OpenRouter API.