Review list pricing by model group, compare input and output rates, and align teams on how model choice affects cost, context, and infrastructure posture.
Granular usage feedback and budgeting controls empower users to manage their spend.
Compare price with context window, infrastructure type, and model family in one place. Auto supplier error capture and redirect for higher uptime and less agentic flow disruption.
Role based access, GDPR & ISO compliance, multi-provider for resilience.
Public list pricing for currently grouped models.
| Model | Family | Context | Infrastructure | Pricing | Unit | Regions | Status |
|---|---|---|---|---|---|---|---|
|
Kimi K2.6
kimi-k2.6 |
chat | 262K | cloud | $0.9 input / $4.00 output / MTok | MTok | π¨π³ SiliconFlow, πΊπΈ DeepInfra, πΊπΈ OpenRouter | Live |
|
GLM 5.1
glm-5.1 |
chat | 200K | cloud | $1.05 input / $3.5 output / MTok | MTok | πΊπΈ DeepInfra, π¨π³ SiliconFlow | Live |
|
Minimax M2.7
minimax-m2.7 |
chat | 204K | cloud | $0.25 input / $1.00 output / MTok | MTok | πΈπ¬ MiniMax Direct, πΊπΈ OpenRouter | Live |
|
Qwen3 235b
qwen3-235b |
chat | 256K | cloud | $0.09 input / $0.1 output / MTok | MTok | πΊπΈ DeepInfra, π¨π³ SiliconFlow | Live |
|
Hy3 Preview
hy3-preview |
chat | 262K | cloud | $0.066 input / $0.26 output / MTok | MTok | π¨π³ SiliconFlow, πΊπΈ OpenRouter | Live |
|
Deepseek R1 70b
deepseek-r1-70b |
chat | 128K | cloud | $0.56 input / $0.64 output / MTok | MTok | π¨π³ SiliconFlow, πΊπΈ DeepInfra | Live |
|
Deepseek R1 Distill Qwen 7b
deepseek-r1-distill-qwen-7b |
chat | - | cloud | $0.04 input / $0.1 output / MTok | MTok | π¨π³ SiliconFlow, πΊπΈ DeepInfra | Live |
|
Mistral Small 4
mistral-small-4 |
chat | 131K | cloud | $0.1 input / $0.3 output / MTok | MTok | π«π· Mistral AI | Live |
|
GLM 4.7 Flash
glm-4.7-flash |
chat | - | cloud | $0.06 input / $0.4 output / MTok | MTok | πΊπΈ DeepInfra, π¨π³ SiliconFlow | Live |
|
Qwen2.5 Coder 32b Instruct
qwen2.5-coder-32b-instruct |
chat | - | cloud | $0.06 input / $0.15 output / MTok | MTok | πΊπΈ DeepInfra, π¨π³ SiliconFlow | Live |
|
Qwen3 30b A3b
qwen3-30b-a3b |
chat | - | cloud | $0.12 input / $0.5 output / MTok | MTok | πΊπΈ DeepInfra, π¨π³ Alibaba DashScope | Live |
|
Qwen3 VL 30b A3b Instruct
qwen3-vl-30b-a3b-instruct |
chat | - | cloud | $0.15 input / $0.6 output / MTok | MTok | πΊπΈ DeepInfra, π¨π³ SiliconFlow | Live |
|
Qwen3.5 397b A17b
qwen3.5-397b-a17b |
chat | - | cloud | $0.39 input / $2.34 output / MTok | MTok | π¨π³ Alibaba DashScope, π¨π³ SiliconFlow | Live |
|
Codestral
codestral |
chat | 256K | cloud | $0.3 input / $0.9 output / MTok | MTok | π«π· Mistral AI, πΊπΈ OpenRouter | Live |
|
Gemini 2.5 Pro
gemini-2.5-pro |
chat | 1M | cloud | $1.25 input / $10.00 output / MTok | MTok | πΊπΈ Google AI (Gemini), πΊπΈ OpenRouter | Live |
|
Qwen 3 32b
qwen-3-32b |
chat | 40K | cloud | $0.08 input / $0.28 output / MTok | MTok | πΊπΈ DeepInfra, π¨π³ Alibaba DashScope | Live |
|
Qwq 32b
qwq-32b |
chat | 40K | cloud | $0.15 input / $0.4 output / MTok | MTok | πΊπΈ DeepInfra, π¨π³ SiliconFlow | Live |
|
Qwen 2.5 7b
qwen-2.5-7b |
chat | 33K | cloud | $0.04 input / $0.1 output / MTok | MTok | πΊπΈ DeepInfra, π¨π³ Alibaba DashScope | Live |
|
Gemma 3 27b
gemma-3-27b |
chat | 128K | cloud | $0.08 input / $0.16 output / MTok | MTok | πΊπΈ DeepInfra, πΊπΈ OpenRouter | Live |
|
Gemma 4 31b
gemma-4-31b |
chat | 256K | cloud | $0.13 input / $0.38 output / MTok | MTok | πΊπΈ DeepInfra, πΊπΈ OpenRouter | Live |
|
Qwen3.5 Plus
qwen3.5-plus |
chat | 1M | cloud | $0.26 input / $1.56 output / MTok | MTok | π¨π³ Alibaba DashScope, πΊπΈ OpenRouter | Live |
|
GLM 4.5 Air
glm-4.5-air |
chat | 128K | cloud | $0.05 input / $0.1 output / MTok | MTok | π¨π³ SiliconFlow, π¨π³ Z.AI (Zhipu) | Live |
|
Llama 3.1 8b
llama-3.1-8b |
chat | 128K | cloud | $0.016 input / $0.04 output / MTok | MTok | πΊπΈ Groq, πΊπΈ DeepInfra | Live |
|
Llama 3.3 70b
llama-3.3-70b |
chat | 128K | cloud | $0.1 input / $0.32 output / MTok | MTok | πΊπΈ DeepInfra, πΊπΈ Groq | Live |
|
Llama 4 Scout
llama-4-scout |
chat | 320K | cloud | $0.1 input / $0.3 output / MTok | MTok | πΊπΈ DeepInfra, πΊπΈ Groq | Live |
|
Magistral Medium
magistral-medium |
chat | 40K | cloud | $2.00 input / $5.00 output / MTok | MTok | π«π· Mistral AI | Live |
|
Mistral Medium
mistral-medium |
chat | 131K | cloud | $1.5 input / $7.5 output / MTok | MTok | π«π· Mistral AI, πΊπΈ OpenRouter | Live |
|
Qwen 72b
qwen-72b |
chat | 33K | cloud | $0.36 input / $0.4 output / MTok | MTok | πΊπΈ DeepInfra, π¨π³ SiliconFlow, πΊπΈ OpenRouter | Live |
|
Deepseek Chat
deepseek-chat |
chat | 64K | cloud | $0.14 input / $0.28 output / MTok | MTok | π¨π³ DeepSeek, πΊπΈ DeepInfra, πΊπΈ OpenRouter | Live |
|
Claude Fable 5
claude-fable-5 |
chat | 1M | cloud | $10.00 input / $50.00 output / MTok | MTok | πΊπΈ Anthropic Corporate, πΊπΈ OpenRouter | Live |
|
GPT 5.5 Pro
gpt-5.5-pro |
chat | 400K | cloud | $30.00 input / $180.00 output / MTok | MTok | πΊπΈ OpenAI Corporate, πΊπΈ OpenRouter | Live |
|
Claude Opus 4.8
claude-opus-4.8 |
chat | 200K | cloud | $5.00 input / $25.00 output / MTok | MTok | πΊπΈ Anthropic Corporate, πΊπΈ OpenRouter | Live |
|
GPT 5.5
gpt-5.5 |
chat | 400K | cloud | $5.00 input / $30.00 output / MTok | MTok | πΊπΈ OpenAI, πΊπΈ OpenRouter, AWS Bedrock OpenAI, πΊπΈ OpenAI Corporate | Live |
|
Gemini 3.1 Pro Preview
gemini-3.1-pro-preview |
chat | 2M | cloud | $2.00 input / $12.00 output / MTok | MTok | πΊπΈ Google AI (Gemini), πΊπΈ OpenRouter | Live |
|
GPT 5.4
gpt-5.4 |
chat | 1.1M | cloud | $1.75 input / $14.00 output / MTok | MTok | πΊπΈ OpenAI, πΊπΈ OpenAI Corporate, πΊπΈ OpenRouter | Live |
|
Claude Sonnet 4.6
claude-sonnet-4.6 |
chat | 1M | cloud | $3.00 input / $15.00 output / MTok | MTok | πΊπΈ Anthropic Corporate, πΊπΈ OpenRouter | Live |
|
Gemini 3.5 Flash
gemini-3.5-flash |
chat | 1M | cloud | $1.5 input / $9.00 output / MTok | MTok | πΊπΈ Google AI (Gemini), πΊπΈ OpenRouter | Live |
|
Gemini 3.1 Flash Lite
gemini-3.1-flash-lite |
chat | 1M | cloud | $0.25 input / $1.5 output / MTok | MTok | πΊπΈ Google AI (Gemini), πΊπΈ OpenRouter | Live |
Public list pricing for currently grouped models.
| Model | Family | Context | Infrastructure | Pricing | Unit | Regions | Status |
|---|---|---|---|---|---|---|---|
|
Qwen2.5 VL 72b
qwen2.5-vl-72b |
vision | 33K | cloud | $0.2 input / $0.6 output / MTok | MTok | πΊπΈ DeepInfra, πΊπΈ OpenRouter | Live |
|
Mistral Large
mistral-large |
vision | 131K | cloud | $0.5 input / $1.5 output / MTok | MTok | π«π· Mistral AI | Live |
Public list pricing for currently grouped models.
| Model | Family | Context | Infrastructure | Pricing | Unit | Regions | Status |
|---|---|---|---|---|---|---|---|
|
Mistral Ocr
mistral-ocr |
ocr | - | cloud | $2.00 / 1K Pages | 1K Pages | π«π· Mistral AI | Live |
|
Olmocr 2 7b Fp8
olmocr-2-7b-fp8 |
ocr | 33K | cloud | $8.00 / 1K Pages | 1K Pages | OTE Greece, Athens OCR olmOCR, A20 OCR olmOCR | Live |
|
Paddleocr VL 1 6
paddleocr-vl-1-6 |
ocr | 33K | cloud | $6.00 / 1K Pages | 1K Pages | OTE Greece, Athens OCR PaddleOCR-VL, A20 OCR PaddleOCR-VL | Live |
|
Paddleocr V5
paddleocr-v5 |
ocr | - | cloud | $1.00 / 1K Pages | 1K Pages | Athens OCR CPU Utilities, OTE Greece | Live |
|
Docling Granite 258m
docling-granite-258m |
ocr | - | cloud | $3.00 / 1K Pages | 1K Pages | Athens OCR CPU Utilities, OTE Greece | Live |
|
Paddleocr Structure V3
paddleocr-structure-v3 |
ocr | - | cloud | $4.00 / 1K Pages | 1K Pages | Athens OCR CPU Utilities, A20 OCR Utilities | Live |
|
Smoldocling 256m Preview
smoldocling-256m-preview |
ocr | - | cloud | $2.00 / 1K Pages | 1K Pages | Athens OCR CPU Utilities, A20 OCR Utilities | Live |
Public list pricing for currently grouped models.
| Model | Family | Context | Infrastructure | Pricing | Unit | Regions | Status |
|---|---|---|---|---|---|---|---|
|
Docx Native Parser
docx-native-parser |
document_conversion | - | cloud | β / Request | Request | TaaS Gateway | Live |
|
Markitdown Text Preview
markitdown-text-preview |
document_conversion | - | cloud | β / Request | Request | TaaS Gateway | Live |
|
Markitdown Full Preview
markitdown-full-preview |
document_conversion | - | cloud | β / Request | Request | TaaS Gateway | Live |
Public list pricing for currently grouped models.
| Model | Family | Context | Infrastructure | Pricing | Unit | Regions | Status |
|---|---|---|---|---|---|---|---|
|
GPT Image 2
gpt-image-2 |
image | - | cloud | from $0.04 / Image | Image | πΊπΈ OpenAI, πΊπΈ OpenAI Corporate, AWS Bedrock OpenAI, πΊπΈ OpenRouter | Live |
Public list pricing for currently grouped models.
| Model | Family | Context | Infrastructure | Pricing | Unit | Regions | Status |
|---|---|---|---|---|---|---|---|
|
BGE M3
bge-m3 |
embedding | - | sovereign | $0.05 input / MTok | MTok | OTE Greece, CloudSigma, A20 Embeddings BGE-M3 | Live |
Public list pricing for currently grouped models.
| Model | Family | Context | Infrastructure | Pricing | Unit | Regions | Status |
|---|---|---|---|---|---|---|---|
|
BGE Reranker V2 M3
bge-reranker-v2-m3 |
reranker | - | sovereign | $0.02 / 1K Rerank Pairs | 1K Rerank Pairs | OTE Greece, CloudSigma | Live |
Public list pricing for currently grouped models.
| Model | Family | Context | Infrastructure | Pricing | Unit | Regions | Status |
|---|---|---|---|---|---|---|---|
|
Kokoro
kokoro |
tts | - | cloud | $0.01 / 1K Characters | 1K Characters | OTE Greece | Live |
|
F5 TTS
f5-tts |
tts | - | cloud | $0.015 / 1K Characters | 1K Characters | OTE Greece | Live |
Public list pricing for currently grouped models.
| Model | Family | Context | Infrastructure | Pricing | Unit | Regions | Status |
|---|---|---|---|---|---|---|---|
|
Whisper
whisper |
transcription | - | cloud | $0.006 / Audio Minute | Audio Minute | OTE Greece, πΊπΈ Groq, A20 Audio Whisper | Live |
|
Whisper 1
whisper-1 |
transcription | - | cloud | $0.006 / Audio Minute | Audio Minute | OTE Greece, πΊπΈ Groq, A20 Audio Whisper | Live |
Public list pricing for currently grouped models.
| Model | Family | Context | Infrastructure | Pricing | Unit | Regions | Status |
|---|---|---|---|---|---|---|---|
|
Ecapa Tdnn
ecapa-tdnn |
speaker | - | cloud | $0.0015 / Audio Minute | Audio Minute | OTE Greece | Live |
|
Xvector
xvector |
speaker | - | cloud | $0.0015 / Audio Minute | Audio Minute | OTE Greece | Live |
|
Wavlm Base Plus Sv
wavlm-base-plus-sv |
speaker | - | cloud | $0.0015 / Audio Minute | Audio Minute | OTE Greece | Live |
Public list pricing for currently grouped models.
| Model | Family | Context | Infrastructure | Pricing | Unit | Regions | Status |
|---|---|---|---|---|---|---|---|
|
Clap
clap |
audio | - | cloud | $0.002 / Audio Minute | Audio Minute | OTE Greece | Live |
|
Ast
ast |
audio | - | cloud | $0.002 / Audio Minute | Audio Minute | OTE Greece | Live |
|
Mert
mert |
audio | - | cloud | $0.002 / Audio Minute | Audio Minute | OTE Greece | Live |
Use the pricing tables with the public model catalog and API documentation to decide what your team should test, approve, and scale.