Available Models

Explore all available LLM models with pricing and capabilities. This page is dynamically generated from our model database.

Pricing Note: Prices shown are per million tokens. Actual costs may vary based on your plan and usage. Use GET /chat/v1/models for programmatic access to the current model list.

OpenAI

Model Input ($/1M) Output ($/1M) Context Max Output Category
gpt-3.5-turbo $0.50 $1.50 - - chat
gpt-4-turbo $10.00 $30.00 - - chat
gpt-4.1 $2.00 $8.00 - - chat
gpt-4.1-mini $0.40 $1.60 - - chat
gpt-4.1-nano $0.10 $0.40 - - chat
gpt-4o $2.50 $10.00 - - chat
gpt-4o-mini $0.15 $0.60 - - chat
gpt-4o-mini-realtime-preview $0.60 $2.40 - - realtime
gpt-4o-mini-search-preview $0.15 $0.60 - - chat
gpt-4o-mini-transcribe $3.00 - 448 448 stt
gpt-4o-search-preview $2.50 $10.00 - - chat
gpt-4o-transcribe $6.00 - 448 448 stt
gpt-4o-transcribe-diarize $6.00 - 448 448 stt
gpt-5 $1.25 $10.00 - - chat
gpt-5-chat-latest $1.25 $10.00 - - chat
gpt-5-codex $1.25 $10.00 - - code
gpt-5-mini $0.25 $2.00 - - chat
gpt-5-nano $0.05 $0.40 - - chat
gpt-5-search-api $1.25 $10.00 - - chat
gpt-image-1 $40.00 - - - image
gpt-image-1-hd $80.00 - - - image
gpt-image-1.5 $34.00 - - - image
gpt-image-1.5-hd $134.00 - - - image
gpt-realtime-mini $0.60 $2.40 - - realtime
tts-1 $15.00 - - - tts
tts-1-hd $30.00 - - - tts
gpt-4o-2024-05-13 Disabled $5.00 $15.00 - - chat
gpt-4o-realtime-preview Disabled $5.00 $20.00 - - realtime
gpt-5-pro Disabled $15.00 $120.00 - - chat
gpt-realtime Disabled $4.00 $16.00 - - realtime

Anthropic

Model Input ($/1M) Output ($/1M) Context Max Output Category
claude-3-5-sonnet-20241022 $3.00 $15.00 - - chat
claude-3-7-sonnet $3.00 $15.00 - - chat
claude-3-haiku-20240307 $0.25 $1.25 - - chat
claude-3-opus-20240229 $15.00 $75.00 - - chat
claude-haiku-3-5 $0.80 $4.00 - - chat
claude-haiku-4-5 $1.00 $5.00 - - chat
claude-haiku-4-5-20250514 $1.00 $5.00 - - chat
claude-haiku-4-5-20251001 $1.00 $5.00 - - chat
claude-opus-4 $15.00 $75.00 - - chat
claude-opus-4-1 $15.00 $75.00 - - chat
claude-opus-4-5 $5.00 $25.00 - - chat
claude-opus-4-5-20251101 $5.00 $25.00 - - chat
claude-sonnet-3-7 $3.00 $15.00 - - chat
claude-sonnet-4 $3.00 $15.00 - - chat
claude-sonnet-4-5 $3.00 $15.00 - - chat
claude-3-5-haiku-20241022 Disabled $1.00 $5.00 - - chat

Google Gemini

Model Input ($/1M) Output ($/1M) Context Max Output Category
gemini-1.5-flash $0.07 $0.30 1,048,576 8,192 chat
gemini-1.5-pro $1.25 $5.00 2,097,152 8,192 chat
gemini-2.0-flash $0.10 $0.40 1,048,576 65,536 chat
gemini-2.0-flash-lite $0.07 $0.30 1,048,576 65,536 chat
gemini-2.5-flash $0.30 $2.50 1,048,576 65,536 chat
gemini-2.5-flash-image - $30.00 - - image
gemini-2.5-flash-lite $0.10 $0.40 1,048,576 65,536 chat
gemini-2.5-pro $1.25 $10.00 1,048,576 65,536 chat
gemini-3-pro-image-preview $2.00 $134.00 - - image
gemini-3-pro-preview $2.00 $12.00 1,048,576 65,536 chat
imagen-3.0-generate-002 $20.00 - - - image
tts-neural2 $16.00 - - - tts
tts-standard $4.00 - - - tts
tts-studio $160.00 - - - tts
tts-wavenet $16.00 - - - tts

Groq

Model Input ($/1M) Output ($/1M) Context Max Output Category
allam-2-7b $0.05 $0.08 4,096 4,096 chat
canopylabs/orpheus-v1-english - $22.00 200 200 tts
groq/compound $0.20 $0.80 131,072 40,960 chat
groq/compound-mini $0.10 $0.40 131,072 40,960 chat
llama-3.1-8b-instant $0.05 $0.08 131,072 131,072 chat
llama-3.3-70b-versatile $0.59 $0.79 131,072 32,768 chat
meta-llama/llama-4-maverick-17b-128e-instruct $0.20 $0.60 131,072 8,192 chat
meta-llama/llama-4-scout-17b-16e-instruct $0.11 $0.34 131,072 8,192 chat
meta-llama/llama-guard-4-12b $0.20 $0.20 131,072 1,024 guard
meta-llama/llama-prompt-guard-2-22m $0.03 $0.03 512 512 prompt_guard
meta-llama/llama-prompt-guard-2-86m $0.04 $0.04 512 512 prompt_guard
moonshotai/kimi-k2-instruct $1.00 $3.00 131,072 16,384 chat
moonshotai/kimi-k2-instruct-0905 $1.00 $3.00 262,144 16,384 chat
openai/gpt-oss-120b $0.15 $0.60 131,072 65,536 chat
openai/gpt-oss-20b $0.07 $0.30 131,072 65,536 chat
openai/gpt-oss-safeguard-20b $0.07 $0.30 131,072 65,536 guard
qwen/qwen3-32b $0.29 $0.59 131,072 40,960 chat
whisper-large-v3 $8.22 - 448 448 stt
whisper-large-v3-turbo $2.96 - 448 448 stt
playai-tts Disabled - $50.00 8,192 8,192 tts
playai-tts-arabic Disabled - $50.00 8,192 8,192 tts

xAI

Model Input ($/1M) Output ($/1M) Context Max Output Category
grok-2-image-1212 $70.00 - - - image
grok-3-mini $0.30 $0.50 131,072 32,768 chat
grok-4-1-fast-non-reasoning $0.20 $0.50 2,097,152 131,072 chat
grok-4-1-fast-reasoning $0.20 $0.50 2,097,152 131,072 chat
grok-4-fast-non-reasoning $0.20 $0.50 131,072 32,768 chat
grok-4-fast-reasoning $0.20 $0.50 131,072 32,768 chat
grok-code-fast-1 $0.20 $1.50 262,144 65,536 chat
grok-2-vision-1212 Disabled $2.00 $10.00 32,768 8,192 vision
grok-3 Disabled $3.00 $15.00 131,072 32,768 chat
grok-4-0709 Disabled $3.00 $15.00 2,097,152 131,072 chat

Stability

Model Input ($/1M) Output ($/1M) Context Max Output Category
sd3-large $65.00 - - - image
sd3-medium $35.00 - - - image

Elevenlabs

Model Input ($/1M) Output ($/1M) Context Max Output Category
eleven_monolingual_v1 $300.00 - - - tts
eleven_multilingual_v2 $300.00 - - - tts
eleven_turbo_v2 $150.00 - - - tts

Programmatic Access

To get the current list of available models programmatically, use the /models endpoint:

curl https://api.demeterics.com/chat/v1/models \
  -H "Authorization: Bearer dmt_your_api_key"

For provider-specific endpoints:

  • GET /groq/v1/models - Groq models
  • GET /openai/v1/models - OpenAI models
  • GET /google/v1/models - Google Gemini models
  • GET /openrouter/v1/models - OpenRouter models (300+)
  • GET /grok/v1/models - xAI Grok models

See the API Reference for complete endpoint documentation.