Models
Every model, one efficient API
Open-weight language, reasoning, coding, vision, embedding, and speech models, all hosted by us on 100% renewable energy and tuned to run efficiently. One catalog, one key, the lowest energy use that still does the job.
The catalog
Every model, by category
Open-weight models run on 100% renewable energy with industry-leading efficiency, reachable through one OpenAI-compatible API and a single key.
Chat & language
GreenPT-branded open-weight models for everyday chat, reasoning, and writing, tuned for European languages.
- Google Gemma 4
gemma4
Multimodal reasoning with a long context window for documents and rich prompts.
- Vision
- Reasoning
- Long context
- Input
- €0.50
- Output
- €1.50
per 1M tokens
- GPT-OSS
green-r
Advanced reasoning, writing, and multimodal understanding with GreenPT guardrails.
- Reasoning
- Vision
- Input
- €0.35
- Output
- €0.95
per 1M tokens
- Mistral Small 3.2 24B
green-l
Fast multilingual model with Dutch grammar guardrails for European workloads.
- Multilingual
- Functions
- Input
- €0.25
- Output
- €0.80
per 1M tokens
Foundation models
Open-weight foundation models we host ourselves, tuned to run at the lowest energy use for each task.
- Qwen
qwen3.5-397b-a17b
Large mixture-of-experts model for code generation and agentic tasks.
- Code
- Agentic
- Functions
- Input
- €0.70
- Output
- €4.35
per 1M tokens
- OpenAI
gpt-oss-120b
Open-weight 120B model with vision and long-context reasoning.
- Vision
- Reasoning
- Input
- €0.20
- Output
- €0.70
per 1M tokens
- Mistral
mistral-small-3.2-24b-instruct-2506
Efficient instruct model with function calling and vision.
- Functions
- Vision
- Input
- €0.20
- Output
- €0.40
per 1M tokens
- Google
gemma-3-27b-it
Compact multimodal model for general reasoning and instruction-following.
- Vision
- Reasoning
- Input
- €0.30
- Output
- €0.60
per 1M tokens
- Meta
llama-3.3-70b-instruct
Multilingual instruction-following at 70B for broad general use.
- Multilingual
- Input
- €1.10
- Output
- €1.10
per 1M tokens
- Mistral
mistral-medium-3.5-128b
Frontier-class reasoning, coding, and vision with a long context window.
- Reasoning
- Code
- Vision
- Input
- €1.80
- Output
- €9.00
per 1M tokens
Coding
Models tuned for code generation, completion, and agentic developer workflows.
- Qwen
qwen3-coder-30b-a3b-instruct
Code-specialised model for generation and completion across languages.
- Code
- Functions
- Input
- €0.25
- Output
- €0.95
per 1M tokens
- Mistral
devstral-2-123b-instruct-2512
Large coding model for agentic software tasks and tool use.
- Code
- Agentic
- Functions
- Input
- €0.50
- Output
- €2.40
per 1M tokens
Audio & speech
Transcription and speech understanding, multilingual and accurate.
- Mistral
voxtral-small-24b-2507
Audio transcription and speech understanding in one model.
- Audio
- Input
- €0.20
- Output
- €0.45
per 1M tokens
- GreenPT
green-s
Pre-recorded and live speech-to-text for general transcription.
- Audio
- Recorded
- €0.52
- Live
- €0.65
per hour
- GreenPT
green-s-pro
Higher-accuracy transcription with multilingual options.
- Audio
- Multilingual
- Recorded
- €0.52
- Live
- €0.78
per hour
Embeddings & retrieval
Vectors and reranking for semantic search and RAG pipelines.
- Qwen3-Embedding-4B
green-embedding
Multilingual embeddings up to 2560 dimensions for semantic search and RAG.
- Embeddings
- Multilingual
- Price
- €0.20
per 1M tokens
- Qwen3-Reranker-4B
green-rerank
Reorders retrieved documents by true relevance, the last mile of search.
- Reranking
- Price
- €0.12
per 1M tokens
On the way
Coming soon
New open-weight models joining the catalog. Pricing and benchmark scores are provisional and may change at launch.
Coming soon
New open-weight models joining the catalog. Pricing and benchmarks are provisional and may change at launch.
- z-ai
z-ai/glm-5.2
High-intelligence reasoning model with a 1M-token context window.
- Intel
- 51.1
- Coding
- 50.7
- Functions
- Tool Choice
- Reasoning
- Input
- $1.50
- Cache
- $0.38
- Output
- $4.50
per 1M tokens
- minimax
minimax/minimax-m3
Agentic multimodal model with strong tool use and a 1M-token context.
- Intel
- 44.4
- Coding
- 43.4
- Agentic
- 89%
- Functions
- Tool Choice
- Reasoning
- Vision
- Input
- $0.40
- Cache
- $0.10
- Output
- $2.00
per 1M tokens
- deepseek
deepseek/deepseek-v4-pro
Flagship DeepSeek model for coding and agentic tasks with a 1M-token context.
- Intel
- 44.3
- Coding
- 47.5
- Agentic
- 96%
- Functions
- Tool Choice
- Reasoning
- Input
- $1.75
- Cache
- $0.44
- Output
- $3.50
per 1M tokens
- moonshotai
moonshotai/kimi-k2.6
Agentic multimodal model with vision and a 256K-token context.
- Intel
- 42.8
- Coding
- 47.1
- Agentic
- 96%
- Functions
- Tool Choice
- Reasoning
- Vision
- Input
- $1.00
- Cache
- $0.25
- Output
- $4.00
per 1M tokens
- moonshotai
moonshotai/kimi-k2.7-code
Code-focused Kimi variant with vision and a 256K-token context.
- Intel
- 41.9
- Coding
- 45.8
- Functions
- Tool Choice
- Reasoning
- Vision
- Input
- $1.25
- Cache
- $0.31
- Output
- $4.50
per 1M tokens
- deepseek
deepseek/deepseek-v4-flash
Low-cost, high-throughput DeepSeek model with a 1M-token context.
- Intel
- 40.3
- Coding
- 38.7
- Agentic
- 95%
- Functions
- Tool Choice
- Reasoning
- Input
- $0.15
- Cache
- $0.04
- Output
- $0.30
per 1M tokens
Models, in short
How do I choose a model?
Pick by capability and budget. Every model is open-weight and hosted by us, so you can match the smallest model that handles your task and get strong results at the lowest energy use and cost.
Why are these models more efficient?
They are open-weight and run on 100% renewable energy in data centres with a PUE of 1.25 and a WUE of 0.25, well below the industry averages of 1.55 and 1.8. Lighter, quantised models and automatic routing mean each request uses the least compute that still does the job.
How is pricing calculated?
Most models are priced per million input and output tokens; speech models are priced per hour of audio. Prices are listed on each card and in the API docs.
See the full catalog →What are the coming-soon models?
New open-weight models being added to the catalog. Their pricing and benchmark scores are provisional and may change at launch.
How do I call a model?
Through the OpenAI-compatible API: set the base URL and key, then pass the model id. One key covers every model, plus embeddings, reranking, OCR, speech, scraping, and search.
Read the API docs →See the difference
One key for every model.
Start a free 14-day trial, no credit card. Call any model through one OpenAI-compatible API, hosted by us on 100% renewable energy and tuned for the lowest energy use from the first request.
No credit card required.
- 100% Renewable
- PUE 1.25
- Open-weight