W&B hosted models
Prices per 1M tokens
Aug 2025
$0.15 input / $0.60 output
131K
Efficient Mixture-of-Experts model designed for reasoning-intensive, agentic, and general-purpose use cases.
Aug 2025
$0.05 input / $0.20 output
131K
Lower-latency Mixture-of-Experts model with reasoning capabilities, trained on OpenAI's Harmony response format.
Jul 2025
$1.00 input / $1.50 output
262K
Mixture-of-Experts model optimized for agentic coding tasks such as function calling, tool use, and long-context reasoning.
Jul 2025
$0.10 input / $0.10 output
262K
Efficient, multilingual, instruction-tuned Mixture-of-Experts model optimized for logical reasoning.
Jul 2025
$0.10 input / $0.10 output
262K
High-performance Mixture-of-Experts model optimized for structured reasoning, math, and long-form generation.
Aug 2025
$0.55 input / $1.65 output
128K
A large hybrid model that supports both thinking and non-thinking modes via prompt templates.
Jul 2024
$0.22 input / $0.22 output
128K
Efficient conversational model optimized for responsive multilingual chatbot interactions.
Mar 2025
$1.14 input / $2.75 output
161K
Robust Mixture-of-Experts model tailored for high-complexity language processing and comprehensive document analysis.
Dec 2024
$0.71 input / $0.71 output
128K
Multilingual model excelling in conversational tasks, detailed instruction-following, and coding.
May 2025
$1.35 input / $5.40 output
161K
Model optimized for precise reasoning tasks, including complex coding, math, and structured document analysis.
Jul 2025
$1.35 input / $4.00 output
128K
Mixture-of-Experts model optimized for complex tool use, reasoning, and code synthesis.
Apr 2025
$0.17 input / $0.66 output
64K
Multimodal model integrating text and image understanding, ideal for visual tasks and combined text-and-image analysis.
Feb 2025
$0.08 input / $0.35 output
128K
Compact, efficient model ideal for fast responses in resource-constrained environments.
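Because every price above is quoted per 1M tokens, the cost of a single request can be estimated by scaling each token count by its rate. The following is a minimal sketch of that arithmetic, not an official billing formula: the function name `request_cost_usd` and the token counts are hypothetical, and the price pair is the first one listed ($0.15 input / $0.60 output). Actual billing may round or meter tokens differently.

```python
def request_cost_usd(input_tokens, output_tokens,
                     input_price_per_m, output_price_per_m):
    """Estimate the USD cost of one request from per-1M-token prices."""
    tokens_per_unit = 1_000_000  # prices are quoted per 1M tokens
    input_cost = input_tokens / tokens_per_unit * input_price_per_m
    output_cost = output_tokens / tokens_per_unit * output_price_per_m
    return input_cost + output_cost


# Hypothetical request: 200,000 input tokens and 50,000 output tokens,
# priced at $0.15 input / $0.60 output per 1M tokens.
cost = request_cost_usd(200_000, 50_000, 0.15, 0.60)
print(f"${cost:.4f}")
```

Input and output are billed at separate rates, so a generation-heavy workload can cost noticeably more than a prompt-heavy one on a model whose output price is several times its input price.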