Moonshot AI Kimi K2.5

Moonshot AI Kimi K2.5 inference overview

Price per 1M tokens

$0.50 (input)

$2.85 (output)

Parameters

32B (active)

1T (total)

Context Window

262K

Release Date

Jan 2026

Moonshot AI Kimi K2.5 inference details

Kimi K2.5 is a multimodal Mixture-of-Experts language model featuring 32 billion activated parameters and a total of 1 trillion parameters. It has been optimized for agentic use cases and excels at reasoning and coding tasks.

Created by:

Moonshot AI

License:

other

Model card:

Kimi-K2.5

				
					import openai
import weave

# Weave autopatches OpenAI to log LLM calls to W&B
weave.init("<team>/<project>")

client = openai.OpenAI(
    # The custom base URL points to W&B Inference
    base_url='https://api.inference.wandb.ai/v1',

    # Get your API key from https://wandb.ai/authorize
    # Consider setting it in the environment as OPENAI_API_KEY instead for safety
    api_key="<your-apikey>",

    # Optional: Team and project for usage tracking
    project="<team>/<project>",
)

response = client.chat.completions.create(
    model="moonshotai/Kimi-K2.5",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Tell me a joke."}
    ],
)

print(response.choices[0].message.content)

Moonshot AI Kimi K2.5 resources

Guide

Kimi K2 for code generation tutorial

Course

AI engineering course: Agents

Guide

Moonshot AI Kimi K2.5 inference overview

Price per 1M tokens

Parameters

Context Window

Release Date

Moonshot AI Kimi K2.5 inference details

Moonshot AI Kimi K2.5 resources

The Platform

Article

Resources

Company

Use cases

Industries

Learn more

Moonshot AI Kimi K2.5 inference overview

Price per 1M tokens

Parameters

Context Window

Release Date

Moonshot AI Kimi K2.5 inference details

Moonshot AI Kimi K2.5 resources

The Platform

Article

Resources

Company

Use cases

Industries