Zhipu AI

GLM 5.1

A flagship model for agents and complex coding

GLM-5.1 is Zhipu AI's next-generation flagship text model, with stronger thinking, coding, and agent-task capabilities. It supports long context, context caching, structured output, and function calling, making it suitable for complex coding, tool use, multi-step reasoning, and long-running agent workflows.

Context: 200K
Released: 2026-04
Relays: 34 sites
200K context window · Agents and tool use · Structured output · Coding and long workflows

Zhipu AI Official Pricing

Currency: CNY
Updated: 2026-04-01 00:00 (UTC+08:00)

Input: ¥6 / 1M tokens

Output: ¥24 / 1M tokens

Cache read: ¥1.3 / 1M tokens
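
To make the per-million-token rates above concrete, here is a minimal sketch of a cost estimate for a single request. It assumes that input tokens served from the context cache are billed at the cache read rate instead of the input rate; that billing rule is an assumption, not something this page states. The function name `estimate_cost_cny` and its parameters are hypothetical.

```python
from decimal import Decimal

# Official list prices from this page, in CNY per 1M tokens.
PRICE_INPUT = Decimal("6")
PRICE_OUTPUT = Decimal("24")
PRICE_CACHE_READ = Decimal("1.3")
TOKENS_PER_UNIT = Decimal("1000000")


def estimate_cost_cny(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> Decimal:
    """Estimate the cost of one request in CNY.

    Assumption: `cached_tokens` is the part of `input_tokens` read from the
    context cache, and it is billed at the cache read rate instead of the
    input rate.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")

    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * PRICE_INPUT
        + cached_tokens * PRICE_CACHE_READ
        + output_tokens * PRICE_OUTPUT
    )
    return total / TOKENS_PER_UNIT


if __name__ == "__main__":
    # 100K input tokens (60K of them cached) and 20K output tokens.
    print(estimate_cost_cny(100_000, 20_000, cached_tokens=60_000))
```

Decimal arithmetic is used so that fractional rates such as ¥1.3 do not pick up binary floating-point rounding error.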

Relay Comparison

Compare token, per-request, or per-second pricing by relay channel.

How should GLM 5.1 relay pricing be compared?

This GLM 5.1 pricing page compares official pricing with public prices from 34 listed AI gateways. Token prices are shown in CNY per 1M tokens, while per-request, per-second, and per-character rows use the unit shown in the table. Last updated: 06/12/2026, 23:12.

Data sources: Public price catalogs, official pricing records, and monitoring results.
Metric definitions: Uptime is the share of probes that return a successful response. Fake rate flags the risk of a model mismatch or abnormal output. Latency is the average API response time.
Risk note: Relay gateways are third-party services. Pricing, billing, privacy, and stability can change; start with a small top-up and verify reliability before continued use.
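
The metric definitions above can be sketched as a small aggregation over probe results. This is an illustration under stated assumptions, not the monitoring code behind this page: the `Probe` record and its fields are hypothetical, fake rate is assumed to be the share of successful probes flagged as suspect, and latency is assumed to be averaged over successful probes only.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Probe:
    """One monitoring probe against a relay (hypothetical record layout)."""
    ok: bool                     # True if the relay returned a successful response
    latency_ms: Optional[float]  # response time; None when the probe failed
    suspect: bool = False        # True if the output suggests a model mismatch


def summarize(probes: list[Probe]) -> dict:
    """Aggregate probes into uptime, fake rate, and average latency."""
    if not probes:
        raise ValueError("at least one probe is required")

    successful = [p for p in probes if p.ok]
    latencies = [p.latency_ms for p in successful if p.latency_ms is not None]

    return {
        # Successful probe response rate.
        "uptime": len(successful) / len(probes),
        # Assumption: measured only over probes that returned output.
        "fake_rate": (sum(p.suspect for p in successful) / len(successful)) if successful else 0.0,
        # Assumption: failed probes do not contribute to the average.
        "avg_latency_ms": (sum(latencies) / len(latencies)) if latencies else None,
    }


if __name__ == "__main__":
    sample = [
        Probe(ok=True, latency_ms=800.0),
        Probe(ok=True, latency_ms=1200.0, suspect=True),
        Probe(ok=False, latency_ms=None),
        Probe(ok=True, latency_ms=1000.0),
    ]
    print(summarize(sample))
```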