Z.ai icon

Z.ai GLM-5.2

GLM-5.2

Use Z.ai's new flagship coding model across all GLM Coding Plan tiers, including Lite, Pro, Max, and Team plans, with 1M-context support for large codebases and long agent sessions Configure Claude Code with `glm-5.2[1m]` plus `CLAUDE_CODE_AUTO_COMPACT_WINDOW=1000000`; configure OpenClaw/Cline-style tools with `glm-5.2`, 1M context settings, and 131K max output where the tool supports it. Reserve GLM-5.2 for complex work because Z.ai treats it as a premium Opus-level model with higher quota multipliers, while GLM-4.7 remains the recommended option for routine tasks

Reviewed by ToolWorthy Editors·updated today·GLM-5.2 released yesterday

Pricing:From $18/mo
Categories:
Jump to section
No media available

Featured alternatives

TypingMind icon

TypingMind

Doubao icon

Doubao

Happycapy icon

Happycapy

Patchrooms icon

Patchrooms

Odysseus icon

Odysseus

Botpress icon

Botpress

Pros & Cons

Pros

  • Brings GLM's documented coding model path to a 1M-token context window for long repository and agent sessions
  • Available across Lite, Pro, Max, and Team Coding Plan tiers rather than limited to only top-tier users
  • Works with common coding-agent tools that support Z.ai's Coding Plan or custom OpenAI-compatible model configuration
  • Exposes max-effort behavior for deeper reasoning in Claude Code workflows
  • Gives OpenClaw users a documented manual configuration path

Cons

  • Current official docs focus on setup and quota rules, not a full benchmark report for GLM-5.2
  • API, chatbot, and open-source release channels were announced as planned but should not be treated as live until official docs confirm them
  • Premium quota multipliers can make GLM-5.2 expensive to use for routine tasks inside a fixed plan
  • 1M-context mode requires the correct [1m] suffix and tool settings, so misconfiguration can hide the main upgrade
  • Unsupported third-party usage may violate Coding Plan restrictions

Overview

GLM-5.2 is Z.ai's new flagship coding model for the GLM Coding Plan, announced on June 13, 2026 and rolled out to Lite, Pro, Max, and Team users. It is positioned as the upgrade path for complex coding-agent work: large repositories, long-running sessions, deeper reasoning, and workflows that need a usable 1M-token context window rather than the 200K-class context used by earlier GLM-5.x models.

The practical change is not just a model name. Z.ai documents glm-5.2[1m] specifically for Claude Code 1M-context setup; OpenClaw and Cline-style custom providers use glm-5.2 with the documented 1M context-window setting where supported. The [1m] suffix enables the 1M-context variant when paired with the CLAUDE_CODE_AUTO_COMPACT_WINDOW=1000000 setting in Claude Code. For teams already using Z.ai's Coding Plan inside supported coding tools, GLM-5.2 is the version to test on the hardest agentic engineering tasks.

This release should still be treated carefully. Z.ai's current documentation emphasizes availability, setup, context length, effort mapping, and quota rules; it does not publish a full GLM-5.2 benchmark table in the docs reviewed here. API, chatbot, and open-source availability were announced as planned for the week after the June 13, 2026 release, so this page avoids treating those channels as already live until Z.ai publishes the corresponding docs or repository.

What's New

GLM-5.2 for All Coding Plan Tiers

The GLM Coding Plan now lists GLM-5.2 as supported across Lite, Pro, Max, and Team plans. That is a wider rollout than an invite-only or Max-only release: existing Coding Plan users can switch their supported coding agent to GLM-5.2 without waiting for a separate enterprise enablement path.

Z.ai describes GLM-5.2 as an advanced model comparable to Claude Opus-class usage inside the Coding Plan. That makes it a better fit for complex tasks than routine autocomplete or small bug fixes, where the documentation still recommends GLM-4.7 to preserve quota.

1M-Context Mode

GLM-5.2 adds a documented 1M-context path through the glm-5.2[1m] model identifier. In Claude Code, Z.ai's setup example pairs that model name with:

{
  "env": {
    "CLAUDE_CODE_AUTO_COMPACT_WINDOW": "1000000",
    "ANTHROPIC_DEFAULT_HAIKU_MODEL": "glm-4.5-air",
    "ANTHROPIC_DEFAULT_SONNET_MODEL": "glm-5.2[1m]",
    "ANTHROPIC_DEFAULT_OPUS_MODEL": "glm-5.2[1m]"
  }
}

For OpenClaw, the documented model entry sets contextWindow to 1000000 and maxTokens to 131072. For Cline-style custom providers, Z.ai recommends using the OpenAI-compatible API provider, setting the base URL to https://api.z.ai/api/coding/paas/v4, entering the Z.ai API key, choosing a custom model such as glm-5.2, disabling image support, and setting context window size to 1000000.

Effort Mapping for Deeper Coding Reasoning

GLM-5.2 exposes effort behavior through Claude Code's /effort command. Z.ai maps Claude Code's low, medium, and high settings to GLM-5.2's high effort, while xhigh, max, and ultracode map to GLM-5.2's max effort. The official guidance is to use max effort for coding tasks when deeper reasoning and more stable complex-task performance matter.

This matters for upgrade testing because a team can mistakenly evaluate GLM-5.2 at a shallower effort level and miss the intended behavior. If your goal is long-horizon planning, multi-file refactoring, or agentic debugging, test with max effort and track quota consumption separately.

OpenClaw and Multi-Tool Configuration

OpenClaw users who cannot switch models through a provider picker can manually add glm-5.2 to ~/.openclaw/openclaw.json, update the primary model from zai/glm-5 to zai/glm-5.2, then restart the gateway. Z.ai also documents custom-provider setup for tools that support OpenAI-compatible endpoints, while warning that tools without custom model settings must wait for official support.

The result is a practical compatibility update for AI coding agents rather than a standalone chat-only release. GLM-5.2 is meant to sit inside developer tools where model routing, context windows, max output, and effort settings directly affect real work.

Compatibility Notes

GLM-5.2 access is currently documented through the GLM Coding Plan and officially supported coding tools. It is not safe to assume every SDK, general chatbot surface, or third-party agent runner can use the model immediately.

Supported or documented paths include Claude Code, OpenClaw, Cline-style OpenAI-compatible custom provider setup, and other coding tools that allow custom model configuration. Z.ai states that unsupported or unauthorized use of Coding Plan benefits may lead to restrictions, so teams should keep GLM-5.2 usage inside approved Coding Plan scenarios until API and chatbot services are formally documented.

The 1M-context path also has a versioning footnote: if Claude Code reports that the [1m] model does not exist, Z.ai recommends upgrading Claude Code and trying again. In practice, rollout issues may be tool-version or configuration problems rather than model unavailability.

Availability & Access

GLM-5.2 is available to GLM Coding Plan users on Lite, Pro, Max, and Team plans. Z.ai's public materials stated that API and chatbot services were planned for the week after the June 13, 2026 release, and that the model was planned for an MIT-licensed open-source release. Until those channels are live in official docs or repositories, the dependable access path is the Coding Plan inside supported coding tools.

For users comparing Claude Code, OpenClaw, Cline, and similar agentic coding workflows, the key question is not only "Can I call the model?" but "Can my tool expose the right model string, context window, and effort settings?" GLM-5.2's most important benefits require correct configuration.

Pricing & Plans

GLM-5.2 is included in the GLM Coding Plan rather than priced as a standalone public API model in the current docs reviewed. Z.ai's Coding Plan overview states that plans start at 18 USD per month and provides estimated usage limits by tier.

Plan GLM-5.2 access 5-hour estimated limit Weekly estimated limit
Lite Included Up to approx. 80 prompts Up to approx. 400 prompts
Pro Included Up to approx. 400 prompts Up to approx. 2,000 prompts
Max Included Up to approx. 1,600 prompts Up to approx. 8,000 prompts
Team Included Team-specific limits and policy Team-specific limits and policy

GLM-5.2 and GLM-5-Turbo consume quota faster than routine models. Z.ai says these advanced models are deducted at 3x during peak hours and 2x during off-peak hours, with a limited-time off-peak 1x benefit through the end of September. Peak hours are 14:00-18:00 UTC+8.

For cost control, the official guidance is simple: use GLM-5.2 for complex tasks and continue using GLM-4.7 for routine development work.

Best For

  • Developers working on large repositories where 200K context is no longer enough
  • Teams running long Claude Code or OpenClaw sessions that need deeper planning and more stable multi-step execution
  • Coding Plan users deciding when to spend premium quota on GLM-5.2 instead of GLM-4.7
  • Agent builders comparing MCP, tool use, and model-routing behavior across coding environments
  • Early adopters who can validate configuration, quota behavior, and output quality before broader API or chatbot rollout

FAQ

What is GLM-5.2?

GLM-5.2 is Z.ai's new flagship coding model for the GLM Coding Plan. It is documented for supported coding tools and adds a 1M-context configuration path for complex coding-agent workflows.

Is GLM-5.2 available to Lite users?

Yes. Z.ai's documentation says all GLM Coding Plan tiers support GLM-5.2, including Lite, Pro, Max, and Team plans.

How do I enable the 1M context window?

In Claude Code, use the glm-5.2[1m] model identifier and set CLAUDE_CODE_AUTO_COMPACT_WINDOW to 1000000. In OpenClaw or Cline-style tools, configure the model/context window according to Z.ai's documentation.

Does GLM-5.2 have public benchmark scores?

The official docs reviewed here do not publish a full GLM-5.2 benchmark table. They position it as a new flagship model with strong coding, 1M-context support, and long-horizon task strengths, but this page avoids inventing specific benchmark numbers.

Is GLM-5.2 available through API or chatbot?

Z.ai's public announcement said API and chatbot services were planned for the week after the June 13, 2026 release. Until official docs confirm those channels, the reliable access path is the GLM Coding Plan inside supported coding tools.

Is GLM-5.2 open source?

Z.ai's public announcement says GLM-5.2 will be open-sourced under the MIT License the following week. This page treats that as planned availability until the repository or official model page is published.

Should I use GLM-5.2 for every coding task?

No. Z.ai recommends GLM-5.2 for complex tasks and GLM-4.7 for routine tasks because GLM-5.2 consumes quota at a higher multiplier.

Version History

GLM-5.2

Current Version

Released on June 13, 2026

+What's new
3 updates
  • Use Z.ai's new flagship coding model across all GLM Coding Plan tiers, including Lite, Pro, Max, and Team plans, with 1M-context support for large codebases and long agent sessions
  • Configure Claude Code with `glm-5.2[1m]` plus `CLAUDE_CODE_AUTO_COMPACT_WINDOW=1000000`; configure OpenClaw/Cline-style tools with `glm-5.2`, 1M context settings, and 131K max output where the tool supports it.
  • Reserve GLM-5.2 for complex work because Z.ai treats it as a premium Opus-level model with higher quota multipliers, while GLM-4.7 remains the recommended option for routine tasks

GLM-5.1

Released on April 7, 2026

View Update
+What's new
3 updates
  • Use GLM-5.1 for long-horizon agentic engineering work, with stronger coding than GLM-5 and an official SWE-Bench Pro score of 58.4 against frontier coding models
  • Run multi-stage engineering tasks for up to 8 hours in one autonomous loop, covering planning, execution, testing, bug fixing, and production-grade delivery
  • Build agents that repeatedly experiment, analyze results, adjust strategy, and optimize systems, including benchmark-driven performance tuning workflows

GLM-5V-Turbo

Released on April 1, 2026

View Update
+What's new
3 updates
  • Process images, videos, design drafts, and document layouts natively as a multimodal vision coding model with 200K context window and 128K max output tokens for long-horizon agentic tasks
  • Execute the full perceive-plan-execute loop in GUI environments with leading scores on AndroidWorld, WebVoyager, and ZClawBench agent benchmarks optimized for OpenClaw workflows
  • Fuse visual understanding and code generation through CogViT vision encoder and 30+ task joint reinforcement learning across STEM, grounding, video, and coding domains

GLM-5.1

Released on March 27, 2026

View Update
+What's new
3 updates
  • Score 45.3 on Claude Code coding benchmark—94.6% of Claude Opus 4.6 performance—with 28% improvement over GLM-5, establishing a new frontier in cost-efficient agentic coding
  • Generate code at 55+ tokens/sec with estimated 200K context window, enabling long-horizon multi-file refactoring and distributed system architecture design
  • Access frontier-level coding intelligence from $3/month via Coding Plan with native compatibility for Claude Code, Cline, and Roo Code MCP tool integrations

GLM-5-Turbo

Released on March 15, 2026

+What's new
3 updates
  • Execute complex OpenClaw agent workflows with superior tool invocation reliability, scheduled task continuity, and high-throughput long-chain execution optimized since training phase
  • Decompose and follow multi-layered complex instructions with enhanced comprehension, supporting collaborative task division among multiple agents and MCP tool integrations
  • Outperform GLM-5 across multiple ZClawBench task categories while supporting 200K context input with multiple thinking modes for dynamic, long-running agent tasks

GLM-5

Released on February 12, 2026

View Update
+What's new
3 updates
  • Handle complex systems engineering and long-horizon agentic tasks with 744B parameters (40B active) and DeepSeek Sparse Attention integration
  • Generate production-ready documents (.docx, .pdf, .xlsx) directly from text with built-in Agent mode and multi-turn collaboration
  • Execute code with best-in-class open-source performance on reasoning benchmarks, approaching frontier model capabilities

GLM-4.7-Flash

Released on January 19, 2026

+What's new
3 updates
  • Get lightweight version of GLM-4.7 with faster response times and high throughput optimized for real-time coding, writing, and translation tasks
  • Deploy efficiently with competitive performance at smaller scale while maintaining strong general capabilities across reasoning and content generation
  • Access free-tier model designed for high-frequency use cases with best-in-class aesthetic outputs, low latency, and simplified deployment

GLM-4.7-Flash

Released on January 19, 2026

+What's new
3 updates
  • Get lightweight version of GLM-4.7 with faster response times and high throughput optimized for real-time coding, writing, and translation tasks
  • Deploy efficiently with competitive performance at smaller scale while maintaining strong general capabilities across reasoning and content generation
  • Access free-tier model designed for high-frequency use cases with best-in-class aesthetic outputs, low latency, and simplified deployment

GLM-4.7

Released on December 22, 2025

+What's new
3 updates
  • Build cleaner modern webpages and professional slides with major improvements in UI aesthetics, visual quality, and accurate layout sizing for frontend development
  • Solve multilingual coding tasks faster with 73.8% on SWE-bench and 41% on Terminal Bench 2.0, delivering stronger performance across agent frameworks
  • Reason through complex mathematical and logical problems with 42.8% on HLE benchmark while enhancing tool-using and web browsing capabilities

GLM-4.6

Released on September 30, 2025

+What's new
3 updates
  • Handle longer conversations and complex multi-file codebases with expanded 200K context window, enabling more sophisticated agentic task execution
  • Code more efficiently in Claude Code, Cline, and Roo Code with superior benchmark performance and improved real-world coding accuracy
  • Leverage enhanced reasoning capabilities with native tool use support during inference, delivering stronger results in search-based agent workflows

GLM-4.5

Released on July 28, 2025

+What's new
3 updates
  • Unify reasoning, coding, and agentic capabilities in a single model delivering balanced performance across complex problem-solving and rapid content generation
  • Switch between thinking mode for deep analysis and non-thinking mode for instant responses, adapting intelligence level to task complexity on demand
  • Build full-stack web applications with stronger frontend quality, and integrate the model into Claude Code, Roo Code, or custom agent workflows through tool APIs

ChatGLM3-6B

Released on October 27, 2023

+What's new
3 updates
  • Execute code directly and invoke external tools with new Code Interpreter and Function Call capabilities, enabling autonomous agent-style task completion
  • Process information more accurately with improved training across 42 benchmarks covering semantics, mathematics, reasoning, code, and knowledge understanding
  • Deploy locally on consumer hardware with open-source 6B-parameter model supporting both academic research and free commercial use after registration

ChatGLM2-6B

Released on June 25, 2023

+What's new
3 updates
  • Handle longer conversations with expanded 32K context window using FlashAttention technology, enabling deeper multi-turn dialogue understanding
  • Get responses 42% faster with improved inference speed and INT4 quantization, supporting extended dialogues on consumer GPUs with only 6GB VRAM
  • Achieve stronger performance across reasoning and knowledge benchmarks with enhanced training on 1.4T bilingual tokens covering diverse domains

ChatGLM-6B

Released on March 14, 2023

+What's new
3 updates
  • Deploy locally on consumer-grade graphics cards with lightweight 6.2B-parameter bilingual model, enabling private ChatGPT-style conversations
  • Chat naturally in Chinese and English with open-source conversational AI trained on approximately 1 trillion tokens of diverse text data
  • Use freely for commercial purposes after simple registration, with full model weights and training code available for academic research

Top alternatives

Related categories

From the blog

View all →

Track Z.ai in ToolWorthy Weekly

Important tool updates, better alternatives, and selected AI signals in one weekly brief.

Weekly only. Unsubscribe anytime.