Z.ai GLM-5.2

GLM-5.2

Use Z.ai's new flagship coding model across all GLM Coding Plan tiers, including Lite, Pro, Max, and Team plans, with 1M-context support for large codebases and long agent sessions Configure Claude Code with `glm-5.2[1m]` plus `CLAUDE_CODE_AUTO_COMPACT_WINDOW=1000000`; configure OpenClaw/Cline-style tools with `glm-5.2`, 1M context settings, and 131K max output where the tool supports it. Reserve GLM-5.2 for complex work because Z.ai treats it as a premium Opus-level model with higher quota multipliers, while GLM-4.7 remains the recommended option for routine tasks

Reviewed by ToolWorthy Editors·updated 1 month ago·GLM-5.2 released 1 month ago

Pricing:From $18/mo

Categories:

AI Chatbots

Visit Site

Is this your tool? Upgrade this listing to Verified for $99 →

Jump to section

No media available

Featured alternatives

Coasty

MakersClaw

Blocks.ai

BrowserAct

YAGNI

Kimi

Pros & Cons

Pros

Brings GLM's documented coding model path to a 1M-token context window for long repository and agent sessions
Available across Lite, Pro, Max, and Team Coding Plan tiers rather than limited to only top-tier users
Works with common coding-agent tools that support Z.ai's Coding Plan or custom OpenAI-compatible model configuration
Exposes max-effort behavior for deeper reasoning in Claude Code workflows
Gives OpenClaw users a documented manual configuration path

Cons

Current official docs focus on setup and quota rules, not a full benchmark report for GLM-5.2
API, chatbot, and open-source release channels were announced as planned but should not be treated as live until official docs confirm them
Premium quota multipliers can make GLM-5.2 expensive to use for routine tasks inside a fixed plan
1M-context mode requires the correct [1m] suffix and tool settings, so misconfiguration can hide the main upgrade
Unsupported third-party usage may violate Coding Plan restrictions

Jump to section

Overview

GLM-5.2 is Z.ai's new flagship coding model for the GLM Coding Plan, announced on June 13, 2026 and rolled out to Lite, Pro, Max, and Team users. It is positioned as the upgrade path for complex coding-agent work: large repositories, long-running sessions, deeper reasoning, and workflows that need a usable 1M-token context window rather than the 200K-class context used by earlier GLM-5.x models.

The practical change is not just a model name. Z.ai documents glm-5.2[1m] specifically for Claude Code 1M-context setup; OpenClaw and Cline-style custom providers use glm-5.2 with the documented 1M context-window setting where supported. The [1m] suffix enables the 1M-context variant when paired with the CLAUDE_CODE_AUTO_COMPACT_WINDOW=1000000 setting in Claude Code. For teams already using Z.ai's Coding Plan inside supported coding tools, GLM-5.2 is the version to test on the hardest agentic engineering tasks.

This release should still be treated carefully. Z.ai's current documentation emphasizes availability, setup, context length, effort mapping, and quota rules; it does not publish a full GLM-5.2 benchmark table in the docs reviewed here. API, chatbot, and open-source availability were announced as planned for the week after the June 13, 2026 release, so this page avoids treating those channels as already live until Z.ai publishes the corresponding docs or repository.

What's New

GLM-5.2 for All Coding Plan Tiers

The GLM Coding Plan now lists GLM-5.2 as supported across Lite, Pro, Max, and Team plans. That is a wider rollout than an invite-only or Max-only release: existing Coding Plan users can switch their supported coding agent to GLM-5.2 without waiting for a separate enterprise enablement path.

Z.ai describes GLM-5.2 as an advanced model comparable to Claude Opus-class usage inside the Coding Plan. That makes it a better fit for complex tasks than routine autocomplete or small bug fixes, where the documentation still recommends GLM-4.7 to preserve quota.

1M-Context Mode

GLM-5.2 adds a documented 1M-context path through the glm-5.2[1m] model identifier. In Claude Code, Z.ai's setup example pairs that model name with:

{
  "env": {
    "CLAUDE_CODE_AUTO_COMPACT_WINDOW": "1000000",
    "ANTHROPIC_DEFAULT_HAIKU_MODEL": "glm-4.5-air",
    "ANTHROPIC_DEFAULT_SONNET_MODEL": "glm-5.2[1m]",
    "ANTHROPIC_DEFAULT_OPUS_MODEL": "glm-5.2[1m]"
  }
}

For OpenClaw, the documented model entry sets contextWindow to 1000000 and maxTokens to 131072. For Cline-style custom providers, Z.ai recommends using the OpenAI-compatible API provider, setting the base URL to https://api.z.ai/api/coding/paas/v4, entering the Z.ai API key, choosing a custom model such as glm-5.2, disabling image support, and setting context window size to 1000000.

Effort Mapping for Deeper Coding Reasoning

GLM-5.2 exposes effort behavior through Claude Code's /effort command. Z.ai maps Claude Code's low, medium, and high settings to GLM-5.2's high effort, while xhigh, max, and ultracode map to GLM-5.2's max effort. The official guidance is to use max effort for coding tasks when deeper reasoning and more stable complex-task performance matter.

This matters for upgrade testing because a team can mistakenly evaluate GLM-5.2 at a shallower effort level and miss the intended behavior. If your goal is long-horizon planning, multi-file refactoring, or agentic debugging, test with max effort and track quota consumption separately.

OpenClaw and Multi-Tool Configuration

OpenClaw users who cannot switch models through a provider picker can manually add glm-5.2 to ~/.openclaw/openclaw.json, update the primary model from zai/glm-5 to zai/glm-5.2, then restart the gateway. Z.ai also documents custom-provider setup for tools that support OpenAI-compatible endpoints, while warning that tools without custom model settings must wait for official support.

The result is a practical compatibility update for AI coding agents rather than a standalone chat-only release. GLM-5.2 is meant to sit inside developer tools where model routing, context windows, max output, and effort settings directly affect real work.

Compatibility Notes

GLM-5.2 access is currently documented through the GLM Coding Plan and officially supported coding tools. It is not safe to assume every SDK, general chatbot surface, or third-party agent runner can use the model immediately.

Supported or documented paths include Claude Code, OpenClaw, Cline-style OpenAI-compatible custom provider setup, and other coding tools that allow custom model configuration. Z.ai states that unsupported or unauthorized use of Coding Plan benefits may lead to restrictions, so teams should keep GLM-5.2 usage inside approved Coding Plan scenarios until API and chatbot services are formally documented.

The 1M-context path also has a versioning footnote: if Claude Code reports that the [1m] model does not exist, Z.ai recommends upgrading Claude Code and trying again. In practice, rollout issues may be tool-version or configuration problems rather than model unavailability.

Availability & Access

GLM-5.2 is available to GLM Coding Plan users on Lite, Pro, Max, and Team plans. Z.ai's public materials stated that API and chatbot services were planned for the week after the June 13, 2026 release, and that the model was planned for an MIT-licensed open-source release. Until those channels are live in official docs or repositories, the dependable access path is the Coding Plan inside supported coding tools.

For users comparing Claude Code, OpenClaw, Cline, and similar agentic coding workflows, the key question is not only "Can I call the model?" but "Can my tool expose the right model string, context window, and effort settings?" GLM-5.2's most important benefits require correct configuration.

Pricing & Plans

GLM-5.2 is included in the GLM Coding Plan rather than priced as a standalone public API model in the current docs reviewed. Z.ai's Coding Plan overview states that plans start at 18 USD per month and provides estimated usage limits by tier.

Plan	GLM-5.2 access	5-hour estimated limit	Weekly estimated limit
Lite	Included	Up to approx. 80 prompts	Up to approx. 400 prompts
Pro	Included	Up to approx. 400 prompts	Up to approx. 2,000 prompts
Max	Included	Up to approx. 1,600 prompts	Up to approx. 8,000 prompts
Team	Included	Team-specific limits and policy	Team-specific limits and policy

GLM-5.2 and GLM-5-Turbo consume quota faster than routine models. Z.ai says these advanced models are deducted at 3x during peak hours and 2x during off-peak hours, with a limited-time off-peak 1x benefit through the end of September. Peak hours are 14:00-18:00 UTC+8.

For cost control, the official guidance is simple: use GLM-5.2 for complex tasks and continue using GLM-4.7 for routine development work.

Best For

Developers working on large repositories where 200K context is no longer enough
Teams running long Claude Code or OpenClaw sessions that need deeper planning and more stable multi-step execution
Coding Plan users deciding when to spend premium quota on GLM-5.2 instead of GLM-4.7
Agent builders comparing MCP, tool use, and model-routing behavior across coding environments
Early adopters who can validate configuration, quota behavior, and output quality before broader API or chatbot rollout

FAQ

What is GLM-5.2?

GLM-5.2 is Z.ai's new flagship coding model for the GLM Coding Plan. It is documented for supported coding tools and adds a 1M-context configuration path for complex coding-agent workflows.

Is GLM-5.2 available to Lite users?

Yes. Z.ai's documentation says all GLM Coding Plan tiers support GLM-5.2, including Lite, Pro, Max, and Team plans.

How do I enable the 1M context window?

In Claude Code, use the glm-5.2[1m] model identifier and set CLAUDE_CODE_AUTO_COMPACT_WINDOW to 1000000. In OpenClaw or Cline-style tools, configure the model/context window according to Z.ai's documentation.

Does GLM-5.2 have public benchmark scores?

The official docs reviewed here do not publish a full GLM-5.2 benchmark table. They position it as a new flagship model with strong coding, 1M-context support, and long-horizon task strengths, but this page avoids inventing specific benchmark numbers.

Is GLM-5.2 available through API or chatbot?

Z.ai's public announcement said API and chatbot services were planned for the week after the June 13, 2026 release. Until official docs confirm those channels, the reliable access path is the GLM Coding Plan inside supported coding tools.

Released on March 15, 2026

+What's new

3 updates

•Execute complex OpenClaw agent workflows with superior tool invocation reliability, scheduled task continuity, and high-throughput long-chain execution optimized since training phase
•Decompose and follow multi-layered complex instructions with enhanced comprehension, supporting collaborative task division among multiple agents and MCP tool integrations
•Outperform GLM-5 across multiple ZClawBench task categories while supporting 200K context input with multiple thinking modes for dynamic, long-running agent tasks

GLM-5

Released on February 12, 2026

View Update

+What's new

3 updates

•Handle complex systems engineering and long-horizon agentic tasks with 744B parameters (40B active) and DeepSeek Sparse Attention integration
•Generate production-ready documents (.docx, .pdf, .xlsx) directly from text with built-in Agent mode and multi-turn collaboration
•Execute code with best-in-class open-source performance on reasoning benchmarks, approaching frontier model capabilities

GLM-4.7-Flash

Released on January 19, 2026

+What's new

3 updates

•Get lightweight version of GLM-4.7 with faster response times and high throughput optimized for real-time coding, writing, and translation tasks
•Deploy efficiently with competitive performance at smaller scale while maintaining strong general capabilities across reasoning and content generation
•Access free-tier model designed for high-frequency use cases with best-in-class aesthetic outputs, low latency, and simplified deployment

GLM-4.7-Flash

Released on January 19, 2026

+What's new

3 updates

•Get lightweight version of GLM-4.7 with faster response times and high throughput optimized for real-time coding, writing, and translation tasks
•Deploy efficiently with competitive performance at smaller scale while maintaining strong general capabilities across reasoning and content generation
•Access free-tier model designed for high-frequency use cases with best-in-class aesthetic outputs, low latency, and simplified deployment

GLM-4.7

Released on December 22, 2025

+What's new

3 updates

•Build cleaner modern webpages and professional slides with major improvements in UI aesthetics, visual quality, and accurate layout sizing for frontend development
•Solve multilingual coding tasks faster with 73.8% on SWE-bench and 41% on Terminal Bench 2.0, delivering stronger performance across agent frameworks
•Reason through complex mathematical and logical problems with 42.8% on HLE benchmark while enhancing tool-using and web browsing capabilities

GLM-4.6

Released on September 30, 2025

+What's new

3 updates

•Handle longer conversations and complex multi-file codebases with expanded 200K context window, enabling more sophisticated agentic task execution
•Code more efficiently in Claude Code, Cline, and Roo Code with superior benchmark performance and improved real-world coding accuracy
•Leverage enhanced reasoning capabilities with native tool use support during inference, delivering stronger results in search-based agent workflows

GLM-4.5

Released on July 28, 2025

+What's new

3 updates

•Unify reasoning, coding, and agentic capabilities in a single model delivering balanced performance across complex problem-solving and rapid content generation
•Switch between thinking mode for deep analysis and non-thinking mode for instant responses, adapting intelligence level to task complexity on demand
•Build full-stack web applications with stronger frontend quality, and integrate the model into Claude Code, Roo Code, or custom agent workflows through tool APIs

ChatGLM3-6B

Released on October 27, 2023

+What's new

3 updates

•Execute code directly and invoke external tools with new Code Interpreter and Function Call capabilities, enabling autonomous agent-style task completion
•Process information more accurately with improved training across 42 benchmarks covering semantics, mathematics, reasoning, code, and knowledge understanding
•Deploy locally on consumer hardware with open-source 6B-parameter model supporting both academic research and free commercial use after registration

ChatGLM2-6B

Released on June 25, 2023

+What's new

3 updates

•Handle longer conversations with expanded 32K context window using FlashAttention technology, enabling deeper multi-turn dialogue understanding
•Get responses 42% faster with improved inference speed and INT4 quantization, supporting extended dialogues on consumer GPUs with only 6GB VRAM
•Achieve stronger performance across reasoning and knowledge benchmarks with enhanced training on 1.4T bilingual tokens covering diverse domains

ChatGLM-6B

Released on March 14, 2023

+What's new

3 updates

•Deploy locally on consumer-grade graphics cards with lightweight 6.2B-parameter bilingual model, enabling private ChatGPT-style conversations
•Chat naturally in Chinese and English with open-source conversational AI trained on approximately 1 trillion tokens of diverse text data
•Use freely for commercial purposes after simple registration, with full model weights and training code available for academic research

Top alternatives

ChatGPT

GPT-5.6Verified

Generates human-like text, code, translations, and summaries from natural language inputs across diverse topics.

21 days ago

Free + from $20/mo

Claude

Opus 5Verified

Claude is an AI assistant from Anthropic, designed for work, focused on safety, accuracy, and security. Currently in open beta.

6 days ago

From $5/per use

Gemini

3.6 FlashVerified

Access Google's AI models directly on your phone for assistance in writing, planning, learning, and more.

9 days ago

Free + from $1.50/per use

Grok

Grok 4.5

Answers questions, generates images and videos, and provides trend analysis using a conversational AI with real-time search.

22 days ago

Free + from $30/mo

DeepSeek

DeepSeek-V4 Preview

Delivers open-weight V4 frontier models (Pro 1.6T / Flash 284B) with 1M context, MIT license, and OpenAI/Anthropic API compatibility.

3 months ago

Free + from $0.14/per M input tokens

Qwen

Qwen3.6-Plus

Generates conversational responses, images, and analysis from prompts, documents, videos, and web searches.

4 months ago

Free + from $0/per 1M input tokens

Related categories

AI Search Engine AI Detector

From the blog

View all →

11 Best AI Chatbots – 2026 Comparison & Reviews

Compare 11 leading AI chatbots tested for capabilities, pricing, and use cases. Find the right conversational AI for writing, research, coding, and team collaboration.

Jan 11, 2026

10 Best Intercom Alternatives 2026 — After the Fin Rebrand

10 Intercom alternatives compared after the 2026 Fin rebrand and Salesforce deal — Zendesk, Freshdesk, Help Scout, Crisp, Front, Gorgias, Tidio, Chatwoot and more.

Jul 29, 2026

19 Best AI Recruiting Tools 2026 - Hiring Stack Fit

Choosing AI recruiting tools is not just ATS vs sourcing. Compare 19 platforms by workflow fit, AI depth, pricing transparency, interview support, and hiring team needs.

Jul 19, 2026

20 Best AI Customer Service Tools 2026 - Support Fit

Choosing AI customer service tools is not just chatbot vs helpdesk. Compare 20 options by support workflow, AI autonomy, pricing model, and team fit.

Jul 18, 2026

Track Z.ai in ToolWorthy Weekly

Important tool updates, better alternatives, and selected AI signals in one weekly brief.