THE CIRCUITRY: Your one-stop source for all tech news


By Xavier Rivera · 2 min read

Z.ai Launches GLM-5.2, a Huawei-Trained Rival to Top AI Models

Z.ai released GLM-5.2 on June 16. The model scores within 1 percent of Claude Opus 4.8 on FrontierSWE while beating GPT-5.5. The MIT-licensed model was trained solely on Huawei Ascend chips, without NVIDIA hardware, and undercuts Western API pricing.

Source: Decrypt


TL;DR

Z.ai has introduced GLM-5.2, which reportedly delivers performance within 1 percent of Claude Opus 4.8 on the FrontierSWE benchmark while surpassing GPT-5.5.

GLM-5.2 posts strong benchmark scores. The model achieved 74.4 on FrontierSWE against Claude Opus 4.8's 75.1 and GPT-5.5's 72.6. It reached 62.1 on SWE-bench Pro, exceeding both GPT-5.5 at 58.6 and its predecessor GLM-5.1 at 58.4.
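The "within 1 percent" framing can be checked directly from the quoted FrontierSWE scores. The short sketch below assumes the claim refers to the gap relative to Claude Opus 4.8's score; the function name is illustrative and not taken from any benchmark tooling.

```python
# FrontierSWE scores as quoted in the article.
scores = {"Claude Opus 4.8": 75.1, "GLM-5.2": 74.4, "GPT-5.5": 72.6}


def relative_gap_pct(leader: float, challenger: float) -> float:
    """How far the challenger trails the leader, as a percentage of the leader's score."""
    return (leader - challenger) / leader * 100


# Assumption: "within 1 percent" means the relative gap, not absolute points.
gap_to_opus = relative_gap_pct(scores["Claude Opus 4.8"], scores["GLM-5.2"])
lead_over_gpt = scores["GLM-5.2"] - scores["GPT-5.5"]

print(f"GLM-5.2 trails Claude Opus 4.8 by {gap_to_opus:.2f}% of its score")
print(f"GLM-5.2 leads GPT-5.5 by {lead_over_gpt:.1f} points")
```

On these numbers the relative gap comes out a little under 1 percent, which matches the headline claim under that reading.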


The benchmark results position GLM-5.2 as the leading open-source entry on the Artificial Analysis Intelligence Index, which combines nine separate evaluations. OpenRouter benchmarks place it alongside the now-banned Claude Fable 5.

The model was trained exclusively on Huawei Ascend chips, with no NVIDIA components. Stability AI founder Emad Mostaque estimated total training costs at roughly $25 million, with approximately 80 percent of that going toward post-training.


The Beijing laboratory, listed on the U.S. Entity List since January 2025, had already used Huawei Ascend Atlas servers for image models without American hardware. GLM-5.2 expands that foundation as a 744-billion-parameter mixture-of-experts system featuring a genuine 1-million-token context window, five times the 200K capacity of GLM-5.1.


Pricing undercuts Western frontier models. API access costs $1.40 per million input tokens and $4.40 per million output tokens, well below Claude Opus 4.8's rates of $5 per million input tokens and $25 per million output tokens. A monthly Coding Plan starts near $18 and works inside Claude Code, Cline, Kilo Code, and other leading agentic platforms. The release carries an MIT license and has no regional limits.
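To make the price gap concrete, the sketch below applies the quoted per-million-token rates to a hypothetical workload. The 2 million input and 500,000 output tokens are an assumption chosen for illustration, and the calculation ignores caching discounts, batch pricing, or any other adjustments the providers may offer.

```python
# Quoted API prices in USD per million tokens: (input, output).
PRICES = {
    "GLM-5.2": (1.40, 4.40),
    "Claude Opus 4.8": (5.00, 25.00),
}


def job_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of one job at the quoted list prices."""
    input_rate, output_rate = PRICES[model]
    return input_tokens / 1e6 * input_rate + output_tokens / 1e6 * output_rate


# Hypothetical workload, not a figure from the article.
INPUT_TOKENS = 2_000_000
OUTPUT_TOKENS = 500_000

glm_cost = job_cost("GLM-5.2", INPUT_TOKENS, OUTPUT_TOKENS)
opus_cost = job_cost("Claude Opus 4.8", INPUT_TOKENS, OUTPUT_TOKENS)

print(f"GLM-5.2:         ${glm_cost:.2f}")
print(f"Claude Opus 4.8: ${opus_cost:.2f}")
print(f"Ratio:           {opus_cost / glm_cost:.1f}x")
```

The ratio depends on the input/output mix: output-heavy jobs widen the gap, because the quoted output prices differ by a larger factor than the input prices.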

Local deployment requires substantial hardware. Unsloth AI supplied 2-bit GGUF quantizations that compress the original 1.51TB file size down to 238GB while preserving roughly 82 percent accuracy. Execution still calls for 256GB of unified memory or an equivalent RAM/VRAM setup, whether a fully equipped M4 Ultra Mac Studio or a mid-range GPU workstation paired with 256GB system RAM and mixture-of-experts offloading.
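The file sizes above imply a rough storage cost per parameter. The sketch below assumes decimal units (1 TB = 10^12 bytes, 1 GB = 10^9 bytes) and uses the 744-billion parameter count reported for the model; it ignores file metadata and runtime overhead such as the KV cache, so the headroom figure is optimistic.

```python
PARAMS = 744e9            # parameter count reported for GLM-5.2
ORIGINAL_BYTES = 1.51e12  # 1.51 TB original weights (decimal units assumed)
QUANT_BYTES = 238e9       # 238 GB 2-bit GGUF (decimal units assumed)
MEMORY_BYTES = 256e9      # 256 GB unified memory requirement from the article


def bits_per_param(total_bytes: float, params: float) -> float:
    """Average number of bits used to store each parameter."""
    return total_bytes * 8 / params


bits_original = bits_per_param(ORIGINAL_BYTES, PARAMS)
bits_quant = bits_per_param(QUANT_BYTES, PARAMS)
compression_ratio = ORIGINAL_BYTES / QUANT_BYTES
headroom_gb = (MEMORY_BYTES - QUANT_BYTES) / 1e9

print(f"Original:  {bits_original:.2f} bits per parameter")
print(f"Quantized: {bits_quant:.2f} bits per parameter")
print(f"Compression ratio: {compression_ratio:.2f}x")
print(f"Headroom in 256 GB: {headroom_gb:.0f} GB before runtime overhead")
```

Under these assumptions the original weights work out to roughly 16 bits per parameter, and the "2-bit" files average somewhat more than 2 bits per parameter. The article does not say how the quantization allocates bits across layers.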

Z.ai released GLM-5.2 on June 16. Combined with the recent ban on Anthropic Fable, the debut has reportedly lifted the company's shares 90 percent to a record high.

