The Circuitry
THE CIRCUITRYYour one-stop source for all tech news
HOMETODAYNEWSFEEDEVENTS
BOOKMARKS
RSS
© 2026 The Circuitry
About UsSourcesContactCorrectionsPrivacy
  • Home
  • Feed
  • Today
  • Saved
Scroll for more
Verification
VERIFIEDConfidence: HIGH
Source identified
Claims cross-referenced
No discrepancies found
Fact-check summary

Z.ai's GLM-5.2 release, benchmarks, Huawei training, pricing, and 1M context are corroborated by VentureBeat, the company's official blog, Hugging Face, docs.z.ai, Wikipedia, and multiple tech outlets as of June 16-18 2026.

Sourcing
1source

via Decrypt

Decrypt · track record
22Stories
100%Verified
930d
All sources →
Home/Tech/Z.ai Launches GLM-5.2, a Huawei-Trained Rival to Top AI Models
VERIFIEDBy Xavier Rivera· ·2 min read

Z.ai Launches GLM-5.2, a Huawei-Trained Rival to Top AI Models

Z.ai released GLM-5.2 on June 16, which scores within 1 percent of Claude Opus 4.8 on FrontierSWE while beating GPT-5.5. The MIT-licensed model trained solely on Huawei Ascend chips without NVIDIA hardware and undercuts Western API pricing.

Source:Decrypt
Post
Z.ai Launches GLM-5.2, a Huawei-Trained Rival to Top AI Models
TL;DRAI · 60 sec read

Z.ai releases GLM-5.2, a 744B MoE model trained solely on Huawei Ascend chips. It scores 74.4 on FrontierSWE, within 1 percent of Claude Opus 4.8 and above GPT-5.5, while offering a 1M-token context. The open-source model undercuts Western API prices and faces no regional limits, though local runs need 256GB memory.

Z.ai has introduced GLM-5.2, which reportedly delivers performance within 1 percent of Claude Opus 4.8 on the FrontierSWE benchmark while surpassing GPT-5.5.

GLM-5.2 posts strong benchmark scores. The model achieved 74.4 on FrontierSWE against Claude Opus 4.8's 75.1 and GPT-5.5's 72.6. It reached 62.1 on SWE-bench Pro, exceeding both GPT-5.5 at 58.6 and its predecessor GLM-5.1 at 58.4.
Development occurred solely on Huawei Ascend chips without any NVIDIA components.
These results position it as the leading open-source entry on the Artificial Analysis Intelligence Index that combines nine separate evaluations. Benchmarks from OpenRouter align it with the now-banned Claude Fable 5.

The model was trained exclusively on Huawei silicon. Development occurred solely on Huawei Ascend chips without any NVIDIA components. Stability AI founder Emad Mostaque placed overall training expenses near $25 million, approximately 80 percent of which went toward post-training.
From The CircuitryThe Feed — live briefs across tech, all day.See what’s happening →
The Beijing laboratory, listed on the U.S. Entity List since January 2025, had already used Huawei Ascend Atlas servers for image models without American hardware. GLM-5.2 expands that foundation as a 744-billion-parameter mixture-of-experts system featuring a genuine 1-million-token context window, five times the 200K capacity of GLM-5.1.
GLM-5.2 expands that foundation as a 744-billion-parameter mixture-of-experts system featuring a genuine 1-million-token context window, five times the 200K capacity of GLM-5.1.
Pricing undercuts Western frontier models. Access through the API runs $1.40 per million input tokens and $4.40 per million output tokens. Those figures stand well below Claude Opus 4.8 rates of $5 input and $25 output. A monthly Coding Plan begins near $18 and works inside Claude Code, Cline, Kilo Code plus leading agentic platforms. The release carries an MIT license and carries no regional limits.

Local deployment requires substantial hardware. Unsloth AI supplied 2-bit GGUF quantizations that compress the original 1.51TB file size down to 238GB while preserving roughly 82 percent accuracy. Execution still calls for 256GB of unified memory or an equivalent RAM/VRAM setup, whether a fully equipped M4 Ultra Mac Studio or a mid-range GPU workstation paired with 256GB system RAM and mixture-of-experts offloading.

Z.ai released GLM-5.2 on June 16. Combined with the recent ban on Anthropic Fable, the debut has reportedly lifted the company's shares 90 percent to a record high.
Why this mattersAI · ~100 words

Tap a lens to see what this story means for you.

Reader-supported
DonateBuy me a coffee →Follow@thecircuitry_ →Follow@thecircuitry.to →

Reader-supported · Daily Brief

Daily brief at 7 AM ET. Top tech stories, every morning. Sourced and fact-checked.

HELP US IMPROVE
From The Circuitry

See what’s happening right now

The Feed runs all day — short, verified briefs the moment they break.

Open the Feed →
From The Circuitry

Follow @thecircuitry_

Every story we publish, as it happens. No noise between.

Follow on X ↗On Bluesky ↗

Reader-supported

The Circuitry is a passion project I've always wanted to build, and I love the work behind it.

Running it costs real money. APIs, hosting, time. To keep improving the site and growing this into something useful for everyone, those costs have to be covered.

Any contribution is appreciated. If not, no pressure. Thanks for reading.

Buy me a coffee
AIOpen SourceChina
More fromDecrypt
  • Greek Regulator Expected to Reject Binance MiCA License Application

    Markets · 2d
  • US Directs Anthropic to Block Access to Latest Frontier AI Models

    Tech · 6d
  • Iran-Linked Hackers Claim FBI Drone Breach, Threaten World Cup

    Tech · 7d
More inTech
  • Tesla files trademark for MEGAPOD AI data center hardware

    Tech · 1h
  • Hackers exploit info disclosure bug in Gravity SMTP plugin

    Tech · 1h
  • CISA Directs Federal Agencies to Secure Splunk Enterprise Systems by Sunday

    Tech · 7h
SupportThe Work

The Circuitry is reader-supported. If you find the daily brief useful, you can buy me a coffee to keep it going.

Buy a coffee →
SubscribeCircuitry Brief

Daily brief at 7 AM ET. Top tech stories, every morning.

MORE IN TECH

Tesla files trademark for MEGAPOD AI data center hardware

Tesla filed a USPTO trademark for MEGAPOD on June 18, 2026, covering modular data center hardware for AI computing. The live pending application signals the company's continued focus on AI infrastructure products.

Hackers exploit info disclosure bug in Gravity SMTP plugin

Threat actors are exploiting CVE-2026-4020 in the Gravity SMTP WordPress plugin active on over 100,000 sites, with Wordfence blocking more than 17 million attempts since a June 7 spike. The unauthenticated endpoint leaks API keys, email credentials, and detailed system information that can enable impersonation and targeted follow-on attacks.

CISA Directs Federal Agencies to Secure Splunk Enterprise Systems by Sunday

CISA placed CVE-2026-20253 affecting Splunk Enterprise on its KEV catalog after confirmed active exploitation and required federal agencies to install patches by June 21. The unauthenticated flaw permits remote file creation or truncation and potential RCE, while Shadowserver monitors over 1,400 publicly reachable instances.