Beijing, China – April 15, 2025 – In a strategic move that underscores its technological prowess and global ambitions, potentially paving the way for a future IPO, Chinese AI company Zhipu.AI has announced the comprehensive open-sourcing of its next-generation General Language Models (GLM). This release includes the advanced GLM-4 series and the groundbreaking GLM-Z1 inference models, boasting unprecedented inference speeds and the launch of a dedicated international domain, Z.ai.
The spotlight shines on the GLM-Z1 inference model, which Zhipu claims achieves inference speeds up to eight times faster than DeepSeek-R1. By optimizing GQA parameters, employing quantization, and implementing speculative sampling, the GLM-Z1-32B-0414 delivers a remarkable 200 tokens per second on consumer-grade GPUs – a staggering 50 times faster than human reading speed. This exceptional responsiveness positions it as a frontrunner in efficient AI inference, a key advantage as Zhipu eyes further market expansion.
Further demonstrating its innovation, Zhipu has unveiled the “Rumination” model, GLM-Z1-Rumination-32B-0414. This model signifies a leap towards more autonomous AI agents, capable of actively searching the internet, utilizing tools, conducting in-depth analysis, and self-verifying information to tackle complex, open-ended queries – a significant step beyond purely reactive AI and a testament to Zhipu’s cutting-edge research.
The open-sourced portfolio also features the foundational GLM-4-32B-0414, specifically enhanced for agent capabilities with superior performance in tool usage, web search, and code generation. Its real-time code generation capability for languages like HTML, CSS, JS, and SVG directly within conversations offers a significant boost to developer productivity.
Recognizing the diverse needs of the AI community, Zhipu has also open-sourced smaller 9B parameter versions of both GLM-4 and GLM-Z1 models. These compact yet powerful models demonstrate impressive performance in mathematical reasoning and general tasks, providing an efficient solution for resource-constrained environments and further broadening Zhipu’s appeal. All models are released under the permissive MIT license.
This strategic open-sourcing initiative, coupled with the launch of the international-facing Z.ai platform, strongly signals Zhipu.AI’s commitment to global accessibility and fostering a vibrant open-source AI ecosystem. The new domain, Z.ai, serves as a central hub for users worldwide to freely experience these advanced models through a web interface and dedicated app.
For enterprise clients, Zhipu continues to offer its robust Model-as-a-Service (MaaS) platform, now integrating the newly open-sourced base and inference models. This platform provides API access with tiered pricing, including the ultra-fast GLM-Z1-AirX, the highly cost-effective GLM-Z1-Air, and the free GLM-Z1-Flash, catering to a wide range of commercial applications. The foundational models GLM-4-Air-250414 and the free GLM-4-Flash-250414 are also available on the MaaS platform.
As Zhipu.AI strategically expands its global footprint and showcases its technological leadership through this significant open-source release and the launch of Z.ai, the move could be interpreted as a strong indicator of the company’s readiness and ambition for a potential IPO in the near future. By democratizing access to its cutting-edge AI technology, Zhipu is not only fostering innovation but also building a strong global presence and user base.
Explore the new GLM models for free at: https://chat.z.ai/ Open Source Download: https://huggingface.co/collections/THUDM/glm-4-0414-67f3cbcb34dd9d252707cb2e
The post Zhipu.AI’s Open-Source Power Play: Blazing-Fast GLM Models & Global Expansion Ahead of Potential IPO first appeared on Synced.
Leave a Reply