Written by Media Outreach
LLM Performance Evaluation: Agentic, Reasoning and Coding
SINGAPORE - Media OutReach Newswire - 19 February 2026 - GLM-5, newly released as open source, signals a broader shift in artificial intelligence. Large language models are moving beyond generating code snippets or interface prototypes toward building complete systems and carrying out complex, end-to-end tasks. The change marks a transition from so-called "vibe coding" to what researchers increasingly describe as agentic engineering.

Built for this new phase, GLM-5 ranks among the strongest open-source models for coding and autonomous task execution. In practical programming settings, its performance approaches that of Claude Opus 4.5, particularly in complex system design and long-horizon tasks requiring sustained planning and execution.

The model rests on a new architecture aimed at scaling both capability and efficiency. Its total parameter count has expanded from 355bn to 744bn, with active parameters rising from 32bn to 40bn, and its pre-training data has grown to 28.5trn tokens. These increases are paired with advances in training methods. A framework called Slime enables asynchronous reinforcement learning at a larger scale, which lets the model learn continuously from extended interactions and improves post-training efficiency. GLM-5 also introduces DeepSeek Sparse Attention, which maintains long-context performance while cutting deployment costs and improving token efficiency.
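To put the parameter figures in proportion, the short sketch below works out what share of the total parameters is active in each case. It assumes that "active parameters" means the subset used at any one time, as in mixture-of-experts designs, which the announcement does not spell out. The labels `previous` and `glm5` and the helper names are illustrative, not official identifiers.

```python
# Figures quoted in the announcement, in billions of parameters.
# "previous" and "glm5" are illustrative labels, not official model names.
MODELS = {
    "previous": {"total_bn": 355, "active_bn": 32},
    "glm5": {"total_bn": 744, "active_bn": 40},
}


def active_share(total_bn: float, active_bn: float) -> float:
    """Return active parameters as a fraction of total parameters."""
    return active_bn / total_bn


def summarize(models: dict) -> dict:
    """Map each label to its active share, as a percentage rounded to 0.1."""
    return {
        name: round(100 * active_share(m["total_bn"], m["active_bn"]), 1)
        for name, m in models.items()
    }


if __name__ == "__main__":
    for name, pct in summarize(MODELS).items():
        print(f"{name}: {pct}% of parameters active")
```

On these figures the active share falls from about 9.0% to about 5.4%: the total count roughly doubles while the active count rises by a quarter.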

Benchmarks suggest strong gains. GLM-5 scores 77.8 on SWE-bench-Verified and 56.2 on Terminal Bench 2.0, the highest reported results among open-source models, and it surpasses Gemini 3 Pro on several software-engineering tasks. On Vending Bench 2, which simulates running a vending-machine business over a year, it finishes with a balance of $4,432, leading other open-source models in operational and economic management.

These results highlight the qualities required for agentic engineering: maintaining goals across long horizons, managing resources, and coordinating multi-step processes. As models increasingly assume these capabilities, the frontier of AI appears to be shifting from writing code to delivering functioning systems.

Chat & Official API Access

Z.ai Chat: https://chat.z.ai

GLM Coding Plan: https://z.ai/subscribe?utm_source=pr&utm_medium=press&utm_campaign=launch

Open-Source Repositories

GitHub: https://github.com/zai-org/GLM-5

Hugging Face: https://huggingface.co/zai-org/GLM-5

Blog

GLM-5 Technical Blog: https://z.ai/blog/glm-5

Hashtag: #ZAI

The issuer is solely responsible for the content of this announcement.
