Security

ASSERT Framework Turns AI Specs Into Executable Evals

3 min read

Summary

Microsoft has released ASSERT, an open-source framework that converts natural-language behavior requirements into executable evaluation pipelines for AI models, agents, and applications. The tool helps teams build behavior-specific tests faster, improve regression coverage, and better validate whether AI systems follow product policies and safety expectations.

Need help with Security?Talk to an Expert

Introduction

AI teams often document intended behaviors in policy notes, prompts, or product requirements, but turning those expectations into reliable evaluation suites is slow and difficult. Microsoft’s new open-source ASSERT framework aims to close that gap by converting plain-language specifications into runnable, inspectable evaluations for models, agents, and AI applications.

For security and governance teams, this matters because generic AI metrics like relevance or helpfulness do not always catch application-specific failures such as unsafe tool use, policy violations, or risky decision-making.

What is ASSERT?

ASSERT stands for Adaptive Spec-driven Scoring for Evaluation and Regression Testing. It is designed to make behavior specifications a direct input to AI evaluation, rather than relying on broad benchmarks that may miss real-world product requirements.

How the pipeline works

ASSERT uses four main stages:

  • Systematization: Converts a broad behavior requirement into a structured concept specification.
  • Taxonomization: Builds an editable taxonomy of permissible and impermissible behaviors.
  • Test generation: Creates stratified single-turn or multi-turn test cases across declared conditions such as persona, task type, tool access, or environment.
  • Scoring: Evaluates traces against the taxonomy and returns labels, rationales, policy citations, and failure patterns.

A key capability is instrumentation. ASSERT can capture not only final outputs, but also tool calls, retrieved context, routing decisions, and intermediate actions—important for agentic systems where the final answer alone may not explain risky behavior.

Validation results

Microsoft says internal validation showed stronger behavior-specific coverage than a direct generation baseline from the same written intent. According to the study, ASSERT:

  • Covered about 1.2x more of the intended behavior space
  • Surfaced roughly 1.5x more inspectable cases
  • Produced 4x stronger separation between stronger and weaker systems
  • Had about half as many saturated cases where all models behaved the same
  • Found about 2x more distinct failure patterns

For judge quality, LLM-to-human agreement was typically in the 80–90% range, with human inter-annotator agreement around 90%.

Why this matters for IT and security teams

Organizations deploying copilots, assistants, and agent workflows need repeatable ways to test policy compliance and behavioral boundaries. ASSERT could help teams validate scenarios such as:

  • Unsafe health or financial guidance
  • Tool-use governance violations
  • Task adherence failures
  • Restricted data handling issues
  • Policy drift during model or workflow updates

Next steps

Teams building internal AI apps or security-sensitive agents should review ASSERT as a possible framework for regression testing and policy validation. Because it is open source, organizations can adapt the taxonomy and test generation process to their own governance, compliance, and operational requirements.

Need help with Security?

Our experts can help you implement and optimize your Microsoft solutions.

Talk to an Expert

Stay updated on Microsoft technologies

ASSERTAI evaluationagent testingsecurityregression testing

Related Posts

Security

AI Activity Investigations: New Microsoft Playbook

Microsoft has published a new investigator playbook to help security teams reconstruct AI-related activity across Microsoft 365 Copilot and Azure AI services. The guidance brings together telemetry, KQL queries, schema references, and detection logic across Purview, Defender, and Sentinel so investigators can move from isolated signals to a clear incident timeline.

Security

AI Brand Phishing Campaigns Target Microsoft Users

Microsoft Threat Intelligence reports a rise in phishing, malvertising, and SEO-driven attacks that abuse popular AI brands like ChatGPT, Claude, Copilot, and DeepSeek as social engineering lures. The campaigns use familiar tactics such as urgent payment notices, fake policy violations, and malicious installers to steal credentials, payment data, and deploy malware, making user awareness and layered defenses critical.

Security

AI GitHub Actions Secret Exposure in Claude Code

Microsoft Threat Intelligence found that Anthropic’s Claude Code GitHub Action could expose CI/CD secrets when AI agents process untrusted GitHub content such as issues, pull requests, and comments. Anthropic fixed the issue in Claude Code 2.1.128, but the research highlights broader risks for any AI-enabled workflow with access to secrets, file reads, or outbound communication.

Security

Agentic AI Failure Modes Taxonomy Updated by Microsoft

Microsoft has updated its taxonomy of failure modes in agentic AI systems after a year of red teaming against real-world deployments. The v2.0 framework adds seven new risk categories and expanded mitigations, giving security teams a more practical model for assessing agentic AI threats such as MCP/plugin abuse, goal hijacking, and session context contamination.

Security

Red Hat npm Miasma Attack Hits CI/CD Supply Chains

Microsoft Threat Intelligence uncovered a large-scale npm supply chain attack involving trojanized packages under the @redhat-cloud-services scope. The campaign abused a compromised CI/CD publishing workflow to deliver credential-stealing malware targeting GitHub, npm, AWS, Azure, GCP, Kubernetes, and developer systems, making it especially relevant for security teams and DevOps administrators.

Security

Microsoft Build 2026 Security: Code, Agents, Models

At Microsoft Build 2026, Microsoft announced new security capabilities to protect code, AI agents, and models across the development lifecycle. Highlights include the expanded preview of MDASH for exploitability-focused vulnerability discovery and general availability of Microsoft Defender integration with GitHub Code Security to help teams prioritize and remediate real risks faster.