Open Detection Format (ATR) Empowers Early Detection of AI Agent Threats Across SaaS Supply Chains

Open Detection Format (ATR) Enables Early Identification of AI Agent Threats like Prompt Injection and Credential Theft

What Happened — A new open‑source detection format, Agent Threat Rules (ATR), was released to surface AI‑agent‑specific attacks such as prompt injection, tool poisoning, and credential theft. The YAML‑based rule set (‑400+ rules) is evaluated by reference engines in TypeScript and Python and is already being used by Microsoft, Cisco, MISP/CIRCL and Gen Digital.

Why It Matters for TPRM —

AI‑driven services are increasingly embedded in third‑party SaaS platforms; undetected agent attacks can lead to data exfiltration or credential compromise.
ATR provides a standardized, vendor‑agnostic way to assess the security posture of AI‑agent integrations across the supply chain.
Early detection reduces the risk of downstream breaches that could affect multiple downstream customers.

Who Is Affected — Technology SaaS providers, cloud‑hosted AI platforms, MSPs offering AI‑assisted tooling, and any organization that integrates LLM‑powered agents (e.g., coding assistants, chatbot frameworks).

Recommended Actions —

Review whether your AI‑agent vendors support or can ingest ATR rule packs.
Validate that your security tooling can parse and enforce ATR YAML rules (or integrate the reference engines).
Pair ATR detection with sandboxed execution, credential‑brokering controls, and manual review for high‑risk agent actions.

Technical Notes —

ATR rules are versioned YAML documents that specify the attack pattern, the inspected input field (LLM prompt, tool‑call arguments, SKILL.md, etc.), and test cases.
Benchmark recall: 98 % on NVIDIA garak jailbreak corpus, 38.5 % on broader garak set, 66 % on hackaprompt; very low recall (0‑5 %) on academic adversarial sets (PromptBench, PromptInject, AdvBench, HarmBench).
Coverage gaps stem from regex‑based matching; semantic re‑phrasings evade detection.
The project is MIT‑licensed; reference engines are available in TypeScript and Python (pyATR).

Source: Help Net Security – Agent Threat Rules: Open detection rule format for AI agent security threats

Open Detection Format (ATR) Empowers Early Detection of AI Agent Threats Across SaaS Supply Chains

Open Detection Format (ATR) Enables Early Identification of AI Agent Threats like Prompt Injection and Credential Theft

Monitor Your Vendor Risk with LiveThreat™