Follow Pragmatic Interventions to Keep Agentic AI in Check
Agentic AI shifts the conversation about AI from passive tools to autonomous systems that can plan, act and coordinate across services. Where large language models (LLMs) have excelled at generating text or answering prompts, agentic systems combine multiple specialized agents into workflows that take initiative, pursue goals and interact with APIs, data stores and human teams. This shift promises operational scale and speed, but it also widens the attack surface and complicates governance in ways business leaders can no longer ignore.
First Use Cases
Making the strategic case for agentic autonomy is straightforward. When designed responsibly, these systems can automate complex, multi-step processes that today require orchestration across people and tools. Incident response, threat hunting and routine decision-making are natural first use cases. An agentic system can run reconnaissance, synthesize findings, execute containment, and present a human-readable summary, shortening decision loops and freeing experts for higher-order tasks. The productivity potential is significant, particularly for organizations that already operate in software-driven, API-first environments.
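To make that workflow concrete, here is a minimal Python sketch of such an incident-response loop. The helper functions (run_recon, synthesize, contain) are hypothetical stand-ins for real integrations with log queries, an LLM summarizer and containment APIs.

```python
# Minimal sketch of an agentic incident-response loop. The helpers below are
# hypothetical placeholders, not real integrations.

def run_recon(alert: dict) -> list[str]:
    # Placeholder: query logs, enumerate affected hosts, gather indicators.
    return [f"host-{i} flagged for {alert['type']}" for i in range(2)]

def synthesize(findings: list[str]) -> dict:
    # Placeholder: an LLM or rules engine condenses raw findings.
    return {"severity": "high" if findings else "low", "findings": findings}

def contain(assessment: dict) -> list[str]:
    # Placeholder: isolate hosts, revoke tokens; restricted to known-safe actions.
    return [f"isolated: {f}" for f in assessment["findings"]]

def handle_incident(alert: dict) -> str:
    findings = run_recon(alert)
    assessment = synthesize(findings)
    actions = contain(assessment) if assessment["severity"] == "high" else []
    # A human-readable summary closes the loop for the on-call responder.
    return (f"Alert: {alert['type']}\n"
            f"Severity: {assessment['severity']}\n"
            f"Actions: {actions or ['none taken']}")

print(handle_incident({"type": "credential-stuffing"}))
```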
Pitfalls of Autonomy
Yet autonomy is a double-edged sword. The very capability that lets an agentic system act without human direction also increases the risk of unintended actions. Small specification errors or poorly calibrated reward functions can lead agents to pursue shortcuts that bypass safeguards. Multiple interacting agents also complicate traceability and accountability; when a decision emerges from a chain of agent interactions rather than a single model, understanding who or what is responsible becomes harder. These dynamics create new threat vectors. For instance, adversaries can repurpose agentic architectures for sustained, automated reconnaissance, tailor them for social engineering attacks, or run adaptive multi-step attacks that escalate privileges and propagate rapidly across interconnected systems.
Top Three Red Flags
Three technical concerns deserve particular attention. First, decision opacity. Agentic configurations often involve emergent behaviors that are not easily explained by inspecting a single component. That “black box” opacity can undermine internal audits, regulatory compliance and stakeholder trust.
Second, goal misalignment. Even minor misinterpretations of objectives or reward signals can produce unwanted outcomes, such as agents optimizing narrow proxy metrics at the expense of safety or compliance.
Third, adversarial misuse. A capability that finds and exploits process gaps autonomously is valuable to defenders and attackers alike. The same orchestration that accelerates incident response can also be inverted into a persistent, automated attack engine.
Five Pragmatic Interventions
Addressing these risks requires a pragmatic combination of engineering controls, governance and continuous validation. Start by mapping where agentic autonomy touches critical systems, data stores and third-party services, then apply the five interventions below.
- Treat agents as first-class service identities. Enforce least privilege, require short-lived credentials, and lock down the pathways through which they can act (a credential-scoping sketch follows this list). Precise, testable task specifications are essential: ambiguity in objectives invites improvisation, so embed hard constraints and explicit safety checks to limit unintended behavior.
- Make auditability non-negotiable. Immutable logs that capture agent inputs, actions and decision paths enable replay, post-incident forensics and regulatory scrutiny (a tamper-evident logging sketch follows this list). Favor systems that produce human-readable rationales and make it possible to trace an outcome back through the sequence of agent interactions that produced it. Those artifacts are the difference between an investigable incident and an opaque cascade of failures.
- Demand human oversight. For high-risk or policy-critical decisions, maintain human-in-the-loop governance so operators can intervene in real time, and require explicit human approval before irreversible actions (an approval-gate sketch follows this list). The notion of “automation everywhere” is seductive, but the appropriate balance between autonomy and human control depends on context, impact and tolerance for error.
- Red-team agentic behaviors. Traditional security testing focuses on static software or single-model inputs. Agentic systems demand adversarial simulations that explore multi-step exploit paths and the ways an attacker could chain actions to achieve illegitimate goals (see the attack-path search sketch after this list).
- Expand governance frameworks to cover the agent lifecycle. Policies that worked for LLMs or human-in-the-loop workflows often do not scale to agentic autonomy. Boards and risk managers should treat agentic deployments as higher-risk system integrations requiring cross-functional sign-off from security, legal, compliance and business owners.
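On the first intervention, here is a minimal sketch of an agent treated as a service identity: every action is checked against an explicit scope allowlist and a short-lived credential. The AgentCredential class, scope names and TTL are illustrative assumptions, not a real identity API.

```python
# Sketch of least privilege for agents: an explicit scope allowlist plus a
# short-lived credential. Names and TTL are illustrative assumptions.
import time
from dataclasses import dataclass

@dataclass(frozen=True)
class AgentCredential:
    agent_id: str
    scopes: frozenset        # least privilege: explicit action allowlist
    expires_at: float        # short-lived: expiry as epoch seconds

    def allows(self, action: str) -> bool:
        return time.time() < self.expires_at and action in self.scopes

def issue_credential(agent_id: str, scopes: set, ttl_s: int = 900) -> AgentCredential:
    # A 15-minute TTL forces frequent re-issuance and re-evaluation of scope.
    return AgentCredential(agent_id, frozenset(scopes), time.time() + ttl_s)

cred = issue_credential("ir-agent-1", {"tickets:read", "hosts:isolate"})
for action in ("hosts:isolate", "users:delete"):
    print(action, "->", "allowed" if cred.allows(action) else "denied")
```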
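For the auditability intervention, one simple way to make logs tamper-evident is a hash chain: each record hashes its predecessor, so any alteration breaks verification. The field names and layout below are illustrative.

```python
# Sketch of an append-only, tamper-evident audit trail via hash chaining.
import hashlib
import json

def append_record(log: list, agent: str, action: str, rationale: str) -> None:
    prev = log[-1]["hash"] if log else "genesis"
    body = {"agent": agent, "action": action, "rationale": rationale, "prev": prev}
    digest = hashlib.sha256(json.dumps(body, sort_keys=True).encode()).hexdigest()
    log.append({**body, "hash": digest})

def verify(log: list) -> bool:
    prev = "genesis"
    for rec in log:
        body = {k: rec[k] for k in ("agent", "action", "rationale", "prev")}
        digest = hashlib.sha256(json.dumps(body, sort_keys=True).encode()).hexdigest()
        if rec["prev"] != prev or rec["hash"] != digest:
            return False
        prev = rec["hash"]
    return True

log = []
append_record(log, "ir-agent-1", "isolate host-7", "matched known C2 indicator")
print(verify(log))          # True
log[0]["action"] = "noop"   # tampering is detectable
print(verify(log))          # False
```

In production, the chain head would also be anchored externally (for example, in a write-once store) so the entire log cannot be silently rewritten.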
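For human oversight, here is a sketch of an approval gate that holds irreversible or high-impact actions for explicit human sign-off. The IRREVERSIBLE set and the console prompt are placeholder assumptions; a real deployment would route to a ticketing or paging system.

```python
# Sketch of a human-in-the-loop gate: irreversible or high-impact actions are
# held for explicit approval instead of executing autonomously.
IRREVERSIBLE = {"delete_data", "rotate_all_keys", "block_subnet"}

def requires_approval(action: str, impact: str) -> bool:
    return action in IRREVERSIBLE or impact == "high"

def execute(action: str, impact: str, approver=input) -> str:
    if requires_approval(action, impact):
        answer = approver(f"Approve '{action}' (impact={impact})? [y/N] ")
        if answer.strip().lower() != "y":
            return f"'{action}' held for review"
    return f"'{action}' executed"

print(execute("restart_service", "low"))                        # runs autonomously
print(execute("block_subnet", "high", approver=lambda _: "n"))  # held for review
```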
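For red-teaming, one simple way to probe multi-step exploit paths is to model the environment as a graph of states and agent-available actions, then search for chains that reach an illegitimate goal. The toy transition table below is an illustrative assumption; real simulations would enumerate actions from the deployed agents' actual tool access.

```python
# Sketch of searching for multi-step exploit chains over a state-action graph.
from collections import deque

# state -> {action: next_state}; a deliberately tiny toy environment.
TRANSITIONS = {
    "user":       {"read_ticket": "user", "assume_role": "service"},
    "service":    {"query_db": "service", "write_config": "admin_path"},
    "admin_path": {"disable_logging": "compromised"},
}

def find_exploit_chain(start: str, goal: str):
    # Breadth-first search returns the shortest action chain reaching the goal.
    queue = deque([(start, [])])
    seen = {start}
    while queue:
        state, path = queue.popleft()
        if state == goal:
            return path
        for action, nxt in TRANSITIONS.get(state, {}).items():
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, path + [action]))
    return None

print(find_exploit_chain("user", "compromised"))
# ['assume_role', 'write_config', 'disable_logging']
```

Any chain the search surfaces is a concrete candidate for a control: break the chain by removing a scope, adding an approval gate, or alerting on the intermediate state.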
The opportunity before organizations is real: agentic AI can unlock new levels of automation, speed and insight. The distinguishing factor will be governance. Those who pair clear objectives with rigorous controls, traceability and adversarial validation will gain strategic advantage. Those who treat agentic systems as an incremental upgrade to existing models risk encountering novel failure modes that erode trust, expose data and create operational hazards.
The challenge is not to stop innovating but to design for it. Autonomy magnifies both capability and risk. Boards and technology leaders must therefore demand disciplined objectives, auditable decision paths and human oversight where it matters. With those guardrails in place, agentic AI can be a powerful accelerator of value rather than an unmanaged hazard.