How do AI agents authenticate without an API key?

Autonomous agents authenticate by getting their own identity in your identity provider, then exchanging it for a short-lived, scoped access token per task instead of carrying a static API key. The dominant pattern is OAuth 2.0 with an on-behalf-of (OBO) flow: a human consents to a specific scope, the agent presents that consent plus its own credential, and the token service issues a token that expires in minutes and only covers the granted scope. Microsoft Entra Agent ID and Federated Identity Credentials (FIC) implement this so the agent never holds a long-lived secret. The agent's identity is the principal in the audit log, not the human's shared key.

Why do static API keys fail for autonomous agents?

Static API keys fail for autonomous agents because they grant the full permission set of whoever issued them, with no per-action scoping, no granular revocation, and no delegation chain you can audit. When an agent runs hundreds of tool calls unattended, a single leaked key compromises everything that key can touch, and rotating it breaks every agent and integration using it at once. You also lose attribution: logs show the key acted, not which agent, on whose behalf, for what task. Across agent systems we've reviewed, key sprawl, the same secret copied into many tools, is the most common identity failure. Scoped, short-lived tokens contain the blast radius.

What is the OAuth on-behalf-of (OBO) flow for agents?

The OAuth on-behalf-of flow lets an agent act with a user's authority without ever holding the user's credentials. The user authenticates and consents to a specific scope (for example, read calendar but not send email). The agent presents that delegated consent along with its own identity credential to the token service, which issues a fresh access token scoped to exactly what the user approved. That token is short-lived, typically minutes, and bound to the agent's identity. Microsoft Entra Agent ID implements OBO for agents directly. The result is a verifiable delegation chain: this user authorized this agent to do this action, recorded in the token's claims and your audit trail.

What is Microsoft Entra Agent ID?

Microsoft Entra Agent ID is Microsoft's identity primitive that gives an AI agent its own first-class identity in Entra, separate from the human who runs it and separate from a service principal. With an Entra Agent ID, an agent becomes a principal you can assign scoped permissions to, issue short-lived tokens for, audit by name, and revoke independently without breaking other agents. It plugs into existing Conditional Access and on-behalf-of flows, so a human can delegate a narrow scope to a named agent rather than sharing an API key. It is the practical implementation of the persistent identity binding layer: the agent exists in your IdP, not just as a credential in a config file.

How do you build an audit trail for autonomous agent actions?

Build an audit trail by making the agent's own identity the principal in every action, then logging the full delegation chain and token claims, not just the API call. Each entry should answer four questions: which agent acted, on whose behalf (the delegating human or upstream agent), under what scope, and for which task or run ID. Short-lived scoped tokens make this natural, the token itself carries the agent identity, the granted scope, and an expiry, so your logs inherit attribution instead of showing a shared key. Tag every tool call with the agent ID and a correlation ID for the run. When something goes wrong, you can trace exactly what one agent did and revoke only that identity.

What is the difference between human-to-agent delegation and agent-to-agent trust?

Human-to-agent delegation is a person authorizing an agent to act with a slice of their permissions, captured through OAuth consent and an on-behalf-of token scoped to what the human approved. Agent-to-agent trust is one agent authorizing or invoking another, which needs the calling agent's own identity, a way to pass (or narrow) the delegated scope downstream, and a record of the chain. The risk in agent-to-agent trust is scope amplification: a downstream agent ending up with more authority than the original human granted. The fix is token exchange that can only narrow scope, never widen it, plus protocol-level identity (A2A) so each hop is authenticated and logged, not implicitly trusted.

How does agent identity relate to AP2 payment mandates?

Agent identity is the foundation that AP2 (Agent Payments Protocol) mandates build on. AP2 mandates are signed, verifiable authorizations that say an agent may spend up to a limit, with a merchant, within a window, the financial-transaction version of scoped delegation. They only mean something if the agent presenting the mandate has a real, verifiable identity and the delegation chain back to the human is auditable. So you implement the identity layer first (the agent's own credential, OAuth OBO, short-lived scoped tokens), and AP2 mandates sit on top as a domain-specific scope for payments. Without agent identity, a payment mandate is just another bearer secret that anyone who copies it can replay.

BLOG/AI SECURITY

Agent Identity: Why API Keys Break for Autonomous Agents

A static API key gives an autonomous agent your full permissions forever. Here's the agent identity stack (OAuth OBO, Entra Agent ID, FIC) that fixes it.

Sebastian MondragonJUNE 01, 2026 · 13 MIN READ

Agent Identity: Why API Keys Break for Autonomous Agents

Hand an autonomous agent a static API key and you have handed it your entire permission set, forever, with no way to see what it did with that authority. This is the core agent identity problem, and it is not a hypothetical. When an agent runs a few hundred unattended tool calls against your CRM, your cloud, and three internal APIs, the key it carries is a single bearer secret that grants everything that key can touch. There is no per-action scoping, no granular revocation, and no delegation chain you can audit after the fact.

We got away with API keys for a decade because the thing holding the key was a deterministic service you wrote and reviewed. An autonomous agent is neither deterministic nor fully reviewed at runtime. It decides which tools to call, in which order, against which resources, often after reading untrusted content that can steer it. Giving that thing a long-lived, broadly scoped credential is the access-control equivalent of giving a contractor the master key to the building and never checking the logs. The industry is now converging on a real answer, and it looks a lot like the OAuth and identity machinery we already use for humans and services, adapted for a principal that acts on someone's behalf.

This is a technical deep-dive on what that answer actually is: the three-layer model (communication protocol, credential standard, persistent identity binding), how OAuth 2.0 on-behalf-of flows and Federated Identity Credentials replace static keys with short-lived scoped tokens, what Microsoft Entra Agent ID gives you in practice, and how human-to-agent delegation differs from agent-to-agent trust. We will also cover the audit trail you need for autonomous actions and where this overlaps with AP2 payment mandates. At Particula Tech we design this layer for production agent systems, and the failure modes below are the ones that bite teams first.

01 · Why Static API Keys Break for Autonomous Agents

A static API key is a long-lived bearer secret. Whoever holds it can do everything the key is authorized to do, until someone notices and rotates it. That model has three structural failures for agents.

No per-action scoping. An API key is usually issued at the granularity of an application or a user, not an action. So the key your agent carries to "check the calendar" is frequently the same key that can send email, delete events, or read every other user's data, depending on how the upstream API scopes things. The agent gets the union of everything the key permits, even when the task needs one read.

No granular revocation. When you suspect one agent is compromised or misbehaving, your only lever is to rotate the shared key. That breaks every agent, integration, and cron job using that key simultaneously. Revocation is all-or-nothing, which means in practice teams hesitate to revoke at all, exactly when speed matters most.

No auditable delegation chain. The log shows the key acted. It does not show which agent acted, on whose behalf, under what authority, for which task. When an agent does something it should not have, you cannot answer "who authorized this" because the key erased the chain the moment it was copied into a config file.

Then there is key sprawl. The same secret gets pasted into the agent runtime, a tool wrapper, an MCP server config, a CI variable, and a teammate's local .env. Across agent systems we've reviewed, this duplication is the single most common identity failure, and it is the one static keys actively encourage because copying a string is the path of least resistance. Every copy is another place the key can leak and another reason nobody wants to rotate it.

The fix is not "a better key." It is to stop treating the credential as the identity. The agent should have its own identity, and the credential it presents at runtime should be a short-lived token minted for a specific task with a specific scope. This is the same shift we made for access control generally, and it maps cleanly onto the patterns in role-based access control for AI applications: the principal is durable, the grant is scoped, and the token is ephemeral.

02 · The Three-Layer Model for Agent Identity

Agent identity is not one spec. It is converging into three distinct layers that solve three different problems, and conflating them is why early implementations feel muddy.

The communication protocol layer is about the wire: when agent A calls agent B, or an agent calls a tool server, the call is authenticated and the identities on each end are known. A2A is the protocol slot here. If you have read our comparison of MCP vs A2A vs AG-UI agent protocols, this is the same A2A: it carries identity between agents rather than assuming an open, trusted mesh.

The credential standard layer is what the agent hands over to prove it may do a thing right now. This is where OAuth 2.0 lives, extended with Federated Identity Credentials (FIC) and token exchange so the agent never holds a long-lived secret. Instead it presents its own identity (or a federated assertion) and gets back a token scoped to exactly the task, expiring in minutes.

The persistent identity binding layer answers "who is this agent" in a way that survives across tasks, tokens, and restarts. This is the agent's account in your identity provider. Microsoft Entra Agent ID is the most concrete implementation today. Without this layer, you have credentials floating free with nothing durable to attach scopes, audit entries, and revocation to.

Get all three and you have something coherent: a named agent (binding) that authenticates each hop (protocol) by presenting short-lived scoped tokens (credential). Skip the binding layer and you are back to keys with extra steps.

Layer	What it solves	Representative standard	What you implement
Communication protocol	How agents and services talk and authenticate each hop	A2A (Agent-to-Agent)	Authenticated message transport between agents
Credential standard	What the agent presents to prove authority for an action	OAuth 2.0 + Federated Identity Credentials, token exchange	OBO flows, short-lived scoped tokens
Persistent identity binding	Who the agent is, durably, in your identity provider	Microsoft Entra Agent ID (and emerging IdP primitives)	A first-class agent principal you can scope, audit, revoke

03 · OAuth 2.0, Federated Identity Credentials, and Short-Lived Tokens

The credential layer is the one with the most mature tooling, because OAuth 2.0 already solved most of it for human and service identities. The adaptation for agents is mostly about getting tokens that are short-lived, narrowly scoped, and bound to an agent identity rather than copied around as a static secret.

A leaked short-lived token is a small, expiring problem. A leaked static key is a standing liability. That difference is the entire reason to do the extra work.

Short-lived scoped tokens vs long-lived keys

The contrast is the whole argument. Hold a long-lived key and you optimize for convenience at the cost of blast radius. Mint a short-lived scoped token per task and you flip the tradeoff.

Federated Identity Credentials and token exchange

Federated Identity Credentials (FIC) let an agent authenticate with an identity provider using a trust relationship instead of a stored secret. The agent presents a federated assertion (an OIDC token from a trusted issuer), and the IdP exchanges it for an access token. No client secret sits in the agent's config. This is the same pattern that killed long-lived cloud credentials in CI: trust the workload's federated identity, mint a token on demand, never store a secret. Token exchange (RFC 8693) is the mechanism underneath OBO and the safe path for agent-to-agent calls. The critical rule: token exchange should only ever narrow scope, never widen it. If a human granted "read calendar," any downstream token the agent requests for a sub-agent must be a subset of that, never a superset. Enforcing scope-narrowing-only at the token service is how you prevent the scope amplification failure we cover below.

Property	Static API key	Short-lived scoped token
Lifetime	Months to never	Minutes
Scope	Whatever the key permits (broad)	Exactly the granted action
Revocation	Rotate, breaks everyone	Expires automatically; revoke the issuing grant
Attribution	"The key acted"	"This agent, on this user's behalf, for this task"
Leak blast radius	Everything, until noticed	One task's scope, until expiry
Key sprawl risk	High (copied everywhere)	Low (minted on demand, not stored)

04 · Microsoft Entra Agent ID and the On-Behalf-Of Flow

Microsoft Entra Agent ID makes the persistent identity binding concrete. It gives an AI agent its own first-class identity in Entra, distinct from the human who runs it and distinct from a generic service principal. Once an agent is a principal, everything else becomes ordinary identity management: you assign it scoped permissions, issue short-lived tokens for it, audit it by name, apply Conditional Access, and revoke it independently without breaking other agents.

The on-behalf-of (OBO) flow is where this earns its keep. OBO lets an agent act with a user's authority without ever touching the user's credentials. The sequence:

The user authenticates and consents to a specific scope. Not "access my account," but "read my calendar" while explicitly not granting "send email."

The agent presents that delegated consent plus its own Entra Agent ID credential to the token service.

The token service issues a fresh access token scoped to exactly what the user approved, bound to the agent's identity, expiring in minutes.

The agent calls the downstream API with that token. The API sees a scoped, short-lived credential, and the token's claims record both the agent and the delegating user.

The payoff is a verifiable delegation chain encoded in the token itself: this user authorized this named agent to perform this scope. That is exactly the attribution a static key destroys. It also slots into a zero-trust posture cleanly, which is the whole premise of Microsoft's ZT4AI zero-trust model for AI agents: never assume the agent is trusted, verify identity and scope on every action, and assume the agent can be compromised by what it reads.

FIDO opened an agentic authentication working group in 2026, which is a strong signal that the credential and binding layers are headed toward cross-vendor standards rather than staying a single-cloud feature. Build to the OAuth and token-exchange primitives now and you are positioned for whatever the standards bodies converge on, because Entra Agent ID is itself implemented on those primitives.

05 · Human-to-Agent Delegation vs Agent-to-Agent Trust

These are two different trust relationships, and the second is where most teams get hurt.

Human-to-agent delegation is a person authorizing an agent to act with a slice of their permissions. OAuth consent plus an OBO token captures it precisely: the human approves a scope, the agent gets a token limited to that scope, and the chain is one hop deep and easy to reason about. This is the well-trodden path.

Agent-to-agent trust is one agent invoking or authorizing another. A coordinator spawns a worker; a planning agent calls a research agent; an MCP-fronted tool is itself an agent. Now you have a chain, and chains amplify risk. The danger is scope amplification: a downstream agent ending up with more authority than the original human ever granted, because each hop "helpfully" requested a broader token to be safe.

Three rules contain it:

Identity at every hop. Each agent presents its own credential. A2A at the protocol layer means every call is authenticated and logged, not implicitly trusted because it came from inside the mesh. Implicit intra-mesh trust is the agent version of a flat network, and it fails the same way.

Scope can only narrow. Token exchange downstream must produce a subset of the upstream scope. The token service enforces this; the agents do not get to ask for more. If the human granted read-only, no descendant in the chain can obtain write.

The chain is recorded. Every delegation hop lands in the audit trail with the delegating principal, the receiving agent, the scope, and the task. When something goes wrong four hops deep, you can reconstruct who authorized what.

A useful mental model: human-to-agent delegation is signing a permission slip; agent-to-agent trust is that slip being photocopied down a chain where each copy must be a redaction, never an addition. The moment a copy can add a permission, the original signature means nothing.

06 · Audit Trails and Killing Key Sprawl

The entire point of giving agents real identities is that you can finally answer "who did this and under what authority" for autonomous actions. That requires deliberate logging, not just turning on identity.

Every audit entry for an agent action should answer four questions:

Which agent acted (the agent identity / Entra Agent ID), not a shared key.

On whose behalf (the delegating human, or the upstream agent in the chain).

Under what scope (the exact grant the token carried).

For which task (a correlation or run ID tying the action to a higher-level objective).

Short-lived scoped tokens make this nearly free, because the token already carries the agent identity, the scope, and an expiry. Your logs inherit attribution instead of you bolting it on. Tag every tool call with the agent ID and a run-level correlation ID, and you can reconstruct a full session: this run, by this agent, on this user's behalf, touched these resources under these scopes. When an agent does something wrong, you revoke that one identity and trace exactly that one chain, no shared-key blast radius.

This is also the structural fix for key sprawl. Tokens are minted on demand for a task and discarded; there is no long-lived secret to copy into five config files. The agent authenticates via its identity (ideally a federated credential, so not even a client secret is stored), and the runtime requests a scoped token when it needs one. The credential surface shrinks from "a static key in every tool" to "an identity in your IdP." This is the same defense-in-depth instinct that drives a production MCP server security hardening checklist, where every tool server gets least-privilege scopes rather than the agent's full credential, and the same principle behind preventing data leakage in AI applications: minimize what any single component can reach. For the strategic framing of why this matters across the whole stack, our AI security pillar ties identity into the broader threat model.

07 · Where Agent Identity Overlaps AP2 Mandates

Payments are where scoped delegation gets a name and a signature. The Agent Payments Protocol (AP2) introduces mandates: signed, verifiable authorizations that say an agent may spend up to a limit, with a particular merchant, within a window. A mandate is, structurally, a domain-specific scope for the action "spend money."

The dependency runs one direction. AP2 mandates only mean anything if the agent presenting them has a real, verifiable identity and the delegation chain back to the human is auditable. A mandate is a scoped grant; a scoped grant requires a principal to grant it to. So you implement the identity layer first, the agent's own credential, OAuth OBO, short-lived scoped tokens, scope-narrowing token exchange, and AP2 mandates sit on top as the payment-flavored scope.

Get the order wrong and a payment mandate becomes just another bearer secret: copy it, replay it, spend with it. Get it right and a mandate is a signed, identity-bound, expiring authorization with a delegation chain you can audit, which is exactly what moving real money demands. The same three-layer model carries through: A2A so the payment request is authenticated hop to hop, OAuth and token exchange so the mandate is a narrowed scope and not a master key, and a persistent agent identity so the mandate has a principal it provably belongs to.

08 · What a Developer Actually Implements

Strip away the standards-body framing and here is the concrete build order for production:

Give every agent its own identity. Not a shared service account, not the human's key. An Entra Agent ID, or the equivalent principal in your IdP. This is the binding layer and it comes first.

Replace stored keys with federated credentials. Use FIC so the agent authenticates via a trust relationship and never stores a client secret. Kill the static key at the source.

Mint short-lived scoped tokens per task via OBO. A human consents to a narrow scope; the agent gets a minutes-long token bound to its identity for exactly that scope. Never request more than the task needs.

Enforce scope-narrowing-only token exchange for agent-to-agent calls. Downstream tokens are subsets, never supersets. Enforce it at the token service, not in agent code.

Authenticate every hop with A2A and log the full chain. Which agent, on whose behalf, under what scope, for which task. Tag tool calls with agent ID and a run correlation ID.

Revoke by identity, not by rotating a shared secret. When one agent misbehaves, disable that one principal and trace that one chain.

None of this is exotic. It is the identity discipline we already apply to humans and workloads, pointed at a new principal that makes hundreds of decisions per minute. Teams that treat an autonomous agent as a first-class identity, with its own credential, scoped delegation, and an audit trail, contain incidents to a single task and a single principal. Teams still pasting one API key into every tool will learn, the hard way, that they gave a non-deterministic system the master key and threw away the logs.

API keys did not break because they were badly designed. They broke because the thing holding them stopped being a program you wrote and became an agent that decides. Identity is how you give that decision-maker exactly the authority it needs, for exactly as long as it needs it, and not one scope more.

09 · FAQ

Quick answers to the questions this post tends to raise.

BLOG/AI SECURITY

Agent Identity: Why API Keys Break for Autonomous Agents

A static API key gives an autonomous agent your full permissions forever. Here's the agent identity stack (OAuth OBO, Entra Agent ID, FIC) that fixes it.

Sebastian MondragonJUNE 01, 2026 · 13 MIN READ

01 · Why Static API Keys Break for Autonomous Agents

02 · The Three-Layer Model for Agent Identity

Agent identity is not one spec. It is converging into three distinct layers that solve three different problems, and conflating them is why early implementations feel muddy.

Layer	What it solves	Representative standard	What you implement
Communication protocol	How agents and services talk and authenticate each hop	A2A (Agent-to-Agent)	Authenticated message transport between agents
Credential standard	What the agent presents to prove authority for an action	OAuth 2.0 + Federated Identity Credentials, token exchange	OBO flows, short-lived scoped tokens
Persistent identity binding	Who the agent is, durably, in your identity provider	Microsoft Entra Agent ID (and emerging IdP primitives)	A first-class agent principal you can scope, audit, revoke

03 · OAuth 2.0, Federated Identity Credentials, and Short-Lived Tokens

A leaked short-lived token is a small, expiring problem. A leaked static key is a standing liability. That difference is the entire reason to do the extra work.

Short-lived scoped tokens vs long-lived keys

The contrast is the whole argument. Hold a long-lived key and you optimize for convenience at the cost of blast radius. Mint a short-lived scoped token per task and you flip the tradeoff.

Federated Identity Credentials and token exchange

Property	Static API key	Short-lived scoped token
Lifetime	Months to never	Minutes
Scope	Whatever the key permits (broad)	Exactly the granted action
Revocation	Rotate, breaks everyone	Expires automatically; revoke the issuing grant
Attribution	"The key acted"	"This agent, on this user's behalf, for this task"
Leak blast radius	Everything, until noticed	One task's scope, until expiry
Key sprawl risk	High (copied everywhere)	Low (minted on demand, not stored)

04 · Microsoft Entra Agent ID and the On-Behalf-Of Flow

The on-behalf-of (OBO) flow is where this earns its keep. OBO lets an agent act with a user's authority without ever touching the user's credentials. The sequence:

The user authenticates and consents to a specific scope. Not "access my account," but "read my calendar" while explicitly not granting "send email."

The agent presents that delegated consent plus its own Entra Agent ID credential to the token service.

The token service issues a fresh access token scoped to exactly what the user approved, bound to the agent's identity, expiring in minutes.

The agent calls the downstream API with that token. The API sees a scoped, short-lived credential, and the token's claims record both the agent and the delegating user.

05 · Human-to-Agent Delegation vs Agent-to-Agent Trust

These are two different trust relationships, and the second is where most teams get hurt.

Three rules contain it:

06 · Audit Trails and Killing Key Sprawl

Every audit entry for an agent action should answer four questions:

Which agent acted (the agent identity / Entra Agent ID), not a shared key.

On whose behalf (the delegating human, or the upstream agent in the chain).

Under what scope (the exact grant the token carried).

For which task (a correlation or run ID tying the action to a higher-level objective).

07 · Where Agent Identity Overlaps AP2 Mandates

08 · What a Developer Actually Implements

Strip away the standards-body framing and here is the concrete build order for production:

Give every agent its own identity. Not a shared service account, not the human's key. An Entra Agent ID, or the equivalent principal in your IdP. This is the binding layer and it comes first.

Replace stored keys with federated credentials. Use FIC so the agent authenticates via a trust relationship and never stores a client secret. Kill the static key at the source.

Enforce scope-narrowing-only token exchange for agent-to-agent calls. Downstream tokens are subsets, never supersets. Enforce it at the token service, not in agent code.

Authenticate every hop with A2A and log the full chain. Which agent, on whose behalf, under what scope, for which task. Tag tool calls with agent ID and a run correlation ID.

Revoke by identity, not by rotating a shared secret. When one agent misbehaves, disable that one principal and trace that one chain.

09 · FAQ

Quick answers to the questions this post tends to raise.