Unauthenticated endpoints
Production routes were reachable without the identity checks leaders assume exist.
An AI agent got attacker-level access to McKinsey's Lilli. This is not a SQL-injection story. It is an identity, permissions, and production-readiness story.
The exposed system included confidential files, user accounts, authentication tokens, and writable prompts. The dangerous part was not only data access. It was authority over the rules the agent followed.
CodeWall stopped at disclosure. A motivated adversary would not need to steal data only. They could reshape what the AI system recommends.
Plaintext client conversations across engagements.
System prompts that controlled agent behavior.
Consultants could receive altered guidance.
The breach moved from confidentiality to decision integrity.
The report describes 200+ documented API endpoints, including 22 that required no authentication. Some write paths were exposed too.
Production routes were reachable without the identity checks leaders assume exist.
Queries were built from request values, turning weak access control into data reach.
Ask what is enforced in production, not only what the platform supports in theory.
The old browser screen was a practical permissions layer. Agentic systems call APIs directly, which moves trust into code, tokens, scopes, and runtime policy.
| Decision area | SaaS era | Agent era | Board question |
|---|---|---|---|
| Primary actor | Humans click screens | Agents call APIs and tools directly | Can the system identify agent actors? |
| Permission boundary | Screen, role, workflow | Code, scopes, tokens, policies | Can defaults bypass review? |
| Operational proof | Vendor claim plus demo | Trace, audit, gates, revocation | Can a reviewer replay what happened? |
| Review timing | Architecture after purchase | Architecture before commitment | Who validates viability before signing? |
The market is moving toward implementation support, persistent context, governed tool catalogs, and API-native business systems.
| Signal | Move | Why it matters | Report read |
|---|---|---|---|
| Anthropic | Enterprise AI JV | Applied AI engineers embedded with customers | Deployment depth |
| OpenAI | The Development Company | Closer to enterprise deployment reality | Implementation ownership |
| SAP + WalkMe | Persistent enterprise AI | Real-time AI layer over business data | Runtime context |
| Pinecone Nexus | Compiled knowledge | Persistent context across agent sessions | Memory governance |
| Salesforce | Headless 360 | Full CRM through API, not browser UI | Agent permissions |
| ServiceNow | MCP registry | Governed and auditable agent tool catalog | Tool control plane |
Agent identity, permission boundaries, and auditability are not technical cleanup. They are business conditions for using AI safely.
Identity and audit belong on the strategy table, not the IT backlog.
Implementation viability must be proven before purchase, not after.
Technical reviewers belong in procurement day one.
These questions expose the gap between vendor capability and what actually happens when teams are rushed, defaults remain unchanged, and agents gain tool access.
Does your platform know the difference between a human and an agent?
Agents need narrower, task-scoped access than humans.
What happens when the team is under delivery pressure?
The gap appears when configuration is never revisited.
If the answers are vague, split across owners, or dependent on human memory, the deployment needs more control before it touches sensitive workflows.
Use narrower access, task-scoped permissions, and real-time revocation.
Capture traces, tool calls, policy checks, and environment context.
Require explicit evidence before production rollout and vendor commitment.
The organizations that avoid the next Lilli ask the identity, permission, and pressure-test questions before they deploy.
© 2026 Chander Dhall Methodworks, LLC. All rights reserved.