Skip to main content
Back to reports Slide brief
Forward Deployed Engineering

Forward Deployed Engineers: The Human Edge in the Age of AI

Forward Deployed Engineering . Enterprise AI . May 2026

Forward Deployed Engineers: The Human Edge in the Age of AI

The entire valley is talking about Forward Deployed Engineers. Not because AI does not work, but because AI alone does not work at enterprise scale. Ninety-five percent of enterprise GenAI projects produce zero measurable ROI. The fix is not a better model. It is a better human, embedded in your stack.

CD
Chander DhallBuilder . Leader . Speaker
Published May 2026 Reading time . 20 min Executive Report
95%
Enterprise GenAI pilots producing zero ROI (MIT Project NANDA, 2025)
18 min
Time from first seeing the relevant code to deployed fix inside a 30-minute meeting
97%
Cosmos DB RU reduction: 9,000 RUs to 300 RUs in a single session
$3.4B
Uber's AI budget burned in 4 months. COO says the ROI link is "very hard to draw" (May 2026)

Executive Summary . What This Report Covers

  • AI productivity gains are real, but 95% of enterprise GenAI pilots produce zero measurable ROI because the gap between demo and production is not a model problem. It is a human and architecture problem.
  • The probabilistic nature of LLMs collides directly with the deterministic requirements of enterprise systems: billing, compliance, audit trails, financial reconciliation, and regulated workflows cannot tolerate non-deterministic outputs.
  • Forward Deployed Engineers are the market's answer: senior practitioners embedded in your team who own outcomes, not tickets, and who bridge AI fluency with deep enterprise systems expertise.
  • In a live production crisis, a team using AI tools across their entire stack still had not surfaced the relevant code by minute 12 of a 30-minute call. Once the code was visible, the problem was solved and deployed in the remaining 18 minutes: 9,000 Azure Cosmos DB RUs reduced to 300.
  • Chander Dhall's FDE practice exists precisely for this moment: top-1% engineering caliber, compliance-aware, bias toward shipping, and honest enough to tell you 300 RUs is not yet good enough.
01 The FDE Moment

The entire valley is talking about Forward Deployed Engineers. Here is why.

AI demos dazzle. AI in production disappoints. The Forward Deployed Engineer is the answer the market is converging on, and the companies that understand this first will own the next five years.

Palantir invented the role. OpenAI is hiring them at $400K base. Most enterprises still do not know what one does. The term "Forward Deployed Engineer" was popularized by Palantir, whose model of embedding senior engineers directly inside customer operations became the defining competitive advantage of one of the most controversial and successful enterprise software companies of the past two decades. Today, OpenAI, Anthropic, Cursor, Decagon, and Sierra are all hiring FDEs aggressively. The role is no longer a Palantir-specific innovation. It is the market's answer to a structural problem.

That problem is this: AI works. AI in your enterprise, with your data, your compliance requirements, your edge cases, and your latency budgets, is a different problem entirely. The gap between a compelling demo and a production system that moves your P&L is not bridged by a better model. It is bridged by a human of rare caliber who understands both the technology and the enterprise context deeply enough to make the two work together.

"AI works. AI in your enterprise, with your data, your compliance, your edge cases, your latency budgets, is a different problem entirely. That problem is solved by humans of rare caliber, not by more models." Chander Dhall . Forward Deployed Engineering

The productivity gains from AI are genuinely extraordinary. Uber burned through its entire $3.4 billion annual AI budget in four months. Seventy percent of committed code at Uber is now AI-generated. Ninety-five percent of engineers use AI tools monthly. CEO Dara Khosrowshahi has described the tools as creating "employees with superpowers" and confirmed the company is deliberately slowing traditional hiring to invest in AI instead. These are not marketing claims. They are operational realities at one of the world's most technically sophisticated companies.

But here is the part of the Uber story that does not make the headlines: Uber COO Andrew Macdonald admitted publicly in May 2026 that the AI spending is "getting harder to justify." His exact words: "That link is not there yet, right? I think maybe implicitly there is more that is getting shipped, but it's very hard to draw a line between one of those stats and, 'Okay, now we're actually producing 25 percent more useful consumer features.'" Uber burned its entire AI budget in four months. Seventy percent of code is AI-generated. And the COO cannot draw a straight line from that spend to consumer value. That is the Uber story. That is the enterprise AI story. More code shipped does not mean more value delivered.

And yet, across the broader enterprise landscape, the story is almost the opposite. The same AI tools that are delivering extraordinary productivity at companies with deep technical infrastructure are producing zero measurable return at 95% of the organizations that deploy them. The difference is not the model. It is the human expertise applied to it. That is precisely what a Forward Deployed Engineer provides, and it is why the entire valley is talking about them now.

Why Now

The FDE conversation accelerated in 2025 and 2026 because the gap between AI capability and enterprise AI outcomes became impossible to ignore. The models improved dramatically. The failure rate did not. The market concluded, correctly, that the constraint was never the model.

02 The 95% Problem

Why Enterprise AI Projects Fail at Scale

The data is unambiguous and consistent across every major research institution that has studied it. Enterprise AI is failing at a rate that alarms boards and CTOs who have approved GenAI budgets.

MIT Project NANDA, July 2025
95%
GenAI pilots produced zero measurable P&L impact across 847 enterprise deployments tracked over 12 months.
RAND Corporation, 2024
80%+
Enterprise AI projects fail to deliver intended business value, at twice the failure rate of non-AI IT projects, per meta-analysis of 2,400+ projects.
Gartner, Q4 2024
30%
GenAI projects will be abandoned after proof of concept by end of 2025 due to poor data quality, inadequate risk controls, and unclear business value.
McKinsey State of AI, 2025
78%
Of organizations use AI in at least one function, yet fewer than 30% see material EBIT impact. Adoption is universal. Value is not.

These numbers are not outliers. They are consistent across methodologies, industries, and geographies. The question is not whether enterprise AI is failing. It is why it is failing at this rate despite genuine technological capability, and what the 5% who succeed are doing differently.

The failure causes are not mysterious. They are well-understood by anyone who has spent serious time inside enterprise AI deployments. Your teams are likely experiencing several of them right now:

  • Probabilistic outputs colliding with deterministic requirements: The same prompt produces different answers every time. This is acceptable for ideation. It is unacceptable for billing, compliance, audit trails, and financial reconciliation.
  • Hallucinations in regulated workflows: Finance, healthcare, and legal workflows cannot tolerate confident fabrication. The model does not know what it does not know.
  • Token economics ignored until production: A query that costs $0.002 in a demo costs $2.00 at enterprise scale. Latency and cost-per-query are architecture decisions, not afterthoughts.
  • RAG pipelines built without understanding the data store: A retrieval-augmented generation pipeline is only as good as the indexing strategy behind it. Most teams build the pipeline before they understand the data.
  • No evals, no observability, no rollback strategy: You cannot improve what you cannot measure. Most enterprise AI deployments have no systematic evaluation harness and no way to detect silent degradation.
  • Generic consultants who can prompt but cannot debug: There is a significant difference between someone who can write a system prompt and someone who can diagnose why a Cosmos DB query is consuming 9,000 Request Units.
  • "Vibe coding" at scale: AI-assisted development tools are extraordinary productivity multipliers for engineers who understand the code. They produce unmaintainable, inscrutable systems when used by teams who do not.
The Pattern

The average failed enterprise AI pilot costs $2.1 million in direct spend and 8.4 months of engineering time, per MIT Project NANDA methodology. The cost is not just financial. It is organizational credibility, engineering morale, and the window of competitive opportunity that closes while the pilot stalls.

The 5% that succeed share one trait: they have people embedded in the work who understand both transformer architectures and enterprise data lineage. They have engineers who can read a query plan, diagnose a latency spike, and architect a deterministic guardrail around a probabilistic model. That combination is rare. It is what a Forward Deployed Engineer brings.

03 The Core Tension

Probabilistic vs. Deterministic: Why AI and Enterprise Systems Collide

Ask an LLM the same question twice. You will get two different answers. This is not a bug. It is a fundamental property of how these systems work. And it is directly incompatible with how enterprise systems are designed.

Enterprise systems are built on determinism. The same input must produce the same output, every time, without exception. ACID transactions, idempotent operations, deterministic state machines, audit trails that reconstruct exactly what happened and why: these are not preferences. They are the architectural foundation of every regulated, high-stakes enterprise system ever built.

Large language models are probabilistic by design. Temperature settings, sampling strategies, and the stochastic nature of transformer inference mean that the same prompt will produce meaningfully different outputs across runs. This is a feature when you are brainstorming. It is a critical failure mode when you are processing a financial transaction, generating a compliance report, or making a clinical recommendation.

Enterprise Systems Require
LLMs Provide
Deterministic outputs: Same input, same output, always
Probabilistic outputs: Same input, different output, by design
Auditable reasoning: Reconstruct every decision
Opaque reasoning: Attention weights are not an audit trail
Bounded failure modes: Errors are predictable and recoverable
Unbounded failure modes: Hallucinations are confident and silent
Compliance-ready: HIPAA, SOC 2, PCI-DSS, GDPR by default
Compliance-naive: Requires architectural guardrails at every layer
Cost-predictable: Query costs are known at design time
Cost-variable: Token consumption scales with input complexity

Wide-open agent architectures compound this problem. When you chain multiple LLM calls together, each with its own probabilistic variance, the error surface area multiplies with every hop. An agent that is 95% reliable on each individual step is only 77% reliable across five steps. At ten steps, it is 60%. This is not a theoretical concern. It is the operational reality of every agentic system in production today.

The fix is not "better prompts." The fix is architectural: deterministic guardrails around probabilistic components, structured output schemas that constrain model responses, validation layers that catch hallucinations before they propagate, human-in-the-loop checkpoints where stakes are high, and fallback paths that handle model failure gracefully. None of this is in the model. All of it requires an engineer who understands both sides of the boundary.

Compliance Reality

HIPAA, SOC 2, PCI-DSS, GDPR, and the EU AI Act share one property: none of them accept "the model said so" as a defense. Compliance in AI systems is an architectural property, not a policy document. It must be designed in from the first commit, not retrofitted after the audit finding.

04 The FDE Role

What a Forward Deployed Engineer Actually Does

The term is being used loosely. Let us be precise. A Forward Deployed Engineer is not a consultant, not a staff augmentation resource, and not a solution architect. The distinction matters enormously.

A Forward Deployed Engineer is embedded in your engineering team. Not parachuting in for a discovery workshop and a slide deck. Not reviewing your architecture from a distance and writing a recommendation report. Embedded: in your Slack, in your codebase, in your pull request queue, in your incident bridge when things break at 2am.

The defining characteristics of a true FDE are:

  • Full-stack technical depth: From frontend rendering to data store internals to model fine-tuning. An FDE who cannot read a Cosmos DB query plan or interpret a RAG retrieval trace is not an FDE. They are a consultant with a different job title.
  • Product instinct: Knowing what to build, not just how to build it. The ability to look at a failing system and identify the real problem, which is almost never the problem the customer is pointing at.
  • Speed as a value system: Shipping in days, not quarters. The 18-minute fix inside a 30-minute meeting is not a party trick. It is a reflection of how an FDE thinks about time. Every hour of downtime, every week of stalled deployment, has a real cost that the FDE feels personally.
  • Diagnostic skill under pressure: The ability to navigate an unfamiliar codebase, identify the critical path, and produce a working solution in a constrained window. This is the skill that separates FDEs from everyone else.
  • Bridge fluency: Translating between business stakeholders, ML researchers, platform teams, and compliance officers. The FDE is the person in the room who everyone can talk to.
  • Outcome ownership: An FDE owns the result, not the ticket. If the system is not working, the FDE is not done.
Forward Deployed Engineer
Embedded. Owns outcomes. Ships.
  • In your codebase from day one
  • Full-stack: frontend to data store to model
  • Diagnoses the real problem, not the stated one
  • Ships in days, measures in outcomes
  • Compliance-aware by default
  • Honest: tells you when 300 RUs is not good enough
What an FDE Is Not
Advisory. Ticket-driven. Slide-first.
  • Traditional consultant: Slide-driven, recommendation-driven, no code
  • Staff augmentation: Executes tickets, no strategic ownership
  • Solution architect: Designs but does not build or own outcomes
  • ML engineer: Model-focused, weak on production systems and enterprise data
  • AI prompt engineer: Can write system prompts, cannot debug a query plan
I am sure they had used AI long before contacting me, because they had this problem for months. Twelve minutes into the call, I still had not seen the code. Chander Dhall . Live Production Incident, 2026

To be precise about what an FDE is not: an FDE is not a consultant, not staff augmentation, not a system integrator, and not a solution architect. They ship code, own the outcome, and leave when the system runs without them. The distinction is not semantic. It is the difference between a recommendation and a result.

The market is beginning to understand this distinction. The companies that are winning with AI are not the ones with the most AI tools. They are the ones with the best humans applying those tools. The FDE is the highest-leverage investment a technical organization can make in its AI program right now.

05 Case Study

From 9,000 RUs to 300 in 18 Minutes

This is not a story about a clever optimization. It is a story about what human expertise, applied under pressure, can accomplish when AI tools alone have already failed.

Live Production Incident

The Setup

An enterprise team was preparing to go to production. They had been carrying this problem for months before contacting Chander, and they had almost certainly tried AI-assisted development, AI code review, and AI-generated documentation along the way. By any reasonable measure, they were an AI-forward engineering organization.

A critical query on Azure Cosmos DB was consuming more than 9,000 Request Units per execution. At production scale, this was economically unsustainable and operationally dangerous. The team had been working the problem. The tools had not turned that work into a production fix.

A 30-minute call was scheduled. Twelve minutes in, the team still had not been able to get the relevant code in front of Chander. The production launch was at risk.

Chander Dhall had not yet seen the code.

9,000+
Cosmos DB RUs consumed per query before the call
12 min
Into the call before Chander could see the relevant code
18 min
From first seeing the relevant code to deployed fix
300 RUs
After fix, deployed to production before the call ended

The Turn

Chander Dhall stepped in: "We are going to fix this."

One engineer on the call understood the codebase. Not the AI tools. Not the documentation. The actual codebase, the data model, the indexing strategy, the query patterns. He navigated to the right files. The combination of that insider knowledge and Chander's diagnostic expertise produced the diagnosis in minutes. Chander identified the fix, drove the implementation path, and got the team to production before the call ended.

Once the code was visible, the fix was implemented, tested, and deployed in the remaining 18 minutes. Before the 30-minute call ended, a query that had been consuming 9,000 Request Units was running at 300. The production launch was back on schedule.

What This Means

Chander's assessment afterward: "Three hundred RUs is not very good. But it solved the purpose." That sentence is the most important part of this story. A lesser engineer celebrates the win. An FDE notes the win, ships the fix, and flags the next optimization target. The standard does not move because the crisis passed.

Five lessons from this incident that apply directly to your organization:

  • AI had already been tried. The team had been working the problem for months before contacting Chander. AI tools amplify human expertise. They do not replace it. When the right expertise is absent, the tools do not create the answer.
  • Time is the asset. Thirty minutes prevented weeks of escalation, a delayed launch, and the organizational cost of a visible production failure. The FDE's value is not measured in hours billed. It is measured in crises averted and launches shipped.
  • The codebase navigator matters. AI cannot replace the engineer who knows where the bodies are buried. The insider who understood the data model was as critical as the diagnostic expertise Chander brought. FDEs work best when there is at least one person on the client side who can navigate the system.
  • Honesty is a technical skill. Saying "300 RUs is not very good" after solving a crisis is not modesty. It is the discipline that separates engineers who maintain standards from engineers who celebrate adequacy. A strong FDE makes good enough uncomfortable.
  • Human ingenuity is non-negotiable. As Top Gun: Maverick frames it, it is not the plane, it is the pilot. The tools mattered, but Chander had not even seen the code by minute 12. Once the code was visible, his experience turned months of friction into a deployed fix in the remaining 18 minutes.
The Broader Lesson

This incident is a microcosm of the enterprise AI problem. The tools are there. The talent to apply them effectively is scarce. The organizations that close that gap first will win. The ones that assume the tools are sufficient will spend the next five years in the 95%.

06 The Chander Dhall Model

Why the Chander Dhall Forward Deployed Engineering Practice Works

This FDE practice is not a rebranded consulting offering. It is built on a specific model of engagement, a specific caliber of engineer, and a specific set of standards that most organizations have never encountered before.

Chander Dhall is a Microsoft Regional Director, a 15-time Microsoft AI MVP, and a Google Developer Expert. He has advised organizations including Google, Microsoft, LinkedIn, Thomson Reuters, Bank of America, Dell, and more. The depth of expertise he brings to an engagement is not assembled from a bench of available resources. It is the product of decades of building, breaking, and fixing systems at the frontier of what is technically possible.

The FDE practice is built around executive outcomes: speed, automation, production recovery, and teams that keep improving after the engagement. The technical work matters because it creates business leverage: faster delivery, better automation, lower operating cost, and systems that can create billions in value while saving hundreds of millions in waste.

  • Reviving failed projects: diagnosing the real technical and organizational blocker, then turning stalled programs into production systems.
  • Building high-performance automation: creating the workflows, code paths, data systems, and AI-assisted operating models that speed up technology delivery and reduce waste.
  • Training teams: raising the capability of internal engineering teams so the organization keeps improving after the engagement.
  • Augmenting teams with senior experts: adding senior practitioners who work beside the client team, close the gap, and make the delivery system stronger.
  • Creating measurable business leverage: turning software engineering and AI expertise into faster launches, lower operating cost, and durable enterprise value.

The differentiators that matter most in practice:

  • Top-1% engineering caliber: engagements are not staffed from a bench of available resources. Every FDE engagement involves practitioners who have solved the class of problem you are facing before.
  • Honest assessments: If 300 RUs is not good enough, you will hear it. If your architecture has a fundamental flaw that no amount of optimization will fix, you will hear that too. The value of an FDE is not validation. It is truth.
  • Bias toward shipping: The goal is always production. Not a better architecture document. Not a more thorough discovery phase. Production.
  • Outcome ownership: FDEs do not hand off recommendations. They stay until the system works.
Microsoft Regional Director 15x Microsoft AI MVP Google Developer Expert Azure Architecture Cosmos DB Optimization RAG Systems Enterprise AI Fortune 500 Advisory
07 Diagnostic

How to Know You Need a Forward Deployed Engineer

Most organizations that need an FDE do not know they need one. They think they have a model problem, a data problem, or a vendor problem. They have a human expertise problem.

If any of the following describes your current situation, you are likely in the 95%:

  • Your AI PoC has been "almost ready" for six or more months. The demo works. Production does not. Nobody can explain exactly why.
  • Costs are spiraling and nobody can explain why. Token consumption, RU charges, and API costs are growing faster than usage. The economics that worked in the pilot do not work at scale.
  • Latency is unpredictable. Response times vary by an order of magnitude between runs. Users are complaining. The team is shrugging.
  • Compliance or legal is blocking go-live. The system works technically but cannot pass the compliance review. Nobody on the team knows how to architect the guardrails the legal team is asking for.
  • Engineering and ML teams are pointing at each other. The ML team says the model is fine. The engineering team says the infrastructure is fine. The system is not fine.
  • You have agents in production making decisions you cannot audit. The system is running. You do not know what it is deciding or why. You would not be able to reconstruct a decision if you needed to.
  • Your data layer is the bottleneck and nobody is deep on it. Cosmos DB, PostgreSQL, Pinecone, Weaviate: the vector store or the primary data store is the performance constraint, and the team's expertise stops at the API surface.

If three or more of these apply, the cost of not engaging an FDE is already higher than the cost of engaging one. The question is not whether you can afford it. It is how much the delay is already costing you.

08 The Path Forward

AI Is Real. The Gap Is Real. The Fix Is Human.

The companies that win the next five years will be those that pair frontier AI with frontier engineers. The ones that assume the tools are sufficient will spend those years in the 95%.

The productivity gains from AI are real. Uber's experience is not an outlier. Engineers with AI tools are genuinely more productive, sometimes by an order of magnitude. The models are improving faster than any previous technology wave. The case for investing in AI is not hype. It is operational reality.

But AI without expert humans embedded in the work is the 95% failure path. The tools amplify expertise. They do not create it. A team without deep enterprise systems knowledge, using AI tools, produces AI-assisted mediocrity at scale. A team with deep expertise, using AI tools, produces results that were previously impossible.

The Forward Deployed Engineer is the bridge. Not a consultant who will tell you what to build. Not a staff augmentation resource who will execute your tickets. An embedded practitioner who will diagnose your real problem, architect the right solution, implement it, and stay until it works in production.

The 18-minute fix inside a 30-minute meeting is not a story about a clever trick. It is a story about what becomes possible when you have the right human in the right place at the right time. Twelve minutes into that call, Chander still had not seen the code. Eighteen minutes after that, the fix was deployed to production. That is the value of a Forward Deployed Engineer.

If you are reading this and recognizing your own organization in the 95%, the question is not whether you can afford an FDE. It is how much the current situation is already costing you. Every week a production launch is delayed, every month a PoC stalls, every quarter a compliance review blocks go-live: that is the cost. It is already running. The only variable is whether you close it.

The Standard

Three hundred RUs is not very good. But it solved the purpose, and it was deployed before the call ended. The next conversation is about getting it right. That is the standard: ship the fix, maintain the standard, keep moving. The work is never done. The bar never moves down.

09 Sources

Sources and References

  1. MIT Project NANDA (2025). The GenAI Divide: State of AI in Business 2025. Enterprise GenAI adoption and measurable-return findings.
  2. RAND Corporation (2024). Why AI Projects Fail and How They Can Succeed. More than 80% of AI projects fail, roughly twice the failure rate of non-AI IT projects.
  3. Gartner / APMdigest (2024). GenAI project abandonment forecast. At least 30% of GenAI projects will be abandoned after proof of concept by end of 2025 due to poor data quality, inadequate risk controls, escalating costs, or unclear business value.
  4. McKinsey State of AI summary. State of AI 2025: McKinsey Report. AI adoption and value-creation trends across organizations.
  5. Forbes / Janakiram MSV (May 2026). "Uber Burns Its 2026 AI Budget In Four Months On Claude Code." Uber's AI budget pressure, Claude Code adoption, and AI-generated code discussion.
  6. Business Insider / Yahoo Finance (May 2026). Uber COO Andrew Macdonald on AI token spending and syndicated coverage of the same comments.
  7. Benzinga / Yahoo Finance (2026). Uber CTO Praveen Neppalli Naga on AI coding adoption and Yahoo Finance coverage of Uber's AI coding metrics.
  8. Palantir / Silicon Valley Product Group. Palantir AI FDE documentation and SVPG analysis of Forward Deployed Engineers.
  9. Chander Dhall (2026). Anonymized production incident recounted in this report: Azure Cosmos DB query optimization from 9,000 RUs to 300 RUs in the 18 minutes after the relevant code became visible. Identifying client details are omitted.
Work With Chander
Your AI project belongs outside the 95%.
Book a 30-minute architecture review. If we cannot find a production-grade fix, you owe nothing.

If your AI PoC has stalled, your production costs are spiraling, or your team has declared a problem unsolvable, you need a Forward Deployed Engineer. Not a consultant. Not a slide deck. An embedded practitioner who will diagnose the real problem and ship the fix.

Twelve minutes into the call, Chander still had not seen the code. Eighteen minutes after that, the fix was deployed to production.

That is what a Forward Deployed Engineer delivers. That is the standard we hold ourselves to.
Microsoft Regional Director 15x Microsoft AI MVP Google Developer Expert Fortune 500 Advisory Azure . AWS . GCP Cosmos DB . RAG . Agents
Book a 30-Minute Architecture Review → If Chander cannot identify at least one production-grade fix in that call, you owe nothing. No pitch deck. No discovery phase. A direct technical conversation about your specific problem. FDE engagements are capacity-limited and not staffed from a bench.