Name: Towards AI Legal Name: Towards AI, Inc. Description: Towards AI is the world's leading artificial intelligence (AI) and technology publication. Read by thought-leaders and decision-makers around the world. Phone Number: +1-650-246-9381 Email: pub@towardsai.net
228 Park Avenue South New York, NY 10003 United States
Website: Publisher: https://towardsai.net/#publisher Diversity Policy: https://towardsai.net/about Ethics Policy: https://towardsai.net/about Masthead: https://towardsai.net/about
Name: Towards AI Legal Name: Towards AI, Inc. Description: Towards AI is the world's leading artificial intelligence (AI) and technology publication. Founders: Roberto Iriondo, , Job Title: Co-founder and Advisor Works for: Towards AI, Inc. Follow Roberto: X, LinkedIn, GitHub, Google Scholar, Towards AI Profile, Medium, ML@CMU, FreeCodeCamp, Crunchbase, Bloomberg, Roberto Iriondo, Generative AI Lab, Generative AI Lab VeloxTrend Ultrarix Capital Partners Denis Piffaretti, Job Title: Co-founder Works for: Towards AI, Inc. Louie Peters, Job Title: Co-founder Works for: Towards AI, Inc. Louis-François Bouchard, Job Title: Co-founder Works for: Towards AI, Inc. Cover:
Towards AI Cover
Logo:
Towards AI Logo
Areas Served: Worldwide Alternate Name: Towards AI, Inc. Alternate Name: Towards AI Co. Alternate Name: towards ai Alternate Name: towardsai Alternate Name: towards.ai Alternate Name: tai Alternate Name: toward ai Alternate Name: toward.ai Alternate Name: Towards AI, Inc. Alternate Name: towardsai.net Alternate Name: pub.towardsai.net
5 stars – based on 497 reviews

Frequently Used, Contextual References

TODO: Remember to copy unique IDs whenever it needs used. i.e., URL: 304b2e42315e

Resources

Our 15 AI experts built the most comprehensive, practical, 90+ lesson courses to master AI Engineering - we have pathways for any experience at Towards AI Academy. Cohorts still open - use COHORT10 for 10% off.

Publication

The Real Reason Enterprise Agents Are So Hard (And It’s Not the AI)
Latest   Machine Learning

The Real Reason Enterprise Agents Are So Hard (And It’s Not the AI)

Last Updated on October 7, 2025 by Editorial Team

Author(s): ravindu somawansa

Originally published on Towards AI.

Enterprise demos make agents look effortless. But the real pain starts after the pitch — when AI agents, workflows, legacy systems, and evaluation come into play.

The Real Reason Enterprise Agents Are So Hard (And It’s Not the AI)
Failing Agents in Enterprise

Why this matters now

Agents are everywhere. Demo videos flood LinkedIn. Vendors promise “autonomous copilots” that will run your entire department while you sip a latte. And, to be fair, the prototypes are pretty nice.

But if you’ve ever tried moving from slides to production, you know: the AI is not the hardest part. Models are improving fast, and calling an API isn’t rocket science. The true obstacles are older, messier, and deeply human.

When enterprises stall on agents, they run into four walls:

  1. Seeing Agents everywhere (the not to do).
  2. Defining what should be automated (workflow clarity).
  3. Integrating with what already exists (legacy systems and APIs).
  4. Proving it works reliably (evaluation and monitoring).

Let’s break these down.

What really makes it hard

Yes, there’s architecture, frameworks, memory, multimodal, and real-time. Those matter. But compared to the big three walls, they’re solvable engineering problems.

The mess comes from aligning people, processes, and creaky infrastructure. That’s where enterprise projects sink or swim.

Wall #1 —Seeing Agents everywhere (the not to do)

Seeing Agents Everywhere

Before anything, one thing is worth saying out loud: you don’t need Agentic systems everywhere. In fact, many enterprise problems are better solved with simpler, sturdier approaches:

  • Classic code — if the process is repetitive and well-defined, a script or service will run faster, cheaper, and more reliably than an agent.
  • Traditional machine learning — when the task is about predictions from structured data, a regression or classifier often outperforms a reasoning loop.
  • Graphical interfaces and workflow engines — sometimes the real need is clarity and usability; mapping a flow in a UI solves more than adding autonomy.
  • Straightforward LLM calls — in plenty of cases, a few carefully-prompted API calls provide all the “intelligence” required, without orchestration overhead.

Agents are best reserved for messy, multi-step, dynamic workflows where flexibility is crucial. For everything else, matching the right tool to the task avoids extra cost, fragility, and integration headaches.

Wall #2— workflow definition (the what)

Definin Process Workflow

Here’s the truth: enterprises rarely have clean workflows.

Processes live in people’s heads. Exceptions pile up. Compliance adds hidden steps. By the time you ask “what exactly should the agent handle?” you’re in a swirl of never ending meetings, outdated specs, and side comments like: “Oh, but for client X we do it differently.”

This is why workflow modernization is step zero:

  • Sit with the business, map the workflow, details each actions taken and by who, how manual is it.
  • Clarify what gets automated, how it will be automated (not everything needs Agentic), what stays human, and how they are linked together.
  • Document the messy reality, show what the workflow could be and validate it.

Without this groundwork, your agent will either:

  • Automate the wrong thing.
  • Automate half the thing and stall.
  • Or get quietly ignored by the very people it was supposed to help.

Wall #3 — integration with existing systems (the how)

Integrating into Existing Systems

Once you know what to automate, you face the second wall: plugging into the systems already in place.

And here’s the kicker — most of those systems were never designed with agents in mind. Many weren’t even designed with APIs in mind.

  • Legacy ERPs that require brittle connectors.
  • CRMs or ticketing systems with half-documented endpoints.
  • In-house apps written a decade ago in frameworks nobody touches anymore.
  • Authentication schemes, role-based access, compliance hoops.
  • Backend system with workflow so complex you will need 3 days just to understand what it is being used for.

Integration isn’t just “connecting to an API.” It’s navigating decades of tech debt, ownership silos, and brittle dependencies.

This is why a demo agent running smoothly on a greenfield app stack suddenly collapses in the real world. It has to speak to systems that were patched and customised over years.

In enterprise reality, integration equals:

  • Find the legacy systems workflow and how to use them.
  • Get the system expert to help us (they don’t have time !)
  • Translating between old and new data formats.
  • Handling rate limits and reliability issues.
  • Negotiating with IT/security teams for access (sometimes the hardest part).

Until this wall is climbed, agents remain stuck in prototype loop.

Wall #4 — evaluation (the proof)

Getting Proof that It Works

Even if you nail the workflow and survive integration, you hit the third wall: how do you know it works?

Evaluation in agents is notoriously slippery (and completely avoided):

  • Task-level metrics: Did the agent complete the workflow as defined? What are the pourcentage of completion ? of false positive ?
  • Agentic-level metrics: Did the agent followed the workflow and generated correct planning ? Did we catch all the process in errors and transfered them to be handled by humans ?
  • Business metrics: Did it save time, reduce cost, or improve accuracy?
  • Safety metrics: Did it avoid hallucinations, policy violations, compliance breaches and basically did nothing we do not want it to do ?

The usual ML trick — accuracy on a benchmark dataset — doesn’t cut it. Each enterprise has unique needs.

Practical patterns here include:

  • Evaluation datasets: curated inputs with expected agents planning and outputs.
  • Real Agentic evaluation: don’t only evaluate the results, evaluate the agent plans and delegation (when agent leave the floor to another).
  • Shadow mode: agents run alongside humans before taking full control.
  • Continuous monitoring: track drift, performance, and regressions over time.

Without evaluation discipline, agents either look magical in a demo and fail quietly in production — or worse, they break something critical without anyone noticing.

Conclusion — why agents really fail in enterprise

So let’s recap.

The toughest part of enterprise agents is not the AI itself. It’s:

  1. Agent Mirage (the not to do): seeing Agents everywhere where there is no need.
  2. Clarity (the what): defining workflows with the business, modernizing where needed.
  3. Integration (the how): plugging into legacy systems, brittle APIs, and decades of tech debt.
  4. Evidence (the proof): evaluating agents continuously to build trust.

Ignore these, and your “autonomous copilot” (if you even need this level of agentic) will stay trapped in prototype purgatory. Embrace them, and you turn AI from shiny demos into enterprise-grade assets.

The lesson? Treat agent adoption not as an AI project, but as a workflow + integration modernization project — with evaluation built in from day one.

👉 If you enjoyed this article and want to read more about AI, MCP, and Multi-Agent systems, follow me here on Medium or connect with me directly on LinkedIn!

Join thousands of data leaders on the AI newsletter. Join over 80,000 subscribers and keep up to date with the latest developments in AI. From research to projects and ideas. If you are building an AI startup, an AI-related product, or a service, we invite you to consider becoming a sponsor.

Published via Towards AI


Take our 90+ lesson From Beginner to Advanced LLM Developer Certification: From choosing a project to deploying a working product this is the most comprehensive and practical LLM course out there!

Towards AI has published Building LLMs for Production—our 470+ page guide to mastering LLMs with practical projects and expert insights!


Discover Your Dream AI Career at Towards AI Jobs

Towards AI has built a jobs board tailored specifically to Machine Learning and Data Science Jobs and Skills. Our software searches for live AI jobs each hour, labels and categorises them and makes them easily searchable. Explore over 40,000 live jobs today with Towards AI Jobs!

Note: Content contains the views of the contributing authors and not Towards AI.