Anthropic wants to own its agent’s memory, evaluations, and orchestration, and that should make companies nervous.



Just a few weeks after announcing Claude Managed AgentsAnthropic has updated the platform with three new capabilities that collapse infrastructure layers such as memory, evaluation, and orchestration from multiple agents, into a single runtime.

This measure could threaten the independent tools that many companies improvise.

The new capabilities (‘Dreaming’, ‘Outcomes’ and ‘Multi-Agent Orchestration’) aim to make agents within Claude Managed Agents “better capable of handling complex tasks with minimal direction,” Anthropic said in a press release.

Dreams are all about memory, where agents “reflect” on their numerous sessions and curate memories to learn and bring to light unknown patterns. Results allow teams to define and set specific rubrics to measure an agent’s success, while multi-agent orchestration breaks down jobs so a lead agent can delegate to other agents.

Ideally, Claude Managed Agents provides enterprises with a simpler path to deploying agents and integrates orchestration logic into the model layer. It is an end-to-end platform for managing state, execution graphs, and routing. With the addition of Dreaming, Outcomes, and Multi-Agent Orchestration, Claude Managed Agents further expands capabilities and competes directly with tools like LangGraph or CrewAI, as well as external evaluation frameworks, RAG memory architectures, and quality control loops.

An integration threat

Companies now need to ask themselves: Should we ditch our flexible, modular system in favor of an agent platform that includes almost everything in-house?

Anthropic designed Claude Managed Agents to share context, status, and traceability in one place. This means the platform sees all the decisions agents make, rather than companies having to connect separate systems. It seems practical to have one platform that does it all. But not all companies want a full-service system.

Claude Managed Agents already faces criticism for fostering vendor lock-in because it owns most of the architecture and tools that govern agents. In the current paradigm, an organization can run managed agents, but keeping orchestration, memory, or evaluations of multiple agents in a separate space ensures flexibility.

The platform offers a fully hosted runtime, meaning that memory and orchestration run on infrastructure that the company does not own. This can become a compliance nightmare for some organizations that have to prove data residency.

Another issue to consider is that companies already in the midst of large-scale AI transformations must improvise solutions to address the limitations of their technology stack. Not all workflows can be easily replaced when switching to Claude Managed Agents.

Dreams and results versus current tools

Most companies have a fragmented approach to implementing AI.

For example, they can use LangGraph or Crew AI for agent routing and workflow management, Pinecone as a vector database for long-term memory, DeepEval for external evaluation, and a human QA present to review some tasks. Anthropic hopes to put an end to all that.

With Dreaming, Anthropic addresses memory by allowing users to actively rewrite it between sessions, so the agent essentially learns from its mistakes. Anthropic says this capability is useful for long-running states and orchestrations. Current systems often handle memory persistence by storing embeddings, retrieving relevant context, and adding more state over time.

The results address the evaluation part by detailing the agents’ expectations. Instead of external quality checks, which are often performed by a team of humans, Anthropic brings evaluation to the orchestration layer instead of above it.

But it’s the multi-agent orchestration capability that pits Claude Managed Agents against orchestration frameworks from Microsoft, LangChain, CrewAI, and others. Modeling vendors like Anthropic and OpenAI have already started aggressively entering this space, arguing that bringing it to the model layer gives teams better control.

Big decisions to make

Companies face a big decision, and it could depend on their agent maturity level.

If an organization is still experimenting with agents and hasn’t deployed many into production, they may find it much easier to move to Claude Managed Agents and configure Dreaming and Outcomes to their needs. This is the stage of development where even if companies use a third-party orchestrator like LangChain, they are still customizing it.

But for those who are further along in the process, the calculation becomes more complicated. Now it is a matter of parallel evaluation and better understanding of your processes.

However, companies will face the same decision even if they do not intend to use Claude Managed Agents. Anthropic has noted that other model and platform vendors will likely shift their product roadmaps to a similar model that keeps everything locked in the same system, because models may become interchangeable, but tools and orchestration infrastructure will not.



Source link

Leave a Reply

Your email address will not be published. Required fields are marked *