Developers can now debug and test AI agents locally with Raindrop’s open source Workshop tool.



Observability start Raindrop AIThe new open source, licensed by MIT "Workshop" The tool, released today, offers developers something they’ve probably wanted, perhaps unconsciously, since the agent AI era began in earnest last year: a local debugger and evaluation tool designed specifically for AI agents, allowing developers to see all traces of what their agent has been doing in a single, lightweight Structured Query Language (SQL) database file (.db).

It works as a local daemon and user interface that transmits each token, tool call and decision to a local panel, usually hosted on localhost:5899—at the moment it happens. By visiting your local server, developers can see everything their agent was doing (including bugs or errors) and identify what went wrong, when, and, ideally, discern why. It’s all stored in a single .db file, which takes up relatively little memory, according to an X direct message VentureBeat received from Ben Hylak, co-founder and CTO of Raindrop (and former Apple and SpaceX engineer).

This real-time telemetry eliminates the latency of traditional polling and addresses a growing concern among developers regarding the privacy of sending local traces to external servers.

The tool is available for macOS, Linux and Windows. It can be installed using a one-line shell command that automates binary location and PATH configuration for bash, zsh, and fish shells. For developers who prefer to build from source, the repository is hosted on GitHub and uses the Bun runtime.

The product: establishing a self-healing evaluation cycle

The most notable feature of the platform is the "self-healing evaluation loop," allowing coding agents like Claude Code to read traces, write evaluations to the codebase, and repair broken code autonomously.

In a practical application, if a veterinary assistant agent does not ask the necessary follow-up questions, Workshop captures the entire trajectory. Claude Code then reads this trace, writes a specific evaluation, identifies the logical error in the message or code, and reruns the agent until all assertions pass.

Compatibility and integration of ecosystems.

Workshop supports a wide range of programming languages, including TypeScript, Python, Rust, and Go.

It integrates with popular frameworks and SDKs such as Vercel AI SDK, OpenAI, Anthropic, LangChain, LlamaIndex, and CrewAI. It is also designed to work seamlessly with several coding agents, including Claude Code, Cursor, Devin, and OpenCode.

Licenses and community implications.

Workshop is published under the MIT license, ensuring that it remains free and open source for all users. This permissive license is intended to encourage community contribution and allow business users to maintain data sovereignty.

Hylak noted in X that the tool was created to provide a "sane" way to debug agents locally, changing the way your team and early customers build autonomous systems.

To celebrate the launch, Raindrop offered limited edition physical products to users who installed the tool and executed a specific task. "drip" domain.



Source link

Leave a Reply

Your email address will not be published. Required fields are marked *