AI EngineerJune 26, 202618m

Agents in Production: How OpenGov Built and Scaled OG Assist - Gabe De Mesa, OpenGov

TL;DR

Effect library bet paid off: OpenGov moved from LangGraph to a custom Effect Native Agent Loop, gaining full control over tracing, structured concurrency, and logging while maintaining the ability to hot-swap language models via dependency injection.
A2A protocol drives alignment: Adopting Google's Agent-to-Agent protocol gave them a rigorous spec (including agent cards with name and description) that served as the contract between front-end and back-end, accelerating development.
Shipping is the start, not the finish: They combine thumbs up/down user feedback with automated evals in CI to test prompts against real completions, checking if tools were hit correctly.
Safety through determinism: Humans must explicitly approve or reject tool calls that require approval, and agents execute code in ephemeral sandboxes that tear down automatically after use.
Rolling summarization beats context stuffing: Instead of stuffing all messages into context, they use running summaries after N messages plus recall over the summary, keeping only the most recent messages in full.
Agents build their own tools internally: OpenGov uses Claude and Cursor to accelerate their own development velocity while building customer-facing agent capabilities.

The Breakdown

OpenGov built OG Assist, an AI assistant integrated across their entire ERP product suite, by betting big on Effect (a TypeScript library) and building their own agent loop after outgrowing LangGraph. The talk covers their full stack approach including A2A protocol for front-end/back-end alignment, automated evals in CI, sandboxing for safe code execution, and rolling summarization for handling long conversations.