AI Systems

Production AI is a different problem from prototype AI. I write about the gaps between the two: evaluation systems that catch failures before users do, agent architectures that fail gracefully, retrieval pipelines that hold up under real query distributions, and the engineering decisions that most tutorials skip.

March 8, 2026

Agentic AI and Data Products: A Framework for Real-Time Autonomous Decision Systems

A high-level framework for integrating agentic AI with real-time analytics pipelines, covering architecture, governance, trade-offs, risks, and evaluation across finance, IoT, and healthcare.

agentic-ai real-time-analytics data-products streaming

March 8, 2026

Skills vs MCP: Designing the Next Generation of AI Agent Architecture

Not all problems an AI agent solves are the same. This piece unpacks the distinction between Skills (the knowledge layer) and MCP (the execution layer), and explains why separating them leads to more efficient, maintainable agent systems.

mcp agent-skills context-engineering ai-architecture

February 22, 2025

Continuously Improving Agent Quality Using Evaluators Across Single-Turn, Trajectory, and Multi-Turn Interactions

A production-grade guide to agent evaluation: single-turn unit evals, trajectory scoring, multi-turn simulation, harness engineering, and LangSmith-driven continuous improvement loops.

langchain langgraph langsmith evaluation