AI-401

Agentic Systems in Production

Beyond a single tool call: building, debugging, and operating agents that take many actions on behalf of users — without setting the building on fire.

ABOUT THIS COURSE

What you will learn.

Agents are where the field is right now — and also where most of the bad code is. The gap between an agent that demos well and an agent that runs reliably in front of real users is, in our experience, the widest gap in modern AI engineering. This course is about closing it.

We spend seven weeks on agentic systems as engineering objects: planning architectures, the tradeoffs between linear, tree-search, and react-style control flow, tool design for long-horizon tasks, memory and state management, sandboxing and permission models, cost and latency control for multi-step systems, and — critically — how to debug an agent that does the wrong thing on step seven of a forty-step trajectory. Your capstone is an agent that performs a real task end-to-end with a human-in-the-loop boundary you design and defend.

This is our most demanding course. Students should arrive having shipped at least one LLM feature in production. The pace is fast and the reading list is non-trivial.

WHAT YOU’LL BUILD

Four substantial projects.

Project 01

A planning-and-execution agent

Build a multi-step agent that plans, executes, recovers from tool errors, and reports honestly when it gets stuck.

Project 02

A sandboxed tool environment

Design a tool surface that an agent can use without risking the host system or downstream services.

Project 03

A debugging harness

Build the tooling that lets you replay and inspect any trajectory, then use it to fix two real bugs.

Project 04

A human-in-the-loop interface

Design the moments where the agent pauses for human review — and make those moments worth the friction.

CURRICULUM

Week by week.

FIT

Who this is for — and who it is not.

For you if

Engineers who have shipped LLM features and are now being asked to build agents.
Tech leads at companies that have placed a strategic bet on agentic products.
Senior engineers who want to develop hands-on agent intuitions before the next hype cycle.

Probably not for you if

Engineers new to LLMs — take AI-101 or AI-201 first.
People hoping to fine-tune their own agentic model — that is research, not this course.
Anyone looking for a one-weekend bootcamp; this is a serious time commitment.

YOUR INSTRUCTOR

Taught by an operator.

Principal AI Engineer

Dr. Lena Park

Lena was a tech lead on Anthropic’s applied alignment team and previously led a model-evaluation group at Google DeepMind. Her PhD work at Stanford focused on calibration in large transformer systems. She still consults part-time for two foundation-model labs, which keeps the course material honest about what the frontier actually looks like — not what blog posts say it looks like.

FAQ

Questions we’re asked often.

AI-401 · Next cohort starts soon