Blog

Writing

Life, tech, and everything in-between.

Guardrails for LLM Agentic Systems: Layered Defense

May 4, 2026 · 7 min · AI

Safety instructions in a system prompt are not sufficient for production. Agents evaluated on safety benchmarks still behave unsafely in a significant fraction of cases. The fix is layered defense: prompt-level, tool-level, model-level, and system-level guardrails, stacked so that a failure in any one layer is caught by the next.
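A minimal sketch of the stacking idea, not the post's actual implementation. Every name here (`Action`, the four layer functions, `ALLOWED_TOOLS`, `check`) is hypothetical, and each check is a deliberately crude stand-in for a real guardrail. The point is only the shape: an action must pass every layer, and the first layer that objects blocks it.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Action:
    tool: str
    argument: str


# A guardrail returns a reason string to block the action, or None to let it pass.
Guardrail = Callable[[Action], Optional[str]]

ALLOWED_TOOLS = {"search", "read_file"}  # hypothetical allowlist


def prompt_layer(action: Action) -> Optional[str]:
    # Stand-in for prompt-level defenses: flag text that looks like an injection.
    if "ignore previous instructions" in action.argument.lower():
        return "prompt: possible injection"
    return None


def tool_layer(action: Action) -> Optional[str]:
    # Tool-level: only allowlisted tools may run at all.
    if action.tool not in ALLOWED_TOOLS:
        return f"tool: {action.tool!r} is not allowlisted"
    return None


def model_layer(action: Action) -> Optional[str]:
    # Stand-in for a safety classifier; a keyword match is used here for brevity.
    if "rm -rf" in action.argument:
        return "model: flagged as destructive"
    return None


def system_layer(action: Action) -> Optional[str]:
    # System-level: a hard limit enforced regardless of what the model intended.
    if len(action.argument) > 1000:
        return "system: argument exceeds size limit"
    return None


LAYERS: List[Guardrail] = [prompt_layer, tool_layer, model_layer, system_layer]


def check(action: Action) -> Optional[str]:
    """Run the action through every layer; return the first block reason, if any."""
    for layer in LAYERS:
        reason = layer(action)
        if reason is not None:
            return reason
    return None
```

Under these assumptions, `check(Action("search", "weather"))` passes every layer and returns `None`, while an action that slips past the prompt layer can still be stopped by the tool allowlist or a later layer.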

Human-in-the-Loop Patterns for LLM Agents

Apr 29, 2026 · 7 min · AI

Full autonomy is not the right target for every agent. Put a human in the loop at the points that matter: before dangerous actions, for ambiguous decisions, and as an escalation path when the agent is stuck. Checkpointing makes the pauses cheap.
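A rough sketch of an approval gate with checkpointing, under stated assumptions. `AgentState`, `DANGEROUS_TOOLS`, `run_step`, and `resume` are hypothetical names, and the checkpoint is plain JSON rather than any particular framework's format. The agent pauses before a dangerous action, serializes its state, and resumes from that state once a human decides, so nothing has to be recomputed while waiting.

```python
import json
from dataclasses import asdict, dataclass
from typing import Optional, Tuple

DANGEROUS_TOOLS = {"delete_file", "send_payment"}  # hypothetical list


@dataclass
class AgentState:
    step: int
    pending_tool: str
    pending_argument: str


def save_checkpoint(state: AgentState) -> str:
    # Serialize the state so the process can exit while a human reviews it.
    return json.dumps(asdict(state))


def load_checkpoint(blob: str) -> AgentState:
    return AgentState(**json.loads(blob))


def run_step(state: AgentState, approved: bool = False) -> Tuple[str, Optional[str]]:
    """Return ("paused", checkpoint) when approval is needed, else ("executed", None)."""
    if state.pending_tool in DANGEROUS_TOOLS and not approved:
        return "paused", save_checkpoint(state)
    # A real agent would invoke the tool here; this sketch only reports the outcome.
    return "executed", None


def resume(blob: str, approved: bool) -> str:
    """Continue from a checkpoint once the human reviewer has decided."""
    state = load_checkpoint(blob)
    if not approved:
        return "rejected"
    status, _ = run_step(state, approved=True)
    return status
```

In this sketch, `run_step(AgentState(3, "delete_file", "/tmp/x"))` returns `"paused"` plus a checkpoint string; passing that string to `resume(..., approved=True)` picks up at the same step.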