Blog

Writing

Life, tech, and everything in-between.

Semantic RAG

May 11, 2026 · 6 min · AI

Semantic RAG is a fixed pipeline: chunk, embed, search, generate. Tuned well, it handles most production lookup workloads at a cost the alternatives cannot match. A short walkthrough of the pieces and the upgrades that matter.

The Augmented LLM: Retrieval, Tools, and Memory

Feb 18, 2026 · 6 min · AI

A bare language model is a closed system. It cannot look up a fresh fact, cannot act in the world, cannot remember the last conversation. Every non-trivial agent is an attempt to lift one or more of those three constraints.