AI & Agents — Reading Order
A structured path through 24 posts — from ML foundations to production-grade agentic systems. Whether you're an engineer building your first RAG pipeline or a leader evaluating agent frameworks, start here.
A structured path through my AI & Agents posts. The progression moves from foundations through LLMs and retrieval into agentic systems and production concerns. Each section builds on the previous one.
1. Foundations — What AI Actually Is
Start here if you’re coming from a pure software engineering background. These posts build the mental model for everything that follows.
- AI Hierarchy — the nested capability stack: AI ⊃ ML ⊃ DL ⊃ GenAI
- Machine Learning Models & Learning Paradigms — supervised, unsupervised, reinforcement — when to use which
- Top 20 ML Algorithms — the algorithms that power production systems
- Evaluation Metrics — precision, recall, F1, AUC — choosing the right metric
- Deep Learning Architectures — CNNs, RNNs, transformers — the neural network zoo
2. GenAI & LLMs — How Language Models Work
The transformer revolution. Understanding this layer is prerequisite to building anything useful with LLMs.
- LLM Architecture & Training — from transformer internals to training at scale
- Controlling Randomness — Temperature, Top-k, Top-p — the creativity-coherence tradeoff
- Fine-tuning vs RAG vs Prompting — three ways to customize LLMs, with tradeoffs
- LLM Evaluation — measuring quality across accuracy, latency, cost, safety
3. RAG & Knowledge Retrieval
Grounding LLMs in your own data. This is where most enterprise AI projects start.
- RAG Architecture — retrieval-augmented generation end to end
- Vector Databases — semantic search at scale with embeddings
4. Agent Frameworks — The Toolbox
Comparative deep-dives into the major frameworks. Read the first two to understand the landscape, then the ones relevant to your stack.
- Agent Orchestration & Handoffs — patterns for wiring agents together
- Agent Memory & State — turning stateless LLMs into systems that learn
- LangChain & LangGraph — composable primitives and stateful graph-based workflows
- CrewAI — role-based multi-agent systems with minimal boilerplate
- Microsoft AutoGen — multi-agent conversations with code execution
- OpenAI Agents SDK — lightweight agents with handoffs and guardrails
- Google Agent Development Kit — hierarchical orchestration on Vertex AI
- Claude API & Agent SDK — Anthropic’s production agent framework
5. Protocols & Interoperability
How agents connect to the outside world and to each other.
- Model Context Protocol (MCP) — the open standard for agent-tool integration
- Agentic Protocol Landscape — the full protocol stack from tool calls to payments
6. Production — Shipping AI Responsibly
The hard part. Everything above is table stakes — this section is what separates demos from products.
- Harness Engineering — the operating system around the model
- Evals & Guardrails — catching regressions and preventing harm
- Cost Control & Optimization — keeping agentic workloads economically viable
- Async Agent Workflows — long-running agents that checkpoint and resume