AI Engineer

intermediate 10 hours

From consumer to builder

Build AI-powered applications from scratch. Master embeddings, RAG pipelines, tool calling, and agentic architectures.

For: Developers

What You’ll Learn

Understand tokenization and attention at implementation level
Implement semantic search using embeddings
Build RAG pipelines from scratch
Create custom tools and MCP servers
Design agentic systems with proper guardrails
Build evaluation frameworks using LLM-as-Judge

Learning Path

Level 4: AI Primitives

How the engine works under the hood: tokens, attention, embeddings, and type-safe schemas

LLM Foundations

Demystify language models: tokens, attention, context windows, and embeddings. Understand how models process and represent meaning.

60 min

Type Safety for AI

Get structured, predictable output from LLMs using JSON Schema, Zod, and Pydantic. Build reliable AI integrations.

50 min

Level 5: Retrieval Systems

Connect LLMs to your data with RAG pipelines and semantic search

RAG Fundamentals

Connect LLMs to your data with Retrieval-Augmented Generation. Build pipelines that answer questions using your documents.

60 min

Level 6: Agentic Systems

Build loops, tools, and autonomous systems with proper safety and evaluation

Tools & MCP

Extend LLM capabilities beyond text generation. Learn how tool calling works, build your own tools, and implement the Model Context Protocol.

45 min

Agentic Architecture

Design autonomous AI systems with loops, guardrails, and evaluation. Build agents that plan, execute, and learn.

75 min

Concepts Covered

Fundamentals

Tokens & Tokenization beginner

LLMs process text as tokens — chunks of characters that form the atomic units of input and output, directly affecting pricing and context limits.

10 min

Context Windows beginner

The context window is the maximum amount of text (in tokens) an LLM can 'see' at once, including prompts, history, injected documents, and responses.

12 min

Attention Mechanism intermediate

The core mechanism that allows language models to understand how words relate to each other by dynamically focusing on relevant parts of the input.

15 min

Embeddings & Vector Representations beginner

Embeddings convert text into numerical vectors that capture semantic meaning, enabling similarity search, clustering, and the foundation for RAG systems.

15 min

Structured Outputs: Reliable AI Data Extraction beginner

Constrain AI responses to follow a specific format using JSON Schema, enabling reliable data extraction and type-safe integrations.

15 min

Retrieval-Augmented Generation (RAG) intermediate

RAG combines document retrieval with LLM generation, allowing AI to answer questions grounded in your specific data without fine-tuning.

20 min

Type Systems

JSON Schema: Defining Data Contracts beginner

JSON Schema defines the exact structure and constraints for LLM outputs, ensuring type-safe, validated responses without post-processing guesswork.

15 min

Zod for TypeScript Runtime Validation intermediate

Zod brings runtime validation to TypeScript AI applications, ensuring LLM outputs match your types at runtime while maintaining compile-time type safety.

20 min

Pydantic for Type-Safe AI intermediate

Pydantic brings runtime validation and type safety to Python AI applications, automatically converting JSON Schema to validated Python objects with IDE autocomplete.

20 min

Patterns

Golden Dataset Curation intermediate

Building high-quality evaluation datasets that anchor AI system quality - because an eval is only as good as its test cases.

12 min

Agentic Loops advanced

Iterative refinement pattern where AI generates output, evaluates it against criteria, and improves through multiple cycles until quality thresholds are met.

12 min

Guardrails intermediate

Safety constraints and validation mechanisms that prevent AI systems from producing harmful, incorrect, or policy-violating outputs.

10 min

AI System Monitoring intermediate

Observability practices for AI systems that track model performance, costs, latency, and output quality in production.

12 min

Cost Optimization for AI Systems intermediate

Strategies to reduce AI infrastructure costs by 50-90% through prompt caching, batch APIs, model tiering, and context pruning.

10 min

Protocols

Tool Use & Function Calling intermediate

Tool use enables LLMs to interact with external systems by generating structured function calls that applications execute and return results for.

15 min

Model Context Protocol (MCP) intermediate

MCP is an open protocol by Anthropic that standardizes how AI applications connect to data sources and tools through a unified server architecture.

12 min

Hands-On Exercises

Design a Type-Safe AI Schema beginner

Design and implement a Zod schema for structured AI outputs. Learn to constrain LLM responses for reliable data extraction.

build30 min

Structured Output Extraction beginner

Learn to extract structured data from unstructured text using JSON Schema constraints, ensuring type-safe outputs from LLMs.

build25 min

Implement a RAG Pipeline intermediate

Build a Retrieval-Augmented Generation system from scratch. Index documents, embed queries, retrieve relevant chunks, and generate sourced answers.

build45 min

Build a Golden Dataset from Git History intermediate

Learn to mine git commit history for evaluation test cases, creating a robust dataset that captures real-world code patterns and edge cases.

build40 min

Compare Chat vs RAG vs Agent intermediate

Evaluate three different AI architectures for a real-world scenario to understand when to use each approach.

compare45 min

Debug a Hallucinating AI intermediate

Learn to identify, diagnose, and fix AI hallucinations in a production-like scenario using grounding techniques.

debug35 min

Build an LLM Tool By Hand intermediate

Implement a tool from scratch without using a framework, to deeply understand how tool calling works under the hood.

build30 min

Build a Read-Only MCP Server intermediate

Create a Model Context Protocol server that exposes safe, read-only tools to AI assistants. Learn MCP architecture and identify security vulnerabilities.

build45 min

Build a Multi-Step Agent advanced

Implement an agent that can plan, execute, and iterate on multi-step tasks with tool use and state management.

build60 min

Build an LLM-as-Judge Evaluation Pipeline intermediate

Learn to evaluate AI outputs using model-graded evaluation (LLM-as-Judge), the pattern where a stronger model grades outputs from weaker models.

build45 min

Build a Prompt Injection Firewall intermediate

Learn to build a security layer that detects and blocks prompt injection attacks before they reach your main LLM application.

build45 min

Decision Guides

Chat vs RAG vs Agent beginner

When should I use a simple chatbot vs RAG vs an autonomous agent?

20 min

Fine-Tuning vs RAG intermediate

Should I fine-tune a model or use RAG for domain-specific knowledge?

20 min

LangChain vs Custom Implementation intermediate

Should I use LangChain (or similar frameworks) or build custom?

15 min

AI Assisted Developer Track - Master using AI tools first
RAG vs Fine-tuning - When to use retrieval vs training
LangChain vs Custom - Framework vs building from scratch

Tempered AI — Forged Through Practice, Not Hype

? Keyboard shortcuts

AI Engineer

What You’ll Learn

Learning Path

Level 4: AI Primitives

LLM Foundations

Type Safety for AI

Level 5: Retrieval Systems

RAG Fundamentals

Level 6: Agentic Systems

Tools & MCP

Agentic Architecture

Concepts Covered

Fundamentals

Type Systems

Patterns

Protocols

Hands-On Exercises

Decision Guides

Related Resources