Week 2 (Friday): RAG and Embeddings - grounding in your own data

What you keep

How to make a model answer from your data instead of its training, and why naive RAG fails.

You ship

Your endpoint answering from your own document set.

Lessons

Live

Similar meanings sit close together in vector space - retrieval quality depends on this mental picture.

Live

Embed, store, retrieve, stuff into prompt - then break it instructively.

Live

How you split documents decides what retrieval can find - most bad RAG is bad chunking.

Live

Reranking reorders candidates by relevance; hybrid search catches exact terms embeddings miss.

Live

Measure retrieval and generation separately - wrong chunks vs right chunks ignored.

Deep dive

When knowledge has structure and relationships, plain vector RAG leaves value on the table.

Deep dive

Retrieve over images, tables, and diagrams - not just prose.

Assignment

RAG with deliberate chunking and reranking, plus a basic retrieval eval.

Lessons in this module

Enroll via Maven

Covered by the Maven Guarantee