Build Production-Ready Agentic-RAG Applications From Scratch Course: What we are going to build

Sep 02, 2025

On Saturday, September 27th, I am launching a new course: Build Production-Ready Agentic-RAG Applications From Scratch! This is a fully hands-on course where we are going to deploy a production-ready Agentic-RAG application with LangGraph, FastAPI, and React! Here is what we are going to build.

Signup!

What we are going to build

We are going to build a fun web application where we can demonstrate how to orchestrate a robust RAG application using LangGraph, FastAPI, and React. Here is what we are going to build:

A user can pass a GitHub repository URL
The files of the related repository are scraped and indexed in a vector database
Now the code is available for the user to ask questions about.

On the frontend, we will need two main functionalities:

A page where we can input the repository URL and start the crawling and indexing processes:

And a chatbot interface to ask questions about the code in the repository:

On the backend, we will need the related endpoints:

The indexing endpoint will respond to the provided GitHub repository URL and the “crawl“ action to start the crawling and indexing processes.
The chat endpoint that will respond to messages sent by the user from the chatbot interface.

We are going to use the following tools:

React for the frontend
FastAPI for the backend
LangGraph for the agentic orchestration
Pinecone for the vector database
Langsmith for observability
Deploy everything on Google Cloud!

Project-based course

We will focus on building the project from the ground up, as we would on the job. Here is how we are going to structure the project development:

Introduction
- What we want to build
- Setting up the environment
The RAG Application
- The Data Parsing Pipeline
- The Indexing Pipeline
- The Basic RAG Pipeline
- Adding Observability to the Pipeline with Langsmith
- Going Agentic
The Backend Application
- The Indexing API Endpoint
- Adding Memory
- Administering the Database Data
The Frontend Application
- The Indexing Page
- The Chatbot Page
Deploying to GCP

Each session will be a live, hands-on coding session where we are going to implement every component from scratch

Going Agentic

“Agentic” means that we are going to use an LLM as a decision engine to enhance the quality of our pipeline. We will focus on improving the accuracy of the pipeline at the cost of latency and cost, and discuss the opportunities to reduce those induced negative points with small language models and fine-tuning. In the RAG pipeline, we are going to build a subagent for each of the main components:

Intent router: the entry point of the pipeline that will decide if a RAG pipeline is required.
The retriever: The sub-agent that will extract the right data
The generator: The sub-agent that will generate the response to the user

Scaling up

With this course, I want to focus on what we would need to do to deploy the application to 1M users. We will make sure to design every endpoint to be asynchronous, queue the indexing requests, and deploy the application with elastic load balancing to scale the application horizontally.

This is going to be a fun ride! Make sure to join us!

Signup!

Discussion about this post

Ready for more?