AI Automation

Build a RAG AI Chatbot with Python, React, OpenAI, and ChromaDB

Learn how to build a GitHub-ready RAG AI chatbot application using Python, FastAPI, React, OpenAI embeddings, ChromaDB, pyenv, and a modern full-stack architecture.

May 8, 2026 Danish Ashraf 9 min read

Introduction

AI chatbots are becoming an important part of modern business applications. They can answer customer questions, search documentation, support internal teams, qualify leads, and automate repetitive communication.

However, a normal chatbot has one major limitation: it may not know your private business data.

This is where Retrieval-Augmented Generation, also known as RAG, becomes useful.

A RAG chatbot searches your documents first, retrieves the most relevant information, and then generates a response using that context.

Instead of guessing, the chatbot answers based on your own knowledge base.

In this project, I built a complete GitHub-ready RAG AI chatbot using:

Python 3.12 with pyenv
FastAPI
React
Vite
OpenAI embeddings
OpenAI Responses API
ChromaDB as a local vector database
A knowledge.txt file for sample company documents
A one-command development script to run backend and frontend together

Build a RAG AI Chatbot with Python, React, OpenAI, and ChromaDB

Watch the full project demo on YouTube.

RAG AI Chatbot GitHub Repository View the complete source code for the Python, FastAPI, React, OpenAI, and ChromaDB RAG chatbot project. github.com View GitHub Repo

What We Are Building

We are building a full-stack AI chatbot application where:

Sample company documents are stored in backend/app/documents/knowledge.txt.
The backend splits the document content into smaller chunks.
Each chunk is converted into an embedding using OpenAI.
Embeddings are stored in ChromaDB.
A user asks a question from the React frontend.
The Python backend searches the most relevant chunks.
OpenAI generates an answer using the retrieved context.
The answer is returned to the chatbot UI.

This kind of application is useful for:

Customer support chatbots
Internal company assistants
SaaS help centers
Documentation search
AI knowledge bases
Lead qualification bots
Product FAQ assistants

Application Architecture

The application has two main parts.

Backend

The backend handles:

Loading environment variables
Reading the knowledge.txt file
Splitting content into chunks
Creating OpenAI embeddings
Storing vectors in ChromaDB
Searching relevant context
Calling OpenAI for final answers
Returning chatbot responses through FastAPI

Frontend

The React frontend handles:

Chat interface
Index Docs button
User input
API requests
Loading states
Displaying AI responses

Project Folder Structure

The current GitHub project structure looks like this. Runtime-generated files such as .venv, node_modules, dist, __pycache__, .env, and ChromaDB database files are intentionally ignored.

Project Structure

rag-ai-chatbot/├── .gitignore├── .python-version├── README.md├── backend/│   ├── .env.example│   ├── requirements.txt│   └── app/│       ├── __init__.py│       ├── chroma_telemetry.py│       ├── config.py│       ├── main.py│       ├── rag.py│       └── documents/│           └── knowledge.txt├── frontend/│   ├── index.html│   ├── package.json│   ├── vite.config.js│   └── src/│       ├── App.css│       ├── App.jsx│       └── main.jsx└── scripts/    └── dev.sh

Step 1 - Clone the GitHub Repository

Clone the project from GitHub:

Clone Project

$git clone https://github.com/danishashraf047/rag-ai-chatbot.git$cd rag-ai-chatbot

git clone https://github.com/danishashraf047/rag-ai-chatbot.git
cd rag-ai-chatbot

Step 2 - Use pyenv for Python

This project uses pyenv instead of global Python.

The repository includes a .python-version file:

.python-version

13.12.9

3.12.9

Install and select the Python version:

Setup Python with pyenv

$pyenv install 3.12.9$pyenv local 3.12.9$python --version

pyenv install 3.12.9
pyenv local 3.12.9
python --version

Expected Python version:

Expected Python Version

1Python 3.12.9

Python 3.12.9

Step 3 - Create the Virtual Environment

Create a project-level virtual environment:

Create Virtual Environment

$python -m venv .venv$source .venv/bin/activate

python -m venv .venv
source .venv/bin/activate

Install backend dependencies:

Install Backend Dependencies

$cd backend$../.venv/bin/python -m pip install --upgrade pip$../.venv/bin/pip install -r requirements.txt

cd backend
../.venv/bin/python -m pip install --upgrade pip
../.venv/bin/pip install -r requirements.txt

Step 4 - Add Environment Variables

Create the backend environment file:

Create Environment File

$cp .env.example .env

cp .env.example .env

Then add your OpenAI API key:

backend/.env

1OPENAI_API_KEY=your_openai_api_key_here2OPENAI_EMBEDDING_MODEL=text-embedding-3-small3OPENAI_CHAT_MODEL=gpt-4.1-mini4CHROMA_COLLECTION_NAME=rag_chatbot_docs5CHROMA_STORAGE_PATH=./chroma_storage6FRONTEND_ORIGINS=http://localhost:5173,http://127.0.0.1:5173

OPENAI_API_KEY=your_openai_api_key_here
OPENAI_EMBEDDING_MODEL=text-embedding-3-small
OPENAI_CHAT_MODEL=gpt-4.1-mini
CHROMA_COLLECTION_NAME=rag_chatbot_docs
CHROMA_STORAGE_PATH=./chroma_storage
FRONTEND_ORIGINS=http://localhost:5173,http://127.0.0.1:5173

Important: never expose your OpenAI API key in the React frontend. Keep it only in the Python backend.

Step 5 - Understand the Knowledge File

The project includes a sample company knowledge file:

Knowledge File Path

1backend/app/documents/knowledge.txt

backend/app/documents/knowledge.txt

This file contains sample company documents for CodeHills, including:

Company overview
Core services
AI and RAG development process
Example use cases
Project timelines
Pricing approach
Security and privacy notes
Deployment and maintenance details
Frequently asked questions

When you click Index Docs, the backend reads this file, splits it into chunks, creates embeddings, and stores the chunks in ChromaDB.

You can replace this file with your own:

FAQs
Product documentation
Company policies
Help center articles
Service descriptions
Customer support content

Step 6 - Backend Configuration

The backend configuration lives in:

Backend Config File

1backend/app/config.py

backend/app/config.py

It loads environment variables, resolves file paths, configures ChromaDB storage, and supports both frontend origins:

Allowed Frontend Origins

1http://localhost:51732http://127.0.0.1:5173

http://localhost:5173
http://127.0.0.1:5173

This matters because browsers treat localhost and 127.0.0.1 as different origins.

Step 7 - RAG Logic

The RAG logic lives in:

RAG Logic File

1backend/app/rag.py

backend/app/rag.py

This file handles:

OpenAI client setup
ChromaDB collection setup
Text chunking
Embedding generation
Document indexing
Context retrieval
AI answer generation

The backend also includes:

Chroma Telemetry Adapter

1backend/app/chroma_telemetry.py

backend/app/chroma_telemetry.py

This disables noisy ChromaDB telemetry messages during local development.

Step 8 - FastAPI Backend

The FastAPI app lives in:

FastAPI App

1backend/app/main.py

backend/app/main.py

It provides these routes:

Backend Routes

1GET  /2GET  /health3POST /index4POST /chat

GET  /
GET  /health
POST /index
POST /chat

The /index route indexes the knowledge.txt file.

The /chat route accepts a user question, retrieves relevant context, and returns an AI-generated answer.

Step 9 - React Frontend

The React app lives in:

React App

1frontend/src/App.jsx

frontend/src/App.jsx

The UI includes:

A clean chatbot layout
An Index Docs button
A message area
Loading states
A text input
A send button

The frontend calls the backend at:

Frontend API URL

1http://127.0.0.1:8000

http://127.0.0.1:8000

Step 10 - Run Both Projects with One Script

The project includes a helper script:

Development Script

1scripts/dev.sh

scripts/dev.sh

Run both backend and frontend from the project root:

Run Full Project

$./scripts/dev.sh

./scripts/dev.sh

This script:

Checks for pyenv
Creates .venv if missing
Creates backend/.env if missing
Installs backend dependencies
Installs frontend dependencies if needed
Starts FastAPI on port 8000
Starts Vite React on port 5173
Stops both servers when you press Ctrl+C

After running the script, open:

Frontend URL

1http://127.0.0.1:5173

http://127.0.0.1:5173

Backend docs are available at:

FastAPI Docs

1http://127.0.0.1:8000/docs

http://127.0.0.1:8000/docs

Step 11 - Index the Documents

Open the React app and click:

Index Button

1Index Docs

Index Docs

This sends a request to:

Index API

1POST http://127.0.0.1:8000/index

POST http://127.0.0.1:8000/index

Expected response:

Index Response

1{2  "message": "Document indexed successfully",3  "chunks": 74}

{
  "message": "Document indexed successfully",
  "chunks": 7
}

The exact chunk count may change depending on the content in knowledge.txt.

Step 12 - Ask Questions

After indexing, ask questions based on the content you added inside backend/app/documents/knowledge.txt.

For example, if your knowledge.txt contains company services, ask about services. If it contains product documentation, ask product questions. If it contains policies, ask policy-related questions.

The chatbot retrieves relevant chunks from ChromaDB and sends that context to OpenAI before generating the final answer.

How the RAG Chatbot Works Internally

When a user asks a question:

React sends the question to FastAPI.
FastAPI converts the question into an embedding.
ChromaDB searches for similar document chunks.
The retrieved chunks are added as context.
OpenAI generates an answer based on that context.
The response is sent back to the React chatbot UI.

This makes the chatbot more reliable because it answers using your own data instead of relying only on model memory.

Production Improvements

This project is a practical starting point. For production, you can add PDF uploads, source citations, authentication, chat history, streaming responses, metadata filters, rate limiting, logging, monitoring, and Docker deployment.

Keep API keys in the backend, re-index documents after changing knowledge.txt, and make sure your frontend origin is included in FRONTEND_ORIGINS.

Final Thoughts

RAG makes AI chatbots more useful by grounding answers in your own documents. With Python, FastAPI, React, OpenAI, and ChromaDB, this project gives you a clean foundation for customer support bots, internal knowledge assistants, SaaS help centers, and documentation search.

Ready to turn this idea into a working system?

Share your goals, workflow, or product challenge. We will review the details and recommend the most practical next step.

Build a RAG AI Chatbot with Python, React, OpenAI, and ChromaDB

Introduction

What We Are Building

Application Architecture

Backend

Frontend

Project Folder Structure

Step 1 - Clone the GitHub Repository

Step 2 - Use pyenv for Python

Step 3 - Create the Virtual Environment

Step 4 - Add Environment Variables

Step 5 - Understand the Knowledge File

Step 6 - Backend Configuration

Step 7 - RAG Logic

Step 8 - FastAPI Backend

Step 9 - React Frontend

Step 10 - Run Both Projects with One Script

Step 11 - Index the Documents

Step 12 - Ask Questions

How the RAG Chatbot Works Internally

Production Improvements

Final Thoughts

Ready to turn this idea into a working system?

Discussion

Leave a Comment Cancel reply