Custom Chatbot Builder & Hosting Platform

Creators can upload documents for knowledge base, configure guardrails and tone, then deploy via API or an embeddable web widget. The system is multi‑tenant with isolated data for each user, session memory per conversation, and allows users to set system prompt and score threshold.

FastAPI React Qdrant Vector DB RAG AWS (EC2 · S3 · CloudFront · Cognito)
🔗 View Live Demo

Architecture & Stack

  • Frontend: React Web App
  • Backend: FastAPI on EC2, Open AI API: Embedding & Chat
  • Data: Qdrant (vector DB), Redis (session memory)
  • Storage: AWS S3 for document storage / retrieval

Challenges

  • Deciding how to score chunk revelevancy, how many chunks to feed into chat context
  • Once a user deletes a document from a project, the embeddings from this doc are removed from the vector DB

What I Learned

  • RAG pipelines rely heavily on document parsing and storage
  • How to manage knowledge bases, allow user deletions / changes