Ahmad Hamze Homepage
  • Posts
  • Mapping Earthquakes Locations on a Map
  • Pictionary using websockets
  • AI Blogs
    • Chatbot with Vector Database and FastAPI
    • Medical Chatbot
  • Algorithmics Blogs
    • Recursion with Memoization
  • Automation Programs
    • Puppeteer vs Selenium
    • Selenium Speed Test
  • Cellular Automata
  • Infrastructure & DevOps
    • Chatbot Deployment
  • Mathematics
    • From random walk to diffusion
    • Simple C++ matrix calculation
  • Opinion Blogs
    • With due apology to front-end developers
    • Advice to my past self
    • Changing Jobs and Tech Stacks
    • Web Dev Learned Lessons
  • React Applications
    • React Trivia Quiz
    • Storybook with React
RAG Chatbot: Vector Database Approach

In a previous blog, I wrote about a RAG chatbot that I created using a dataset of medical questions and answers. The RAG chatbot embedded the user’s question and retrieved the most relevant answers from an embedding file. Then, using the retrieved answers, GPT-4o-mini generated the final answer. In this blog, we will get rid of the embedding file and use a vector database instead, the code will be repurposed to be smaller in size in order to deploy the chatbot using Docker and AWS.

May 1, 2025 Read
RAG Medical Chatbot

A few years ago, when I started delving deeper into programming, AI was one of the main topics that captivated my interest. It was only natural for me to stumble upon this subject since I was drawn to scientific programming and Python. Back in the day, AI wasn’t the hype as it is today. Most importantly, when it was discussed, it was associated with different domains, not just LLMs as it is today.

April 9, 2025 Read
Navigation
  • About me
  • Skills
  • Projects
  • Blogs
Contact me:
  • Email: ahmadhamze@yahoo.com

Toha
© 2021 Copyright.
Powered by Hugo Logo