RAG-based medical Q&A over the Merck Manual (19th ed.) — ChromaDB, GTE-large embeddings, Mistral 7B (GGUF). Local, privacy-first, GPU-accelerated.
▶ View full notebook (rendered HTML) — opens the entire notebook in your browser (GitHub README cannot embed HTML; this link shows the full report).
This project implements a Retrieval-Augmented Generation (RAG) system that gives healthcare professionals instant, contextually accurate access to medical knowledge from the Merck Manual of Diagnosis and Therapy (19th Edition). It reduces information overload and supports clinical decision-making with a local, privacy-first design.
| File | Description |
|---|---|
RAG_medical_assistant.ipynb |
Jupyter notebook: data load, indexing, RAG pipeline, and evaluation |
RAG-Medical-Assistant.html |
Static HTML export of the notebook for viewing in a browser |
git clone https://github.com/ananttripathi/RAG-based-Medical-Assistant.git
cd RAG-based-Medical-Assistant
pip install -r requirements.txt
RAG_medical_assistant.ipynb in Jupyter.merck_pdf_path to the path of your Merck Manual PDF (or use the Google Drive setup in the notebook).View the notebook: Use the View full notebook link at the top, or open RAG-Medical-Assistant.html locally in a browser.
# CUDA
CMAKE_ARGS="-DLLAMA_CUBLAS=on" pip install llama-cpp-python --force-reinstall --no-cache-dir
# CPU only
CMAKE_ARGS="-DLLAMA_CUBLAS=off" pip install llama-cpp-python --force-reinstall --no-cache-dir
Flow: Query → embed → retrieve top-k chunks from ChromaDB → build context → Mistral generates answer.
This project is licensed under the MIT License.
For questions or suggestions, open a GitHub Issue.