Skip to content

LashawnFofung/RAG-Pipelines

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

223 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AI-Powered Document Automation Platform: A RAG Journey 🚀

This repository documents my technical evolution from writing basic LLM prompts to engineering a production-ready Retrieval-Augmented Generation (RAG) Proof of Concept (PoC). Each folder and notebook represents a critical milestone in mastering LlamaIndex, open-source model deployment, and intelligent document orchestration.

🎯 Foundational Concepts

The foundational concepts to building a fully functional RAG application:

  • RAG Fundamentals: Gain a deep understanding of how RAG pipelines work and the core concepts of information retrieval and processing in the context of LLMs.

  • LlamaIndex Mastery: Learn to effectively use LlamaIndex for organizing, indexing, and efficiently searching through large, unstructured document sets.

  • Model Integration: Practice integrating open-source Large Language Models (LLMs) into the RAG workflow.

  • Practical Application: Build a functional, simple chatbot that uses the RAG pipeline to retrieve and synthesize relevant data for accurate, context-aware responses.

🗺️ The Development Roadmap

Phase 1: Foundations of LLM & RAG 🏗️

Focus: Establishing basic interaction, initial RAG logic, and environment setup.


Phase 2: Retrieval Science & Benchmarking 🧪

Focus: Optimizing how data is processed, stored, and retrieved.


Phase 3: Intelligent Routing & Privacy 🔓

Focus: Managing multi-document "blobs," metadata, and local open-source deployment.


Phase 4: UI Development & Lite Implementation 🎨

Focus: Creating user-friendly interfaces for research and deployment.


Phase 5: Production PoC Deliverable 🏆

Focus: Integrating all modules into a unified, enterprise-grade platform.


🛠️ Tech Stack & Skills

  • Orchestration: LlamaIndex

  • Models: Google Gemini, Mistral 7B (Local), Phi-2, TinyLlama

  • Vector DB/Indices: FAISS, VectorStoreIndex

  • Embeddings: BGE, HuggingFace Transformers

  • UI/UX: Gradio 5.x

  • Processing: PyMuPDF, OpenCV, Multithreading


🎯 Final POC Feature Highlights

Feature Technical Solution Benefit
Context Isolation Semantic Routing via Gemini Prevents "Context Contamination" from irrelevant docs.
High-Speed Ingestion Parallel ThreadPool Processing 5x faster parsing of 100+ page documents.
Source Trust Metadata Tagging & Citations Real-time source badges for every AI response.
Open-Source Ready GGUF & LlamaCPP Integration Zero-cost, private deployment options.


About

This repository provides a comprehensive, hands-on guide to building a Retrieval-Augmented Generation (RAG) pipeline. RAG is a critical architecture for grounding Large Language Models (LLMs) with external, up-to-date, or private knowledge bases, effectively turning a generic AI into an expert on your specific data.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors