Spaces:
Sleeping
Sleeping
A newer version of the Gradio SDK is available:
5.14.0
metadata
title: Easy RAG
emoji: 🐢
colorFrom: purple
colorTo: indigo
sdk: gradio
sdk_version: 5.8.0
app_file: app.py
pinned: false
license: mit
short_description: Chat with your docs very Easy
DocumentGPT - Advanced Document Analysis with RAG
Overview
DocumentGPT is a cutting-edge document analysis system that leverages the power of Retrieval Augmented Generation (RAG) to provide intelligent responses based on your documents. Built with advanced AI technologies, it allows users to upload multiple document types and get accurate, context-aware responses to their questions.
Key Features
- Multi-Format Support: Process PDF, DOCX, CSV, and TXT files seamlessly
- Advanced RAG Implementation: Using state-of-the-art LLM technology with Llama-2
- GPU-Accelerated: Optimized performance with GPU acceleration
- Real-Time Processing: Dynamic document processing and instant responses
- Source Attribution: Every response includes references to source documents
- Interactive Interface: User-friendly Gradio interface for easy interaction
Technical Stack
- Large Language Model: Llama-2-7b-chat-hf
- Embeddings: multilingual-e5-large
- Vector Store: FAISS
- Framework: Gradio
- Processing: Langchain
- Acceleration: HuggingFace Accelerate
Getting Started
Prerequisites
- Python 3.8 or higher
- GPU support (recommended)
- HuggingFace account with access to Llama-2
Installation
# Clone the repository
git clone https://github.com/yourusername/documentgpt.git
cd documentgpt
# Install dependencies
pip install -r requirements.txt
# Set up environment variables
export HUGGINGFACE_TOKEN=your_token_here
Running the Application
python app.py
Usage
- Launch the application
- Upload your documents (PDF, DOCX, CSV, or TXT)
- Wait for the processing to complete
- Start asking questions about your documents
- View responses with source attributions
Advanced Features
- Dynamic Knowledge Base: Updates in real-time as new documents are added
- Memory Management: Efficient handling of document processing
- Source Tracking: Transparent attribution of information sources
- Optimized Performance: GPU acceleration for faster processing
Author
Camilo Vega
- AI Professor and Solutions Consultant
- LinkedIn: Camilo Vega
- GitHub: CamiloVega
Acknowledgments
Special thanks to:
- HuggingFace for providing GPU acceleration support
- Meta AI for the Llama-2 model
- The Langchain community for their excellent tools
Contact
For questions and support, please reach out through:
- LinkedIn: Camilo Vega
Made with by Camilo Vega