--- title: Easy RAG emoji: 🐢 colorFrom: purple colorTo: indigo sdk: gradio sdk_version: 5.8.0 app_file: app.py pinned: false license: mit short_description: Chat with your docs very Easy --- # DocumentGPT - Advanced Document Analysis with RAG ## Overview DocumentGPT is a cutting-edge document analysis system that leverages the power of Retrieval Augmented Generation (RAG) to provide intelligent responses based on your documents. Built with advanced AI technologies, it allows users to upload multiple document types and get accurate, context-aware responses to their questions. ## Key Features - Multi-Format Support: Process PDF, DOCX, CSV, and TXT files seamlessly - Advanced RAG Implementation: Using state-of-the-art LLM technology with Llama-2 - GPU-Accelerated: Optimized performance with GPU acceleration - Real-Time Processing: Dynamic document processing and instant responses - Source Attribution: Every response includes references to source documents - Interactive Interface: User-friendly Gradio interface for easy interaction ## Technical Stack - Large Language Model: Llama-2-7b-chat-hf - Embeddings: multilingual-e5-large - Vector Store: FAISS - Framework: Gradio - Processing: Langchain - Acceleration: HuggingFace Accelerate ## Getting Started ### Prerequisites - Python 3.8 or higher - GPU support (recommended) - HuggingFace account with access to Llama-2 ### Installation ```bash # Clone the repository git clone https://github.com/yourusername/documentgpt.git cd documentgpt # Install dependencies pip install -r requirements.txt # Set up environment variables export HUGGINGFACE_TOKEN=your_token_here ``` ### Running the Application ```bash python app.py ``` ## Usage 1. Launch the application 2. Upload your documents (PDF, DOCX, CSV, or TXT) 3. Wait for the processing to complete 4. Start asking questions about your documents 5. View responses with source attributions ## Advanced Features - Dynamic Knowledge Base: Updates in real-time as new documents are added - Memory Management: Efficient handling of document processing - Source Tracking: Transparent attribution of information sources - Optimized Performance: GPU acceleration for faster processing ## Author **Camilo Vega** - AI Professor and Solutions Consultant - LinkedIn: [Camilo Vega](https://www.linkedin.com/in/camilo-vega-169084b1/) - GitHub: [CamiloVega](https://github.com/camilovegag) ## Acknowledgments Special thanks to: - HuggingFace for providing GPU acceleration support - Meta AI for the Llama-2 model - The Langchain community for their excellent tools ## Contact For questions and support, please reach out through: - LinkedIn: [Camilo Vega](https://www.linkedin.com/in/camilo-vega-169084b1/) --- Made with by Camilo Vega