pdf-chat-bot / README.md
MrHalk's picture
Update README.md
b275738 verified

A newer version of the Streamlit SDK is available: 1.43.1

Upgrade
metadata
title: Pdf Chat Bot
emoji: πŸ‘
colorFrom: blue
colorTo: blue
sdk: streamlit
sdk_version: 1.42.0
app_file: app.py
pinned: false
short_description: This project is a Streamlit-based application.
thumbnail: >-
  https://cdn-uploads.huggingface.co/production/uploads/662ba6f848b0b3742e6f9261/sYYSsREDiHjnWpDR4jBFS.png

PDF-Chat-Bot

Upload the PDF Files and ask questions

To see Preview

https://pdf-chat-bot-arman.streamlit.app/

Also hosted on Huggingface

https://huggingface.co./spaces/MrHalk/pdf-chat-bot

Chat with PDF using Gemini πŸ’

This project is a Streamlit-based application that allows users to interactively chat with PDF documents using Google's Gemini AI. It enables users to upload PDF files, process them, and ask questions based on the content of the files.


Features

  • πŸ“„ Upload and process multiple PDF files.
  • πŸ’¬ Ask questions based on the content of the PDFs.
  • ⚑ Fast and detailed answers using Google Gemini AI.
  • 🧠 Leverages FAISS for efficient similarity search and text embeddings.
  • πŸ”₯ Supports chunked text processing for large PDFs.

Tech Stack

  • Python 🐍
  • Streamlit for UI
  • PyPDF2 for PDF reading
  • LangChain for text processing and AI integration
  • FAISS for vector similarity search
  • Google Gemini AI (Generative AI model)
  • dotenv for managing environment variables

Prerequisites

  1. Python 3.9 or later
  2. A valid Google API Key.
  3. The following Python libraries installed:
    • streamlit
    • PyPDF2
    • langchain
    • langchain_google_genai
    • google-generativeai
    • faiss-cpu
    • python-dotenv

Known Issues

  1. Pickle Deserialization Warning: Ensure FAISS index files are from trusted sources. If encountering a ValueError, delete the faiss_index folder and reprocess the PDFs.
  2. Large PDFs: Processing very large PDFs may take additional time.

PDF Chat Bot

So called ArmanπŸ˜‡