fitz pymupdf PyPDF2 transformers==4.41.2 torch streamlit python-docx nltk presidio-analyzer frontend