langchain openai streamlit pinecone-client chromadb unstructured pdf2image pytesseract tiktoken pymupdf tabulate sentence-transformers llama-cpp-python altair<5