Cloud Cents RAG Model
Model Description
Cloud Cents is a fine-tuned GPT-2 model that answers questions about AWS, Azure, GCP, and other cloud platforms. It uses Retrieval-Augmented Generation (RAG): relevant cloud documents and pricing records are retrieved first, then passed to the generator as context for the answer.
Intended Use
- Cloud service comparisons (e.g., AWS vs Azure)
- Real-time cloud pricing queries
- General information about cloud platforms and services
Pipeline
text-generation with retrieval-augmented generation (RAG)
- Document retrieval from rag_contents and pricing_info
- Text generation using fine-tuned GPT-2
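The two pipeline steps above can be sketched roughly as follows. This is a minimal illustration, not the model's actual code: the sample rows, the function names (`retrieve`, `build_prompt`, `generate`), and the prompt layout are all hypothetical, and a toy bag-of-words cosine similarity stands in for the FAISS index the card mentions. The `generate` helper assumes the `transformers` library is installed and defaults to the base `gpt2` checkpoint as a placeholder for the fine-tuned weights.

```python
import math
import re
from collections import Counter

# Hypothetical stand-ins for rows of rag_contents and pricing_info.
RAG_CONTENTS = [
    "Amazon EC2 provides resizable virtual machines on AWS.",
    "Azure Blob Storage is an object store for unstructured data.",
    "Google Cloud Storage is the object storage service on GCP.",
]
PRICING_INFO = [
    "Placeholder pricing row for EC2 (no real price shown).",
]


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two token-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, documents, k=2):
    """Return the k documents most similar to the query (toy retriever)."""
    q = Counter(tokenize(query))
    ranked = sorted(
        documents,
        key=lambda d: cosine(q, Counter(tokenize(d))),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(query, passages):
    """Place retrieved passages ahead of the question (assumed layout)."""
    context = "\n".join(f"- {p}" for p in passages)
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"


def generate(prompt, model_id="gpt2"):
    """Generate text with a GPT-2 checkpoint; model_id is a placeholder."""
    from transformers import pipeline  # imported lazily; optional dependency

    generator = pipeline("text-generation", model=model_id)
    return generator(prompt, max_new_tokens=64)[0]["generated_text"]


if __name__ == "__main__":
    question = "What is Azure Blob Storage?"
    passages = retrieve(question, RAG_CONTENTS + PRICING_INFO, k=2)
    print(build_prompt(question, passages))
```

Retrieval and prompt construction are kept separate from generation so that the retriever can be inspected on its own before any model is loaded.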
Datasets
- rag_contents: Contains cloud-related documents from sources such as AWS, Azure, GCP.
- pricing_info: Provides real-time pricing details for cloud services (e.g., EC2, Blob Storage, Container Registry).
Metrics
- Perplexity: Measures the fluency of generated text (lower is better).
- BLEU: Measures n-gram overlap between generated answers and reference answers, used as a proxy for answer accuracy.
- Retrieval Accuracy: Custom metric for FAISS-based document retrieval.
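As a rough illustration of two of these metrics, the sketch below computes perplexity as the exponential of the mean negative log-likelihood per token, which is the standard definition. The card does not define its custom retrieval metric, so `retrieval_accuracy` assumes a top-k hit rate (the share of queries whose expected document appears among the first k results). Both function names and the document-id inputs are hypothetical.

```python
import math


def perplexity(token_log_probs):
    """exp(mean negative log-likelihood), using natural-log token probabilities."""
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    return math.exp(-sum(token_log_probs) / len(token_log_probs))


def retrieval_accuracy(retrieved_ids, relevant_ids, k=1):
    """Share of queries whose relevant id is in the top-k retrieved ids.

    Assumed definition: the model card only calls this a custom metric.
    """
    if not relevant_ids:
        raise ValueError("need at least one query")
    hits = sum(
        1
        for ranked, gold in zip(retrieved_ids, relevant_ids)
        if gold in ranked[:k]
    )
    return hits / len(relevant_ids)
```

For example, a model that assigns probability 0.5 to every token has a perplexity of 2, and a retriever that ranks the expected document first for one of two queries scores 0.5 at k=1.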
Limitations
- The model may not always retrieve the most up-to-date information about cloud services.
- Retrieval and generation quality depends on the documents stored in the rag_contents table.
Evaluation results
- Retrieval Accuracy on rag_contents (self-reported): 85.000
- Pricing Retrieval Accuracy on pricing_info (self-reported): 75.000