Edit model card

☁️ Cloud Cents RAG Model ☁️

πŸ“– Model Description

This model is a fine-tuned GPT-2 model designed to answer cloud-related questions about AWS, Azure, GCP, and other cloud platforms. It uses Retrieval-Augmented Generation (RAG) to combine document retrieval with text generation, leveraging cloud-related documents and real-time pricing information.

πŸ›  Intended Use

  • πŸ” Cloud service comparisons (e.g., AWS vs Azure)
  • πŸ’° Real-time cloud pricing queries
  • 🌐 General information about cloud platforms and services

πŸ”§ Pipeline

text-generation with retrieval-augmented generation (RAG)

  • πŸ“š Document retrieval from rag_contents and pricing_info
  • ✍️ Text generation using fine-tuned GPT-2

πŸ“Š Datasets

  • rag_contents: Contains cloud-related documents from sources such as AWS, Azure, GCP.
  • pricing_info: Provides real-time pricing details for cloud services (e.g., EC2, Blob Storage, Container Registry).

πŸ“ˆ Metrics

  • Perplexity: Evaluated for fluency of text generation.
  • BLEU: Used for measuring the accuracy of generated answers.
  • Retrieval Accuracy: Custom metric for FAISS-based document retrieval.

⚠️ Limitations

  • πŸ•‘ The model may not always retrieve the most up-to-date information about cloud services.
  • πŸ“‚ The retrieval and generation quality is based on the documents stored in the rag_contents table.
Downloads last month
183
Safetensors
Model size
124M params
Tensor type
F32
Β·
Inference API
Unable to determine this model's library. Check the docs .

Evaluation results