Spaces:
Running
Running
File size: 1,383 Bytes
2580556 2210481 2580556 2210481 2580556 2210481 2580556 2210481 97335b3 2210481 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 |
---
title: Deploy Qdrant RAG
emoji: π
colorFrom: blue
colorTo: purple
sdk: docker
pinned: false
license: apache-2.0
---
# Deploying RAG powered by Qdrant as vector db and fastembed for embedding and retrieval
#### β QUESTION #1:
Why do we want to support streaming? What about streaming is important, or useful?
#### ANSWER #1:
The goal of streaming in this context is to render the generated answers in chunks. Thus reducing latency specifically for answers containing a lot of tokens
#### β QUESTION #2:
Why are we using User Session here? What about Python makes us need to use this? Why not just store everything in a global variable?
#### ANSWER #2:
Users sessions are used to keep track of users activity. It can be used to retrieve contxt from previous conversations or separate conversions
#### β Discussion Question #1:
Upload a PDF file of the recent DeepSeek-R1 paper and ask the following questions:
1. What is RL and how does it help reasoning?
2. What is the difference between DeepSeek-R1 and DeepSeek-R1-Zero?
3. What is this paper about?
Does this application pass your vibe check? Are there any immediate pitfalls you're noticing?
#### β Discussion
Not really. He doesnt know what is RL but he can respond to the other questions...
data:image/s3,"s3://crabby-images/47f57/47f57860932057d715766fb280cb1347b9ed4ae8" alt="image"
## π§ CHALLENGE MODE π§
Added Qdrant as vector db
Hugging Face Space link :
|