search_engine / README.md
Vitomir Jovanović
Glancing + new data
591de4e
---
title: Search Engine
emoji: 🔥
colorFrom: green
colorTo: red
sdk: streamlit
sdk_version: 1.39.0
app_file: app.py
pinned: false
short_description: Semantic Search engine with Faiss
---
Check out the configuration reference at https://huggingface.co./docs/hub/spaces-config-
### For local deployment run
```
fast_api.py
```
Script creates swagger app with endpoints on [localhost:8084](http://127.0.0.1:8084/docs). First endpoint return the top k semanticaly most similar prompts with query prompt. Second endpoint returns all similarites with query (only applicable for very small datasets).
### Data Ingestion
```
data_reader.py
```
creates data of various prompts for encoding into vector database, from prompt-picture dataset.
Local database encoded only 11000 prompts.
Faiss index that is used is small and not optimized, used for experimental datasets. Search is brute force, not optimised.
### Streamlit
```
streamlit run app.py
```
Should be run for streamlit app, it can be assessed locally on http://localhost:8501.
### Docker
```
docker build -t my-streamlit-app .
```
from main dir