samtrack / tutorial /tutorial for WebUI-1.5-Version.md
aikenml's picture
Upload folder using huggingface_hub
c985ba4
# Tutorial for WebUI 1.5 Version
## We have added two new features
- We have added text prompts to allow for interactive selection of objects that will be tracked in the video.
- We can now interactively add multiple objects for tracking in the video.
## Text-Prompts
### 1. Clone Grounding-DINO to `./src`
```
pip install -e git+https://github.com/IDEA-Research/GroundingDINO.git@main#egg=GroundingDINO
```
### 2. Switch to Text-Tab by clicking `Text` Tab
<p align="center">
<img src="./img/switch2textT.jpg" height="400">
</p>
### 3. Upload video or use example dicectly
### 4. Enter text to select the objects you are interested in
- The `.` is used to split text, just like in the original Grounding-Dino setting.
<p align="center">
<img src="./img/enter_text.jpg" height="400", width="400">
</p>
### 5. Get mask of selected object by clicking `Detect` button
- SAMTrack initialization may take some time.
<p align="center">
<img src="./img/detect_result.jpg" height="400", width="400">
</p>
### 6. Track in video
## Multi-Objects select
### 1. Once we interactively add an object mask, we can click the `Add new object button` to prepare to add a new object.
<p align="center">
<img src="./img/new_object.jpg" height="400", width="400">
</p>
### 2. Add a new object by clicking object
<p align="center">
<img src="./img/second_object.jpg" height="400", width="400">
</p>
### 3. You can add as many objects as you want by clicking `Add new object` button.