|
# Tutorial for WebUI 1.5 Version |
|
## We have added two new features |
|
- We have added text prompts to allow for interactive selection of objects that will be tracked in the video. |
|
- We can now interactively add multiple objects for tracking in the video. |
|
|
|
|
|
## Text-Prompts |
|
### 1. Clone Grounding-DINO to `./src` |
|
``` |
|
pip install -e git+https://github.com/IDEA-Research/GroundingDINO.git@main#egg=GroundingDINO |
|
``` |
|
|
|
### 2. Switch to Text-Tab by clicking `Text` Tab |
|
|
|
<p align="center"> |
|
<img src="./img/switch2textT.jpg" height="400"> |
|
</p> |
|
|
|
### 3. Upload video or use example dicectly |
|
|
|
### 4. Enter text to select the objects you are interested in |
|
- The `.` is used to split text, just like in the original Grounding-Dino setting. |
|
|
|
<p align="center"> |
|
<img src="./img/enter_text.jpg" height="400", width="400"> |
|
</p> |
|
|
|
### 5. Get mask of selected object by clicking `Detect` button |
|
- SAMTrack initialization may take some time. |
|
|
|
<p align="center"> |
|
<img src="./img/detect_result.jpg" height="400", width="400"> |
|
</p> |
|
|
|
### 6. Track in video |
|
|
|
## Multi-Objects select |
|
### 1. Once we interactively add an object mask, we can click the `Add new object button` to prepare to add a new object. |
|
|
|
<p align="center"> |
|
<img src="./img/new_object.jpg" height="400", width="400"> |
|
</p> |
|
|
|
### 2. Add a new object by clicking object |
|
|
|
<p align="center"> |
|
<img src="./img/second_object.jpg" height="400", width="400"> |
|
</p> |
|
|
|
### 3. You can add as many objects as you want by clicking `Add new object` button. |