English
File size: 1,104 Bytes
1622f6b
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
# What's New

[2024/10] A new [VideoCLIP-XL-v2](https://huggingface.co./alibaba-pai/VideoCLIP-XL-v2) model has been released.

[2024/10] Initial commit for the [VideoCLIP-XL](https://huggingface.co./alibaba-pai/VideoCLIP-XL) model, the [VILD](https://huggingface.co./alibaba-pai/VILD) dataset, and the [LVDR](https://huggingface.co./alibaba-pai/LVDR) benchmark.

# VideoCLIP-XL (eXtra Length)

This model is proposed from [VideoCLIP-XL paper](https://arxiv.org/abs/2410.00741). 
It aims to advance long description understanding for video CLIP Models.

# Install
~~~
# 1. Create your environment
# 2. Install torch
# 3. Then:
pip install -r requirements.txt
~~~

# Usage
Please refer to ```demo.py```.

# Source
~~~
@misc{wang2024videoclipxladvancinglongdescription,
      title={VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models}, 
      author={Jiapeng Wang and Chengyu Wang and Kunzhe Huang and Jun Huang and Lianwen Jin},
      year={2024},
      eprint={2410.00741},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2410.00741}, 
}
~~~