Le0jc commited on
Commit
3ce060a
·
verified ·
1 Parent(s): 631d2b7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -84
README.md CHANGED
@@ -1,7 +1,6 @@
1
  <div align="center">
2
 
3
 
4
- <img src="icon.jpg" width="250"/>
5
 
6
  <h2><center>Tora: Trajectory-oriented Diffusion Transformer for Video Generation</h2>
7
 
@@ -17,8 +16,12 @@ Zhenghao Zhang\*, Junchao Liao\*, Menghao Li, Zuozhuo Dai, Bingxue Qiu, Siyu Zhu
17
 
18
  <a href='https://modelscope.cn/models/xiaoche/Tora'><img src='https://img.shields.io/badge/🤖_ModelScope-weights-%23654dfc'></a>
19
  <a href='https://huggingface.co/Le0jc/Tora'><img src='https://img.shields.io/badge/🤗_HuggingFace-weights-%23ff9e0e'></a>
 
20
  </div>
21
 
 
 
 
22
 
23
  ## 💡 Abstract
24
 
@@ -26,6 +29,8 @@ Recent advancements in Diffusion Transformer (DiT) have demonstrated remarkable
26
 
27
  ## 📣 Updates
28
 
 
 
29
  - `2024/10/31` Model weights uploaded to [HuggingFace](https://huggingface.co/Le0jc/Tora). We also provided an English demo on [ModelScope](https://www.modelscope.cn/studios/Alibaba_Research_Intelligence_Computing/Tora_En).
30
  - `2024/10/23` 🔥🔥Our [ModelScope Demo](https://www.modelscope.cn/studios/xiaoche/Tora) is launched. Welcome to try it out! We also upload the model weights to [ModelScope](https://www.modelscope.cn/models/xiaoche/Tora).
31
  - `2024/10/21` Thanks to [@kijai](https://github.com/kijai) for supporting Tora in ComfyUI! [Link](https://github.com/kijai/ComfyUI-CogVideoXWrapper)
@@ -33,15 +38,6 @@ Recent advancements in Diffusion Transformer (DiT) have demonstrated remarkable
33
  - `2024/08/27` We released our v2 paper including appendix.
34
  - `2024/07/31` We submitted our paper on arXiv and released our project page.
35
 
36
- ## 📑 Table of Contents
37
-
38
- - [Showcases](#%EF%B8%8F-showcases)
39
- - [Model Weights](#-model-weights)
40
- - [Inference](#-inference)
41
- - [Acknowledgements](#-acknowledgements)
42
- - [Our previous work](#-our-previous-work)
43
- - [Citation](#-citation)
44
-
45
  ## 🎞️ Showcases
46
 
47
  https://github.com/user-attachments/assets/949d5e99-18c9-49d6-b669-9003ccd44bf1
@@ -52,80 +48,6 @@ https://github.com/user-attachments/assets/4026c23d-229d-45d7-b5be-6f3eb9e4fd50
52
 
53
  All videos are available in this [Link](https://cloudbook-public-daily.oss-cn-hangzhou.aliyuncs.com/Tora_t2v/showcases.zip)
54
 
55
-
56
- ## 📦 Model Weights
57
-
58
- ### Folder Structure
59
-
60
- ```
61
- Tora
62
- └── sat
63
- └── ckpts
64
- ├── t5-v1_1-xxl
65
- │ ├── model-00001-of-00002.safetensors
66
- │ └── ...
67
- ├── vae
68
- │ └── 3d-vae.pt
69
- └── tora
70
- └── t2v
71
- └── mp_rank_00_model_states.pt
72
- ```
73
-
74
- ### Download Links
75
- *Note: Downloading the `tora` weights requires following the [CogVideoX License](CogVideoX_LICENSE).* You can choose one of the following options: HuggingFace, ModelScope, or native links.
76
- After downloading the model weights, you can put them in the `Tora/sat/ckpts` folder.
77
-
78
- #### HuggingFace
79
-
80
- ```bash
81
- # This can be faster
82
- pip install "huggingface_hub[hf_transfer]"
83
- HF_HUB_ENABLE_HF_TRANSFER=1 huggingface-cli download Le0jc/Tora --local-dir ckpts
84
- ```
85
-
86
- or
87
-
88
- ```bash
89
- # use git
90
- git lfs install
91
- git clone https://huggingface.co/Le0jc/Tora
92
- ```
93
- #### ModelScope
94
- - SDK
95
- ```bash
96
- from modelscope import snapshot_download
97
- model_dir = snapshot_download('xiaoche/Tora')
98
- ```
99
-
100
- - Git
101
- ```bash
102
- git clone https://www.modelscope.cn/xiaoche/Tora.git
103
- ```
104
-
105
- #### Native
106
-
107
- - Download the VAE and T5 model following [CogVideo](https://github.com/THUDM/CogVideo/blob/main/sat/README.md#2-download-model-weights):
108
- - VAE: https://cloud.tsinghua.edu.cn/f/fdba7608a49c463ba754/?dl=1
109
- - T5: [text_encoder](https://huggingface.co/THUDM/CogVideoX-2b/tree/main/text_encoder), [tokenizer](https://huggingface.co/THUDM/CogVideoX-2b/tree/main/tokenizer)
110
- - Tora t2v model weights: [Link](https://cloudbook-public-daily.oss-cn-hangzhou.aliyuncs.com/Tora_t2v/mp_rank_00_model_states.pt). Downloading this weight requires following the [CogVideoX License](CogVideoX_LICENSE).
111
-
112
-
113
-
114
- ## 🔄 Inference
115
-
116
- please refer to our [Github](https://github.com/alibaba/Tora) or [modelscope online demo](https://www.modelscope.cn/studios/Alibaba_Research_Intelligence_Computing/Tora_En)
117
-
118
- ### Recommendations for Text Prompts
119
-
120
- For text prompts, we highly recommend using GPT-4 to enhance the details. Simple prompts may negatively impact both visual quality and motion control effectiveness.
121
-
122
- You can refer to the following resources for guidance:
123
-
124
- - [CogVideoX Documentation](https://github.com/THUDM/CogVideo/blob/main/inference/convert_demo.py)
125
- - [OpenSora Scripts](https://github.com/hpcaitech/Open-Sora/blob/main/scripts/inference.py)
126
-
127
-
128
-
129
  ## 🤝 Acknowledgements
130
 
131
  We would like to express our gratitude to the following open-source projects that have been instrumental in the development of our project:
 
1
  <div align="center">
2
 
3
 
 
4
 
5
  <h2><center>Tora: Trajectory-oriented Diffusion Transformer for Video Generation</h2>
6
 
 
16
 
17
  <a href='https://modelscope.cn/models/xiaoche/Tora'><img src='https://img.shields.io/badge/🤖_ModelScope-weights-%23654dfc'></a>
18
  <a href='https://huggingface.co/Le0jc/Tora'><img src='https://img.shields.io/badge/🤗_HuggingFace-weights-%23ff9e0e'></a>
19
+
20
  </div>
21
 
22
+ This is the official repository for paper "Tora: Trajectory-oriented Diffusion Transformer for Video Generation".
23
+
24
+ ## Please visit our [Github repo](https://github.com/alibaba/Tora) for more details.
25
 
26
  ## 💡 Abstract
27
 
 
29
 
30
  ## 📣 Updates
31
 
32
+ - `2024/12/09` 🔥🔥Diffusers version of Tora and the corresponding model weights are released. Inference VRAM requirements are reduced to around 5 GiB. Please refer to [this](diffusers-version/README.md) for details.
33
+ - `2024/11/25` 🔥Text-to-Video training code released.
34
  - `2024/10/31` Model weights uploaded to [HuggingFace](https://huggingface.co/Le0jc/Tora). We also provided an English demo on [ModelScope](https://www.modelscope.cn/studios/Alibaba_Research_Intelligence_Computing/Tora_En).
35
  - `2024/10/23` 🔥🔥Our [ModelScope Demo](https://www.modelscope.cn/studios/xiaoche/Tora) is launched. Welcome to try it out! We also upload the model weights to [ModelScope](https://www.modelscope.cn/models/xiaoche/Tora).
36
  - `2024/10/21` Thanks to [@kijai](https://github.com/kijai) for supporting Tora in ComfyUI! [Link](https://github.com/kijai/ComfyUI-CogVideoXWrapper)
 
38
  - `2024/08/27` We released our v2 paper including appendix.
39
  - `2024/07/31` We submitted our paper on arXiv and released our project page.
40
 
 
 
 
 
 
 
 
 
 
41
  ## 🎞️ Showcases
42
 
43
  https://github.com/user-attachments/assets/949d5e99-18c9-49d6-b669-9003ccd44bf1
 
48
 
49
  All videos are available in this [Link](https://cloudbook-public-daily.oss-cn-hangzhou.aliyuncs.com/Tora_t2v/showcases.zip)
50
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
51
  ## 🤝 Acknowledgements
52
 
53
  We would like to express our gratitude to the following open-source projects that have been instrumental in the development of our project: