---
license: mit
---

### (NeurIPS 2023) Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models 
### Official Model Repo

#### Models Included:
- Stage1-CAVP Pretrained Model.
- Stage2-LDM Pretrained Model.
- Double Guidance Classifier.

<p align="center">
    <img src="teaser.png">
</p>

## BibTeX

```bibtex
@misc{luo2023difffoley,
  title={Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models},
  author={Simian Luo and Chuanhao Yan and Chenxu Hu and Hang Zhao},
  year={2023},
  eprint={2306.17203},
  archivePrefix={arXiv},
  primaryClass={cs.SD}
}
```