Diff-Foley / README.md
SimianLuo's picture
Update README.md
26e6d18
|
raw
history blame
604 Bytes
metadata
license: mit

(NeurIPS 2023) Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models

Official Model Repo

Model Include:

  • Stage1-CAVP Pretrained Model.
  • Stage2-LDM Pretrained Model.
  • Double Guidance Classifier.

BibTeX

@misc{luo2023difffoley, 
title={Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models}, 
author={Simian Luo and Chuanhao Yan and Chenxu Hu and Hang Zhao}, 
year={2023}, 
eprint={2306.17203}, 
archivePrefix={arXiv}, 
primaryClass={cs.SD} 
}