|
---
|
|
title: my_tango
|
|
app_file: app.py
|
|
sdk: gradio
|
|
sdk_version: 4.44.1
|
|
---
|
|
<div align="center">
|
|
<!-- <p align="center"> <img src="./assets/EMAGE_2024/1711449143651.jpg" width="100px"> </p> -->
|
|
<h2>TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation</h2>
|
|
|
|
<a href='https://pantomatrix.github.io/TANGO/'><img src='https://img.shields.io/badge/Project-TANGO-blue' alt='Project'></a>
|
|
<a href='https://www.youtube.com/watch?v=_DfsA11puBc'><img src='https://img.shields.io/badge/YouTube-TANGO-rgb(255, 0, 0)' alt='Youtube'></a>
|
|
<a href='https://huggingface.co./spaces/H-Liu1997/TANGO'><img src='https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue'></a>
|
|
<a href='https://arxiv.org/abs/2410.04221'><img src='https://img.shields.io/badge/Paper-ArXiv-yellow' alt='Project'></a>
|
|
|
|
</div>
|
|
|
|
# News
|
|
|
|
Welcome contributors! Feel free to submit the pull requests!
|
|
|
|
- **[2024/10]** Welcome to try our [TANGO](<(https://huggingface.co./spaces/H-Liu1997/TANGO)!>) on Hugging face space !
|
|
- **[2024/10]** Code for create gesture graph is available.
|
|
|
|
<p align=center>
|
|
<img src ="./assets/hg.png" width="60%" >
|
|
</p>
|
|
|
|
# Results Videos
|
|
|
|
<p align="center">
|
|
<img src="./assets/demo8.gif" width="32%" alt="demo0">
|
|
<img src="./assets/demo1.gif" width="32%" alt="demo1">
|
|
<img src="./assets/demo2.gif" width="32%" alt="demo2">
|
|
</p>
|
|
<p align="center">
|
|
<img src="./assets/demo3.gif" width="32%" alt="demo3">
|
|
<img src="./assets/demo5.gif" width="32%" alt="demo5">
|
|
<img src="./assets/demo0.gif" width="32%" alt="demo6">
|
|
</p>
|
|
<p align="center">
|
|
<img src="./assets/demo7.gif" width="32%" alt="demo7">
|
|
<img src="./assets/demo6.gif" width="32%" alt="demo8">
|
|
<img src="./assets/demo9.gif" width="32%" alt="demo9">
|
|
</p>
|
|
|
|
# Demo Video (on Youtube)
|
|
|
|
<p align=center>
|
|
<a href="https://youtu.be/xuhD_-tMH1w?si=Tr6jHAhOR1fxWIjb">
|
|
<img width="68%" src="./assets/video.png">
|
|
</a>
|
|
</p>
|
|
|
|
# π Release Plans
|
|
|
|
- [ ] Training codes for AuMoClip and ACInterp
|
|
- [ ] Inference codes for ACInterp
|
|
- [ ] Processed Youtube Buiness Video data (very small, around 15 mins)
|
|
- [x] Scripts for creating gesture graph
|
|
- [x] Inference codes with AuMoClip and pretrained weights
|
|
|
|
# βοΈ Installation
|
|
|
|
## Clone the repository
|
|
|
|
```shell
|
|
git clone https://github.com/CyberAgentAILab/TANGO.git
|
|
cd TANGO
|
|
git clone https://github.com/justinjohn0306/Wav2Lip.git
|
|
git clone https://github.com/dajes/frame-interpolation-pytorch.git
|
|
```
|
|
|
|
## Build Environtment
|
|
|
|
We Recommend a python version `==3.9.20` and cuda version `==11.8`. Then build environment as follows:
|
|
|
|
```shell
|
|
# [Optional] Create a virtual env
|
|
conda create -n tango python==3.9.20
|
|
conda activate tango
|
|
# Install with pip:
|
|
pip install -r ./pre-requirements.txt
|
|
pip install -r ./requirements.txt
|
|
```
|
|
|
|
# π Training and Inference
|
|
|
|
## Inference
|
|
|
|
Here is the command for running inference scripts under the path `<your root>/TANGO/`, it will take around 3 min to generate two 8s vidoes. You can visualize by directly check the video or check the result .npz files via blender using our blender addon in [EMAGE](https://github.com/PantoMatrix/PantoMatrix).
|
|
|
|
_Necessary checkpoints and pre-computed graphs will be automatically downloaded during the first run. Please ensure that at least 35GB of disk space is available._
|
|
|
|
```shell
|
|
python app.py
|
|
```
|
|
|
|
### Create the graph for custom character
|
|
|
|
```shell
|
|
python create_graph.py
|
|
```
|
|
|
|
# Copyright Information
|
|
|
|
We thanks the open-source project [Wav2Lip](https://github.com/Rudrabha/Wav2Lip), [FiLM](https://github.com/caffeinism/FiLM-pytorch), [SMPLerX](https://github.com/caizhongang/SMPLer-X).
|
|
|
|
Check out our previous works for Co-Speech 3D motion Generation <a href="https://github.com/PantoMatrix/PantoMatrix">DisCo, BEAT, EMAGE</a>.
|
|
|
|
This project is only for research or education purposes, and not freely available for commercial use or redistribution. The srcipt is available only under the terms of the [Attribution-NonCommercial 4.0 International](https://creativecommons.org/licenses/by-nc/4.0/legalcode) (CC BY-NC 4.0) license.
|
|
|