---
title: README
emoji: 🦀
colorFrom: red
colorTo: pink
sdk: static
pinned: false
---

Amphion is an open-source audio, music, and speech generation toolkit developed by a team led by Prof. [Zhizheng Wu](https://drwuz.com/) at the Chinese University of Hong Kong, Shenzhen. The toolkit is developed in collaboration with [OpenMMLab](https://github.com/open-mmlab).

Amphion's north-star goal is to offer a platform for studying the conversion of any input into audio. It is designed to support reproducible research and to help junior researchers and engineers get started in audio, music, and speech generation research and development. A distinguishing feature of Amphion is its visualizations of classic models and architectures, which we believe help junior researchers and engineers better understand how those models work.

Technical Report: [https://huggingface.co/papers/2312.09911](https://huggingface.co/papers/2312.09911)

Discord: [https://discord.com/invite/ZxxREr3Y](https://discord.com/invite/ZxxREr3Y)