Text-to-Audio
Audiocraft
magnet
alonzi commited on
Commit
9fd2eab
1 Parent(s): f1961dc

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +21 -0
README.md CHANGED
@@ -169,3 +169,24 @@ More information can be found in the paper [Masked Audio Generation using a Sing
169
  **Risks and harms:** Biases and limitations of the model may lead to generation of samples that may be considered as biased, inappropriate or offensive. We believe that providing the code to reproduce the research and train new models will allow to broaden the application to new and more representative data.
170
 
171
  **Use cases:** Users must be aware of the biases, limitations and risks of the model. MAGNeT is a model developed for artificial intelligence research on music generation. As such, it should not be used for downstream applications without further investigation and mitigation of risks.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
169
  **Risks and harms:** Biases and limitations of the model may lead to generation of samples that may be considered as biased, inappropriate or offensive. We believe that providing the code to reproduce the research and train new models will allow to broaden the application to new and more representative data.
170
 
171
  **Use cases:** Users must be aware of the biases, limitations and risks of the model. MAGNeT is a model developed for artificial intelligence research on music generation. As such, it should not be used for downstream applications without further investigation and mitigation of risks.
172
+
173
+
174
+ ## Audio-MAGNeT - Sound-effect generation models
175
+
176
+ ### Training datasets
177
+
178
+ The audio-magnet models were trained on the following data sources: a subset of AudioSet (Gemmeke et al., 2017), [BBC sound effects](https://sound-effects.bbcrewind.co.uk/), AudioCaps (Kim et al., 2019), Clotho v2 (Drossos et al., 2020), VGG-Sound (Chen et al., 2020), FSD50K (Fonseca et al., 2021), [Free To Use Sounds](https://www.freetousesounds.com/all-in-one-bundle/), [Sonniss Game Effects](https://sonniss.com/gameaudiogdc), [WeSoundEffects](https://wesoundeffects.com/we-sound-effects-bundle-2020/), [Paramount Motion - Odeon Cinematic Sound Effects](https://www.paramountmotion.com/odeon-sound-effects).
179
+
180
+
181
+ ### Evaluation datasets
182
+
183
+ The audio-magnet models (sound effect generation) were evaluated on the [AudioCaps benchmark](https://audiocaps.github.io/).
184
+
185
+ ### Evaluation results
186
+
187
+ Below are the objective metrics obtained with the released audio-magnet models on AudioCaps (consisting of 10-second long samples).
188
+
189
+ | Model | Frechet Audio Distance | KLD |
190
+ |---|---|---|
191
+ | facebook/audio-magnet-small | 3.21 | 1.42 |
192
+ | facebook/audio-magnet-medium | 2.32 | 1.64 |