Piper recording studio dataset reference

#14
by paolapersico1 - opened

Hi, I recorded my own voice with Piper Recording Studio to create an Italian dataset (I read all 1000+ prompts) and then trained a medium model, fine-tuned from the lessac model. I would like to submit it, but I don't know what dataset reference I should put in the MODEL_CARD. Do I also have to upload the dataset somewhere?

Rhasspy org

You don't have to upload it, but it would be a great contribution :)
If you're willing to donate your data to the public domain, I'd be happy to host it here: https://github.com/NabuCasa/voice-datasets

Otherwise, you can just say it came from a personal dataset.

Thank you for the answer! Since I really appreciate your work, I've decided to give something back: I've created the PR for the model, and you can find the dataset here: https://huggingface.co/datasets/paolapersico1/Voice-Dataset-Italian. I hope everything's okay :)

Rhasspy org

Looks great, this is much appreciated! Would you like the dataset to be called "paola"?

yes, thank you!
