why don't we try to crowdsource actual human voices? What would be the "conditions" (i.e. high-quality, wav encoding, clear pronunciation, no-noise environment, etc.)? I mean, 100 hours isn't that much, especially for a small and free model (basically a gift to mankind)?
Tom Primozic
tomprimozic
ยท
AI & ML interests
None yet
Recent Activity
View all activity