Tom Primozic

tomprimozic

AI & ML interests

None yet

Recent Activity

replied to hexgrad's post 20 days ago

📣 Looking for labeled, high-quality synthetic audio/TTS data 📣

Have you been or are you currently calling API endpoints from OpenAI, ElevenLabs, etc? Do you have labeled audio data sitting around gathering dust? Let's talk! Join https://discord.gg/QuGxSWBfQy or comment down below.

If your data exceeds quantity & quality thresholds and is approved into the next https://hf.co/hexgrad/Kokoro-82M training mix, and you permissively DM me the data under an effective Apache license, then I will DM back the corresponding voicepacks for YOUR data if/when the next Apache-licensed Kokoro base model drops.

What does this mean? If you've been calling closed-source TTS or audio API endpoints to:
- Build voice agents
- Make long-form audio, like audiobooks or podcasts
- Handle customer support, etc

Then YOU can contribute to the training mix and get useful artifacts in return. ❤️

More details at https://hf.co/hexgrad/Kokoro-82M/discussions/21

tomprimozic's activity

replied to hexgrad's post 20 days ago

Why don't we try to crowdsource actual human voices? What would the "conditions" be (e.g. high quality, WAV encoding, clear pronunciation, a noise-free environment)? I mean, 100 hours isn't that much, especially for a small, free model (basically a gift to mankind).