Joshua

Xenova

Recent Activity

updated a Space about 3 hours ago
HuggingFaceTB/SmolVLM-256M-Instruct-WebGPU
updated a Space about 4 hours ago
HuggingFaceTB/SmolVLM-500M-Instruct-WebGPU
published a Space about 4 hours ago
HuggingFaceTB/SmolVLM-500M-Instruct-WebGPU

Organizations

Hugging Face, Google, Gradio, Hugging Face Internal Testing Organization, onnx, Huggingface.js, Huggingface Projects, Whisper Distillation, Hugging Face OSS Metrics, Core ML Projects, Blog-explorers, Useful Sensors Inc., Terrain Diffusion, Hugging Face TB Research, kotol, Static Templates, gg-hf, Nerfies, WebML Community, Llava Hugging Face, mozilla, Hugging Face - Visual Blocks, Social Post Explorers, hsramall, gg-tt, ONNX Community, Hugging Face Discord Community, LLHF, SLLHF, Hugging Quants, nltpt, Hugging Face Party @ PyTorch Conference, rmbg, ExecuTorch Community, Hello Games

Posts 19

Introducing Kokoro.js, a new JavaScript library for running Kokoro TTS, an 82-million-parameter text-to-speech model, 100% locally in the browser with WASM. Powered by 🤗 Transformers.js. WebGPU support coming soon!
👉 npm i kokoro-js 👈

Try it out yourself: webml-community/kokoro-web
Link to models/samples: onnx-community/Kokoro-82M-ONNX

You can get started in just a few lines of code!
import { KokoroTTS } from "kokoro-js";

// Load the ONNX export of the model at the chosen precision.
const tts = await KokoroTTS.from_pretrained(
  "onnx-community/Kokoro-82M-ONNX",
  { dtype: "q8" }, // Options: fp32, fp16, q8, q4, q4f16
);

// Synthesize speech for the text with the selected voice.
const text = "Life is like a box of chocolates. You never know what you're gonna get.";
const audio = await tts.generate(text, {
  voice: "af_sky", // See `tts.list_voices()`
});

// Write the generated audio to a WAV file.
audio.save("audio.wav");
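
If the dtype comes from user input or configuration, it may help to check it against the five options listed in the comment above before loading the model. The following is only a sketch: `KOKORO_DTYPES`, `resolveDtype`, and `buildOptions` are hypothetical helpers, not part of kokoro-js, and the `"q8"` fallback is an assumption taken from the snippet above rather than a documented default.

```javascript
// The five precision options named in the post's code comment.
const KOKORO_DTYPES = ["fp32", "fp16", "q8", "q4", "q4f16"];

// Hypothetical helper (not part of kokoro-js): return the requested dtype if
// it is one of the listed options, otherwise fall back to a default.
function resolveDtype(requested, fallback = "q8") {
  return KOKORO_DTYPES.includes(requested) ? requested : fallback;
}

// Hypothetical helper: build the options object passed as the second
// argument to KokoroTTS.from_pretrained.
function buildOptions(requested) {
  return { dtype: resolveDtype(requested) };
}
```

With these in place, the load call could be written as `KokoroTTS.from_pretrained("onnx-community/Kokoro-82M-ONNX", buildOptions("q4"))`, and an unrecognized value would quietly fall back to `"q8"` under the assumption stated above.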

Huge kudos to the Kokoro TTS community, especially taylorchu for the ONNX exports and Hexgrad for the amazing project! None of this would be possible without you all! 🤗

The model is also extremely resilient to quantization. The smallest variant is only 86 MB in size (down from the original 326 MB), with no noticeable difference in audio quality! 🤯
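
To put those sizes in perspective, here is a rough back-of-the-envelope calculation using only the figures from the post (82 million parameters, 326 MB, 86 MB). It assumes 1 MB = 10^6 bytes and treats the whole file as weights, so the bytes-per-parameter values are approximate; `sizeStats` is a hypothetical helper written for this illustration.

```javascript
// Figures quoted in the post.
const PARAMS = 82e6;      // 82 million parameters
const ORIGINAL_MB = 326;  // original model size
const SMALLEST_MB = 86;   // smallest quantized variant

// Assumption: 1 MB = 10^6 bytes (the post does not say MB vs MiB).
const BYTES_PER_MB = 1e6;

// Hypothetical helper: compare two model file sizes for a given parameter count.
function sizeStats(originalMb, quantizedMb, params) {
  return {
    ratio: originalMb / quantizedMb,                    // how many times smaller
    reductionPct: (1 - quantizedMb / originalMb) * 100, // percent size saved
    bytesPerParamOriginal: (originalMb * BYTES_PER_MB) / params,
    bytesPerParamQuantized: (quantizedMb * BYTES_PER_MB) / params,
  };
}

const stats = sizeStats(ORIGINAL_MB, SMALLEST_MB, PARAMS);
console.log(stats.ratio.toFixed(2));                  // 3.79
console.log(stats.reductionPct.toFixed(1));           // 73.6
console.log(stats.bytesPerParamOriginal.toFixed(2));  // 3.98
console.log(stats.bytesPerParamQuantized.toFixed(2)); // 1.05
```

So the smallest variant is roughly 3.8 times smaller, a reduction of about 74%. The original works out to about 4 bytes per parameter, which matches 32-bit floats, while the smallest variant averages about 1 byte per parameter.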