Steven Zheng's picture

Steven Zheng

Steveeeeeeen

AI & ML interests

speech & audio

Recent Activity

Organizations

Hugging Face's profile picture Hugging Face for Audio's profile picture huggingPartyParis's profile picture MLX Community's profile picture TTS AGI's profile picture Whisper Multilingual Distillation's profile picture Audio Collabs's profile picture open/ acc's profile picture MultiLlasa's profile picture fluxions-hf's profile picture

Steveeeeeeen's activity

upvoted an article 1 day ago
view article
Article

SigLIP 2: A better multilingual vision language encoder

113
upvoted an article 2 days ago
view article
Article

Deploying Speech-to-Speech on Hugging Face

38
upvoted an article 8 days ago
view article
Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

143
upvoted an article 10 days ago
view article
Article

Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥

89
upvoted 3 articles 15 days ago
view article
Article

1 Billion Classifications

39
view article
Article

Efficient Controllable Generation for SDXL with T2I-Adapters

7
view article
Article

Introduction to the Open Leaderboard for Japanese LLMs

35
upvoted an article 16 days ago
view article
Article

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages

By Steveeeeeeen and 1 other
25
upvoted 3 articles 18 days ago
view article
Article

The Open Arabic LLM Leaderboard 2

27
upvoted an article 21 days ago
view article
Article

The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about...

62
upvoted an article 22 days ago
view article
Article

How biased is Whisper ? Evaluating Whisper Models for Robustness to Diverse English Accents

16