ytol's Collections
Robotics stack
Updated Apr 23
openai/whisper-base • Automatic Speech Recognition • Updated Feb 29 • 446k • 187
HuggingFaceM4/idefics2-8b-AWQ • Image-Text-to-Text • Updated May 6 • 242 • 26
parler-tts/parler_tts_mini_v0.1 • Text-to-Speech • Updated Apr 30 • 25.3k • 346
dora-rs/dora-idefics2 • Updated May 5 • 213 • 5
MIT/ast-finetuned-speech-commands-v2 • Audio Classification • Updated Sep 10, 2023 • 4.41k • 13
jxu124/OpenX-Embodiment • Updated 25 days ago • 3.75k • 43
LiheYoung/depth-anything-small-hf • Depth Estimation • Updated Jan 25 • 111k • 26
ybelkada/segment-anything • Updated Dec 26, 2023 • 95