MilkyMikey1104's Collections
Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation
Paper
•
2407.05361
•
Published
•
2
allenai/pixmo-ask-model-anything
Viewer
•
Updated
•
162k
•
339
•
3
Viewer
•
Updated
•
272k
•
398
•
6
Viewer
•
Updated
•
717k
•
922
•
24
Viewer
•
Updated
•
195k
•
1.13k
•
12
google/paligemma2-3b-pt-896
Image-Text-to-Text
•
Updated
•
1.98k
•
23
FunAudioLLM/CosyVoice-ttsfrd
FunAudioLLM/CosyVoice-300M
FunAudioLLM/SenseVoiceSmall
Updated
•
1.26k
•
226
NexaAIDev/Octopus-v2
Text Generation
•
Updated
•
1.01k
•
880
weizhiwang/LongMem-558M
Viewer
•
Updated
•
84.1k
•
82
•
1
laion/laion-audio-preview
Viewer
•
Updated
•
4.15M
•
5.62k
•
10
laion/laion-high-resolution
Viewer
•
Updated
•
166M
•
1.17k
•
84
AIDC-AI/Marco-o1
Text Generation
•
Updated
•
7.35k
•
711
product-science/xlam-function-calling-60k-raw-augmented
Viewer
•
Updated
•
89.8k
•
176
•
1
Trust but Verify: Programmatic VLM Evaluation in the Wild
Paper
•
2410.13121
•
Published
•
2
Salesforce/xlam-function-calling-60k
Viewer
•
Updated
•
60k
•
3.39k
•
417