Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
239
49
141
Ross Wightman
rwightman
Follow
Molbap's profile picture
KarolCodes's profile picture
a-r-r-o-w's profile picture
252 followers
Β·
109 following
wightmanr
rwightman
AI & ML interests
Computer vision, transfer learning, semi/self supervised learning, robotics.
Recent Activity
liked
a dataset
about 9 hours ago
MLCommons/unsupervised_peoples_speech
upvoted
an
article
6 days ago
Open-R1: a fully open reproduction of DeepSeek-R1
reacted
to
merve
's
post
with π₯
9 days ago
Oof, what a week! π₯΅ So many things have happened, let's recap! https://huggingface.co./collections/merve/jan-24-releases-6793d610774073328eac67a9 Multimodal π¬ - We have released SmolVLM -- tiniest VLMs that come in 256M and 500M, with it's retrieval models ColSmol for multimodal RAG π - UI-TARS are new models by ByteDance to unlock agentic GUI control π€― in 2B, 7B and 72B - Alibaba DAMO lab released VideoLlama3, new video LMs that come in 2B and 7B - MiniMaxAI released Minimax-VL-01, where decoder is based on MiniMax-Text-01 456B MoE model with long context - Dataset: Yale released a new benchmark called MMVU - Dataset: CAIS released Humanity's Last Exam (HLE) a new challenging MM benchmark LLMs π - DeepSeek-R1 & DeepSeek-R1-Zero: gigantic 660B reasoning models by DeepSeek, and six distilled dense models, on par with o1 with MIT license! π€― - Qwen2.5-Math-PRM: new math models by Qwen in 7B and 72B - NVIDIA released AceMath and AceInstruct, new family of models and their datasets (SFT and reward ones too!) Audio π£οΈ - Llasa is a new speech synthesis model based on Llama that comes in 1B,3B, and 8B - TangoFlux is a new audio generation model trained from scratch and aligned with CRPO Image/Video/3D Generation β―οΈ - Flex.1-alpha is a new 8B pre-trained diffusion model by ostris similar to Flux - tencent released Hunyuan3D-2, new 3D asset generation from images
View all activity
Articles
Timm β€οΈ Transformers: Use any timm model with transformers
18 days ago
β’
37
Trick or ResNet Treat
Oct 31, 2024
β’
5
Mamba Out
Oct 18, 2024
β’
8
Tiny Test Models
Oct 2, 2024
β’
6
Searching for better (Full) ImageNet ViT Baselines
Aug 26, 2024
β’
4
MobileNet Baselines
Jul 26, 2024
β’
23
MobileNet-V4 (now in timm)
Jun 17, 2024
β’
40
Organizations
rwightman
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
safetensors/convert
10 days ago
Allow running conversion after closing a previous PR.
8
#21 opened about 1 year ago by
rwightman
New activity in
safetensors/convert
13 days ago
Update convert.py
#37 opened 13 days ago by
rwightman
New activity in
timm/efficientformer_l7.snap_dist_in1k
18 days ago
Adding `safetensors` variant of this model
#1 opened 18 days ago by
SFconvertbot
New activity in
timm/davit_tiny.msft_in1k
18 days ago
Adding `safetensors` variant of this model
#1 opened 18 days ago by
SFconvertbot
New activity in
timm/davit_base.msft_in1k
18 days ago
Adding `safetensors` variant of this model
#2 opened 18 days ago by
SFconvertbot
New activity in
timm/levit_128.fb_dist_in1k
18 days ago
Adding `safetensors` variant of this model
#1 opened 18 days ago by
SFconvertbot
New activity in
timm/davit_small.msft_in1k
18 days ago
Adding `safetensors` variant of this model
#1 opened 18 days ago by
SFconvertbot
New activity in
timm/mobilenetv4_conv_small.e2400_r224_in1k
about 1 month ago
Can't get attribute 'UniversalInvertedResidual'
2
#7 opened about 1 month ago by
sddwadsa
New activity in
pixparse/cc3m-wds
about 1 month ago
Converting Arrow to WebDataset TAR Format for Offline Use
2
#5 opened about 1 month ago by
katie312
New activity in
timm/mobilenetv4_conv_small.e2400_r224_in1k
about 1 month ago
model.eval()οΌresults are wrong?
1
#6 opened about 1 month ago by
liyufeng
New activity in
timm/efficientformerv2_s1.snap_dist_in1k
about 2 months ago
Adding `safetensors` variant of this model
#1 opened about 2 months ago by
SFconvertbot
New activity in
timm/efficientformerv2_s2.snap_dist_in1k
about 2 months ago
Adding `safetensors` variant of this model
#1 opened about 2 months ago by
SFconvertbot
New activity in
timm/fastvit_t8.apple_dist_in1k
about 2 months ago
Update "first_conv" in config.json
#2 opened about 2 months ago by
Cinq108
New activity in
timm/efficientformer_l1.snap_dist_in1k
about 2 months ago
Adding `safetensors` variant of this model
#1 opened about 2 months ago by
SFconvertbot
New activity in
timm/efficientformerv2_l.snap_dist_in1k
about 2 months ago
Adding `safetensors` variant of this model
#1 opened about 2 months ago by
SFconvertbot
New activity in
timm/ViT-B-16-SigLIP-i18n-256
2 months ago
Are the languages that are supported documented anywhere?
2
#1 opened 2 months ago by
Jesse-marqo
New activity in
pixparse/cc12m-wds
2 months ago
Is this where all the data is?
1
#3 opened 2 months ago by
showstarpro
New activity in
laion/CLIP-ViT-B-32-xlm-roberta-base-laion5B-s13B-b90k
2 months ago
Upload pytorch_model.bin
3
#3 opened 3 months ago by
prasadpr20
New activity in
timm/efficientformerv2_s0.snap_dist_in1k
3 months ago
Adding `safetensors` variant of this model
#1 opened 3 months ago by
SFconvertbot
New activity in
laion/CLIP-ViT-L-14-CommonPool.XL-s13B-b90K
3 months ago
Adding `safetensors` variant of this model
#1 opened 3 months ago by
SFconvertbot
Load more