Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
Dan Busbridge
dbusbridge
Follow
apple-intelligence's profile picture
huggingbitch's profile picture
2 followers
·
1 following
danbusbridge
dbusbridge
AI & ML interests
Deep learning, optimization, self-supervised learning, representation learning, large language modeling, equivariance, geometric deep learning, attention mechanisms, transformers
Recent Activity
authored
a paper
15 days ago
Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models
authored
a paper
5 months ago
Theory, Analysis, and Best Practices for Sigmoid Self-Attention
commented
on
a paper
over 1 year ago
How to Scale Your EMA
View all activity
Organizations
Papers
3
arxiv:
2501.12370
arxiv:
2409.04431
arxiv:
2307.13813
models
None public yet
datasets
None public yet