Hugging Face
Daniil Tiapkin
dtiapkin
0 followers · 1 following
https://d-tiapkin.github.io/
dtiapkin
d-tiapkin
dtiapkin.bsky.social
AI & ML interests
Reinforcement learning enjoyer
Recent Activity
upvoted a paper about 24 hours ago
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model
upvoted a paper about 24 hours ago
On Teacher Hacking in Language Model Distillation
authored a paper 1 day ago
Demonstration-Regularized RL
Organizations
None yet
Papers
3
arxiv:2502.02671
arxiv:2310.17303
arxiv:2303.08059
models
3
dtiapkin/RL-Course-ppo-LunarLander-v2 • Reinforcement Learning • Updated Feb 21, 2023
dtiapkin/ppo-LunarLander-v2-try2 • Updated May 10, 2022
dtiapkin/ppo-LunalLander-v2 • Reinforcement Learning • Updated May 10, 2022
datasets
None public yet