Perusha Moodley
moodlep
AI & ML interests
RL, DRL, Decision Transformers, Auxiliary signals, self-supervised methods
Recent Activity
upvoted
an
article
about 13 hours ago
SmolLM - blazingly fast and remarkably powerful
liked
a Space
1 day ago
nanotron/ultrascale-playbook
liked
a dataset
about 1 month ago
Anthropic/hh-rlhf
Organizations
Collections
1
models
9
moodlep/smollm2-17b-dpo-cai-v1
Updated
•
9
moodlep/smollm2-1.7b-instr-sft-cai-v1
Updated
moodlep/smollm2-1.7b-instr-sft-cai
Updated
•
5
moodlep/mistral-7b-sft-constitutional-ai
Updated
•
6
moodlep/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
moodlep/output
Updated
moodlep/a2c-AntBulletEnv-v0
Reinforcement Learning
•
Updated
•
1
moodlep/ppo-Huggy
Reinforcement Learning
•
Updated
•
55
moodlep/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
•
3