Perusha Moodley

moodlep

https://www.perusha.dev/

AI & ML interests

RL, DRL, Decision Transformers, Auxiliary signals, self-supervised methods

moodlep's activity

upvoted a paper 8 days ago

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published 30 days ago • 95

updated a model 15 days ago

moodlep/smollm2-17b-dpo-cai-v1

Updated 15 days ago • 6

updated 2 models 21 days ago

moodlep/smollm2-1.7b-instr-sft-cai

Updated 21 days ago • 14

moodlep/mistral-7b-sft-constitutional-ai

Updated 21 days ago • 5

liked a model 22 days ago

HuggingFaceTB/SmolLM2-1.7B

Text Generation • Updated Nov 24, 2024 • 25.7k • 89

upvoted a collection 24 days ago

Scaling Test-Time Compute with Open Models

Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated 18 days ago • 22

liked 3 models 25 days ago

unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit

Text Generation • Updated Nov 22, 2024 • 368k • 50

NousResearch/Llama-2-7b-chat-hf

Text Generation • Updated Jun 3, 2024 • 276k • 181

unsloth/mistral-7b-v0.3-bnb-4bit

Text Generation • Updated Nov 22, 2024 • 42.2k • 14

upvoted a collection about 1 month ago

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 32 items • Updated 18 days ago • 64

liked a dataset 3 months ago

neuralwork/arxiver

Viewer • Updated Nov 1, 2024 • 63.4k • 354 • 357

updated a collection 9 months ago

Decision-Transformer-Related

2 items • Updated Apr 23, 2024

liked a model 9 months ago

jat-project/jat

Reinforcement Learning • Updated Apr 29, 2024 • 142 • 91

liked a dataset 9 months ago

jat-project/jat-dataset

Viewer • Updated Feb 16, 2024 • 258M • 526k • 34

upvoted an article 9 months ago

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

Article • Apr 22, 2024 • 80

upvoted a paper 9 months ago

Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning

Paper • 2310.20587 • Published Oct 31, 2023 • 17

updated a dataset almost 2 years ago

moodlep/dt_atari_replay_hf

Updated Apr 4, 2023 • 46

updated a model almost 2 years ago

moodlep/rl_course_vizdoom_health_gathering_supreme

Reinforcement Learning • Updated Feb 23, 2023