Kashif Rasul's picture

Kashif Rasul

kashif

·

AI & ML interests

Time Series Forecasting, Denoising Diffusion, Generative Modeling, Reinforcement Learning

Recent Activity

updated a model about 23 hours ago

google/timesfm-2.0-500m-pytorch

new activity about 23 hours ago

google/timesfm-2.0-500m-pytorch:Upload 2 files

upvoted a paper 3 days ago

MONSTER: Monash Scalable Time Series Evaluation Repository

View all activity

Organizations

kashif's activity

upvoted a paper 3 days ago

MONSTER: Monash Scalable Time Series Evaluation Repository

Paper • 2502.15122 • Published 8 days ago • 2

upvoted an article 18 days ago

Article

Open R1: Update #2

By

and 6 others •

18 days ago

• 191

upvoted an article 28 days ago

Article

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

By

•

28 days ago

• 40

upvoted a paper about 2 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 258

upvoted an article about 2 months ago

Article

Process Reinforcement through Implicit Rewards

By

and 1 other •

Jan 3

• 24

upvoted 2 papers 3 months ago

Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving

Paper • 2407.00079 • Published Jun 24, 2024 • 5

RRM: Robust Reward Model Training Mitigates Reward Hacking

Paper • 2409.13156 • Published Sep 20, 2024 • 5

upvoted 2 papers 5 months ago

A Rate-Distortion View of Uncertainty Quantification

Paper • 2406.10775 • Published Jun 16, 2024 • 1

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 137

upvoted a paper 6 months ago

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7, 2024 • 13

upvoted a collection 6 months ago

Power-LM

Dense & MoE LLMs trained with power learning rate scheduler. • 4 items • Updated Oct 17, 2024 • 15

upvoted a paper 6 months ago

Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents

Paper • 2408.07199 • Published Aug 13, 2024 • 21

upvoted 2 papers 7 months ago

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12, 2024 • 119

Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF

Paper • 2405.21046 • Published May 31, 2024 • 4

upvoted 4 articles 8 months ago

Article

Putting RL back in RLHF

Jun 12, 2024

• 77

Article

🧨 Diffusers welcomes Stable Diffusion 3

Jun 12, 2024

• 93

Article

The Annotated Diffusion Model

Jun 7, 2022

• 150

Article

Welcome Gemma 2 - Google's new open LLM

Jun 27, 2024

• 128

upvoted 2 papers 8 months ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25, 2024 • 93

GLiNER multi-task: Generalist Lightweight Model for Various Information Extraction Tasks

Paper • 2406.12925 • Published Jun 14, 2024 • 24