-
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning
Paper • 2502.14768 • Published • 44 -
S^2R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning
Paper • 2502.12853 • Published • 28 -
Diverse Inference and Verification for Advanced Reasoning
Paper • 2502.09955 • Published • 16 -
Distillation Scaling Laws
Paper • 2502.08606 • Published • 46
shanshan wang
cooleel
AI & ML interests
None yet
Recent Activity
updated
a collection
about 11 hours ago
general
updated
a collection
2 days ago
vlms
updated
a collection
2 days ago
vlms
Organizations
Collections
6
-
Prompt-to-Leaderboard
Paper • 2502.14855 • Published • 7 -
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment
Paper • 2502.16894 • Published • 26 -
Generating Skyline Datasets for Data Science Models
Paper • 2502.11262 • Published • 7 -
Crowd Comparative Reasoning: Unlocking Comprehensive Evaluations for LLM-as-a-Judge
Paper • 2502.12501 • Published • 6
models
None public yet