15 7

Ribbit Ribbit

ribbitribbit365

https://RibbitRibbit.co

ribbitribbit365

AI & ML interests

None yet

Recent Activity

commented on a paper about 7 hours ago

Self-rewarding correction for mathematical reasoning

commented on a paper 2 days ago

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

commented on a paper 14 days ago

Distillation Scaling Laws

View all activity

Organizations

None yet

ribbitribbit365's activity

commented a paper about 7 hours ago

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published 2 days ago • 51 •

commented a paper 2 days ago

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Paper • 2502.14768 • Published 8 days ago • 42 •

commented a paper 14 days ago

Distillation Scaling Laws

Paper • 2502.08606 • Published 16 days ago • 46 •

upvoted a paper 14 days ago

Distillation Scaling Laws

Paper • 2502.08606 • Published 16 days ago • 46

commented a paper 17 days ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published 18 days ago • 140 •

commented a paper 18 days ago

Goku: Flow Based Video Generative Foundation Models

Paper • 2502.04896 • Published 22 days ago • 90 •

commented a paper 20 days ago

Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2

Paper • 2502.03544 • Published 23 days ago • 42 •

upvoted a paper 20 days ago

Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2

Paper • 2502.03544 • Published 23 days ago • 42

upvoted a paper 23 days ago

VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models

Paper • 2502.02492 • Published 24 days ago • 57

upvoted a paper 24 days ago

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published 26 days ago • 111

commented a paper 25 days ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published 28 days ago • 107 •

upvoted a paper 25 days ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published 28 days ago • 107

commented a paper 27 days ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 108 •

commented 7 papers about 1 month ago

Chain-of-Retrieval Augmented Generation

Paper • 2501.14342 • Published Jan 24 • 51 •

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published Jan 22 • 83 •

OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints

Paper • 2501.03841 • Published Jan 7 • 53 •