Jade's picture

Jade

euclaise

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 hours ago

Liger: Linearizing Large Language Models to Gated Recurrent Structures

upvoted a paper about 2 hours ago

Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs

liked a dataset 2 days ago

eth-nlped/mathdial

View all activity

Organizations

euclaise's activity

upvoted 2 papers about 2 hours ago

Liger: Linearizing Large Language Models to Gated Recurrent Structures

Paper • 2503.01496 • Published 3 days ago • 14

Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs

Paper • 2503.01307 • Published 4 days ago • 27

liked 4 datasets 2 days ago

eth-nlped/mathdial

Viewer • Updated 9 days ago • 2.86k • 186 • 5

eth-nlped/stepverify

Viewer • Updated 9 days ago • 1k • 33 • 6

O1-OPEN/OpenO1-SFT

Viewer • Updated Dec 17, 2024 • 77.7k • 1.57k • 358

EricLu/SCP-116K

Viewer • Updated 28 days ago • 117k • 1.04k • 67

upvoted a paper 4 days ago

NeoBERT: A Next-Generation BERT

Paper • 2502.19587 • Published 8 days ago • 37

upvoted a paper 8 days ago

Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models

Paper • 2502.15499 • Published 13 days ago • 13

upvoted a paper 9 days ago

Stable-SPAM: How to Train in 4-Bit More Stably than 16-Bit Adam

Paper • 2502.17055 • Published 11 days ago • 16

upvoted a paper 11 days ago

AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO

Paper • 2502.14669 • Published 14 days ago • 11

upvoted 8 papers 12 days ago

Revisiting the Test-Time Scaling of o1-like Models: Do they Truly Possess Test-Time Scaling Capabilities?

Paper • 2502.12215 • Published 18 days ago • 16

You Do Not Fully Utilize Transformer's Representation Capacity

Paper • 2502.09245 • Published 22 days ago • 34

Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity

Paper • 2502.13063 • Published 16 days ago • 65

REALTALK: A 21-Day Real-World Dataset for Long-Term Conversation

Paper • 2502.13270 • Published 16 days ago • 6

LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization

Paper • 2502.13922 • Published 15 days ago • 25

Small Models Struggle to Learn from Strong Reasoners

Paper • 2502.12143 • Published 17 days ago • 28

Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering

Paper • 2502.13962 • Published 15 days ago • 28

MoM: Linear Sequence Modeling with Mixture-of-Memories

Paper • 2502.13685 • Published 15 days ago • 33

liked 2 datasets 12 days ago

yuan-yang/ReWild

Preview • Updated Jun 26, 2024 • 83 • 2

GAIR/LIMR

Viewer • Updated 17 days ago • 1.39k • 374 • 21