Bingzheng Wei

Bingzheng

AI & ML interests

None yet

Organizations

None yet

Bingzheng's activity

upvoted a paper about 16 hours ago

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published 2 days ago • 49

upvoted a paper 2 days ago

KV-Edit: Training-Free Image Editing for Precise Background Preservation

Paper • 2502.17363 • Published 4 days ago • 29

upvoted 4 papers 3 days ago

Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based Perspective

Paper • 2502.17262 • Published 4 days ago • 17

OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference

Paper • 2502.18411 • Published 3 days ago • 61

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published 3 days ago • 54

SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference

Paper • 2502.18137 • Published 4 days ago • 46

upvoted a paper 8 days ago

RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning

Paper • 2502.13144 • Published 10 days ago • 36

upvoted 4 papers 9 days ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 9 days ago • 150

Craw4LLM: Efficient Web Crawling for LLM Pretraining

Paper • 2502.13347 • Published 10 days ago • 27

LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization

Paper • 2502.13922 • Published 9 days ago • 25

Small Models Struggle to Learn from Strong Reasoners

Paper • 2502.12143 • Published 11 days ago • 27

upvoted 2 papers 10 days ago

Continuous Diffusion Model for Language Modeling

Paper • 2502.11564 • Published 12 days ago • 49

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published 13 days ago • 135

upvoted a paper 11 days ago

Large Language Diffusion Models

Paper • 2502.09992 • Published 15 days ago • 95

upvoted 6 papers 12 days ago

Competitive Programming with Large Reasoning Models

Paper • 2502.06807 • Published 25 days ago • 65

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Paper • 2502.07316 • Published 18 days ago • 45

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published 16 days ago • 182

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 24 days ago • 196

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 108

Baichuan-Omni-1.5 Technical Report

Paper • 2501.15368 • Published Jan 26 • 61