Chujie Zheng's picture

Chujie Zheng

chujiezheng

·

https://chujiezheng.github.io/

AI & ML interests

Large Language Models

Recent Activity

liked a model about 16 hours ago

Qwen/Qwen2.5-VL-72B-Instruct-AWQ

upvoted a paper about 16 hours ago

Qwen2.5-VL Technical Report

authored a paper 7 days ago

Aligning Instruction Tuning with Pre-training

View all activity

Organizations

chujiezheng's activity

upvoted a paper about 16 hours ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 9 days ago • 150

upvoted a paper 8 days ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published 8 days ago • 92

upvoted a paper about 1 month ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published Jan 21 • 63

upvoted 3 papers about 2 months ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 92

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published Jan 10 • 70

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published Jan 2 • 49

upvoted a collection 2 months ago

QVQ

QVQ: Qwen models for visual reasoning • 7 items • Updated Jan 1 • 43

upvoted a paper 2 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 347

upvoted 3 papers 3 months ago

Evaluating and Aligning CodeLLMs on Human Preference

Paper • 2412.05210 • Published Dec 6, 2024 • 47

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 80

Yi-Lightning Technical Report

Paper • 2412.01253 • Published Dec 2, 2024 • 27

upvoted a collection 3 months ago

QwQ

Qwen with Questions • 2 items • Updated Nov 28, 2024 • 59

upvoted an article 4 months ago

Article

Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick

By

•

Oct 24, 2024

• 10

upvoted a paper 4 months ago

A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models

Paper • 2410.13841 • Published Oct 17, 2024 • 17

upvoted a paper 5 months ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 141

upvoted a paper 7 months ago

I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative Self-Enhancement Paradigm

Paper • 2408.08072 • Published Aug 15, 2024 • 35

upvoted a collection 9 months ago

Nemotron 4 340B

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated Jan 17 • 162

upvoted a paper 10 months ago

Weak-to-Strong Extrapolation Expedites Alignment

Paper • 2404.16792 • Published Apr 25, 2024 • 11

upvoted 2 collections 10 months ago

Eurus

Advancing LLM Reasoning Generalists with Preference Trees • 11 items • Updated Oct 22, 2024 • 24

Weak-to-Strong Extrapolation Expedites Alignment

Better aligned models obtained by weak-to-strong model extrapolation (ExPO) • 25 items • Updated 5 days ago • 17