Shanghaoran Quan's picture

7 22

Shanghaoran Quan

quanshr

·

quanshr

AI & ML interests

Large Language Model

Recent Activity

upvoted a paper 5 days ago

Qwen2.5 Technical Report

upvoted a paper 14 days ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

liked a model about 1 month ago

Qwen/Qwen2.5-Coder-32B-Instruct

View all activity

Organizations

quanshr's activity

upvoted a paper 5 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 5 days ago • 323

upvoted a paper 14 days ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published 15 days ago • 68

liked a model about 1 month ago

Qwen/Qwen2.5-Coder-32B-Instruct

Text Generation • Updated Nov 18 • 376k • • 1.36k

upvoted a collection about 1 month ago

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated 27 days ago • 257

updated a dataset about 2 months ago

quanshr/LonGen

Viewer • Updated Nov 7 • 240 • 48 • 1

liked a dataset about 2 months ago

quanshr/LonGen

Viewer • Updated Nov 7 • 240 • 48 • 1

authored 2 papers about 2 months ago

Aligning CodeLLMs with Direct Preference Optimization

Paper • 2410.18585 • Published Oct 24

Language Models can Self-Lengthen to Generate Long Texts

Paper • 2410.23933 • Published Oct 31 • 17

upvoted a paper about 2 months ago

Language Models can Self-Lengthen to Generate Long Texts

Paper • 2410.23933 • Published Oct 31 • 17

authored a paper 2 months ago

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

Paper • 2410.07985 • Published Oct 10 • 28

liked a model 2 months ago

Qwen/Qwen2.5-Coder-7B-Instruct

Text Generation • Updated Nov 18 • 149k • 369

upvoted a paper 2 months ago

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

Paper • 2410.07985 • Published Oct 10 • 28

liked a Space 4 months ago

Qwen2-VL-72B

updated 2 datasets 5 months ago

quanshr/DailyM-SFT

Viewer • Updated Jul 19 • 117k • 63 • 1

quanshr/DailyM

Viewer • Updated Jul 16 • 1k • 94 • 1

updated a model 5 months ago

quanshr/Qwen-DailyM-32B-LoRA

Updated Jul 16 • 1

upvoted 2 collections 6 months ago

AugCon

Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity • 4 items • Updated Jul 2 • 1

DMoERM

DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling • 2 items • Updated Jul 4 • 1

updated a collection 6 months ago

DMoERM

DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling • 2 items • Updated Jul 4 • 1