1 2 19

Zhiwei He

zwhe99

https://zwhe99.github.io/

AI & ML interests

Natural Language Processing

Recent Activity

upvoted a paper 4 days ago

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

authored a paper 10 days ago

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

liked a dataset 23 days ago

HuggingFaceTB/finemath

View all activity

Organizations

None yet

zwhe99's activity

upvoted a paper 4 days ago

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published 12 days ago • 34

authored a paper 10 days ago

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published 12 days ago • 34

liked a dataset 23 days ago

HuggingFaceTB/finemath

Viewer • Updated 20 days ago • 48.3M • 35.3k • 243

liked a model 23 days ago

HuggingFaceTB/finemath-classifier

Text Classification • Updated 24 days ago • 195 • 8

updated a dataset 24 days ago

zwhe99/aime90

Viewer • Updated 24 days ago • 90 • 88

updated a dataset 26 days ago

zwhe99/gsm8k

Viewer • Updated 26 days ago • 8.79k • 43

updated 2 datasets 29 days ago

zwhe99/mathpile-text

Viewer • Updated 29 days ago • 469k • 37

zwhe99/mp-textbooks

Viewer • Updated 29 days ago • 3.98k • 36

updated 2 datasets about 1 month ago

zwhe99/MATH-DIFFIC

Viewer • Updated Dec 11, 2024 • 17.5k • 58

zwhe99/aime24

Viewer • Updated Dec 6, 2024 • 30 • 66 • 1

New activity in GAIR/MathPile about 1 month ago

DatasetGenerationError

#4 opened about 1 month ago by

zwhe99

authored a paper about 1 month ago

Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding

Paper • 2411.18462 • Published Nov 27, 2024 • 6

liked a dataset about 2 months ago

AI-MO/NuminaMath-CoT

Viewer • Updated Nov 25, 2024 • 860k • 3.42k • 308

updated 2 datasets 2 months ago

zwhe99/MATH

Viewer • Updated Oct 31, 2024 • 17.5k • 194

zwhe99/amc23

Viewer • Updated Oct 30, 2024 • 40 • 55 • 1

updated a dataset 4 months ago

zwhe99/commonsense_170k

Viewer • Updated Sep 3, 2024 • 170k • 246 • 2

upvoted a collection 5 months ago

Model Merging

Collection

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 224

updated a dataset 5 months ago

zwhe99/agent-sci-general

Viewer • Updated Aug 8, 2024 • 24.5k • 29

updated a dataset 6 months ago

zwhe99/agent-general

Viewer • Updated Jul 29, 2024 • 20.9k • 28

liked a dataset 9 months ago

CherryDurian/shadow-alignment

Viewer • Updated Oct 7, 2023 • 500 • 72 • 6