Dongfu Jiang's picture

Dongfu Jiang

DongfuJiang

·

https://jdf-prog.github.io/

AI & ML interests

Large Language Model, Modality Reasoning and their evaluation

Recent Activity

upvoted an article about 10 hours ago

DualPipe could be better without the Dual

upvoted a paper 1 day ago

SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference

updated a model 1 day ago

CodeDPO/qwen25-coder-base-7b-reinforce-plus_v2_mini_processed_r1

View all activity

Organizations

Papers 11

arxiv:2502.01718

arxiv:2410.10563

arxiv:2406.15252

arxiv:2406.11069

models 38

DongfuJiang/Qwen2-VL-VAE-7B-Instruct

Image-Text-to-Text • Updated Dec 17, 2024 • 13

DongfuJiang/Qwen2-VL-VAE-7B-Instruct-mochi-vae

Text2Text Generation • Updated Dec 17, 2024 • 10

DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_12_pt

Text Generation • Updated Dec 9, 2024 • 28

DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_12_sft

Text Generation • Updated Dec 9, 2024 • 28

DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_6_pt

Text Generation • Updated Dec 9, 2024 • 22

DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_6_sft

Text Generation • Updated Dec 9, 2024 • 28

DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_pt

Text Generation • Updated Dec 7, 2024 • 21

DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_sft

Updated Dec 7, 2024

DongfuJiang/prm_gsm_2k_with_full_sol_mix_ref_remove_all_correct_hf

Text Generation • Updated Dec 1, 2024 • 6 • 1

DongfuJiang/prm_qwen25_math_gsm_2k_with_full_sol_mix_ref_redistribution_hf

Text Generation • Updated Dec 1, 2024 • 10

datasets 12

DongfuJiang/PRM_SFT

Viewer • Updated Dec 1, 2024 • 4.01M • 21

DongfuJiang/zeroeval

Viewer • Updated Nov 27, 2024 • 13.5k • 245

DongfuJiang/PRM_eval

Viewer • Updated Nov 27, 2024 • 9.54k • 12

DongfuJiang/eval

Viewer • Updated Nov 27, 2024 • 6k • 87

DongfuJiang/PRM_prepared

Viewer • Updated Nov 26, 2024 • 39.9k • 16

DongfuJiang/PRM_train

Viewer • Updated Nov 25, 2024 • 32.7k • 20

DongfuJiang/MATH-500

Viewer • Updated Nov 6, 2024 • 500 • 75

DongfuJiang/simpo_v2_ultrafeedback

Viewer • Updated Aug 2, 2024 • 59.9k • 52

DongfuJiang/VAPO

Viewer • Updated Jul 31, 2024 • 72.5k • 20

DongfuJiang/PairRM-data

Viewer • Updated Jul 30, 2024 • 586k • 9