Qwen/Qwen2.5-Math-PRM-7B

Text Classification

feature-extraction

text-generation-inference

Inference Endpoints

Community

Ask questions about training data construction

#8 opened 3 days ago by

A question about the effectiveness of Qwen2.5-Math-PRM-7B in reinforcement learning

#7 opened 4 days ago by

If the response length exceeds 4096, is a sliding window used, or is it simply truncated?

#6 opened 7 days ago by

Question about the step separator "\n\n"

#3 opened 9 days ago by

Could you clarify whether the PRM800K deduplication was performed using the original 5000-test set from MATH or the MATH500 dataset?

#2 opened 10 days ago by

vLLM support

#1 opened 10 days ago by