furonghuang-lab/Easy2Hard-Bench
Viewer
•
Updated
•
92.2k
•
503
Easy2Hard-Bench offers six datasets with continuous difficulty ratings, enabling profiling of LLM performance and generalization across difficulties.