furonghuang-lab's Collections

Easy2Hard-Bench

Easy2Hard-Bench offers six datasets with continuous difficulty ratings, enabling profiling of LLM performance and generalization across difficulties.
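
As a rough illustration of what profiling across difficulties can look like, the sketch below groups per-problem results into difficulty bins and reports accuracy per bin. It assumes each result is a (difficulty, is_correct) pair with the difficulty rating normalized to [0, 1]. That scale, the function name `accuracy_by_difficulty`, and the toy records are hypothetical and not taken from the Easy2Hard-Bench datasets.

```python
from collections import defaultdict


def accuracy_by_difficulty(records, num_bins=5):
    """Return {bin_index: accuracy} for (difficulty, is_correct) pairs.

    Assumption: difficulty is a float in [0, 1]; the real datasets may
    use a different scale and would need rescaling first.
    """
    totals = defaultdict(int)
    correct = defaultdict(int)
    for difficulty, is_correct in records:
        # Clamp so that difficulty == 1.0 lands in the last bin.
        bin_index = min(int(difficulty * num_bins), num_bins - 1)
        totals[bin_index] += 1
        correct[bin_index] += int(is_correct)
    # Only bins that received at least one record are reported.
    return {b: correct[b] / totals[b] for b in sorted(totals)}


# Hypothetical toy results, for illustration only.
toy_records = [
    (0.05, True), (0.15, True), (0.45, True),
    (0.55, False), (0.85, False), (0.95, True),
]
profile = accuracy_by_difficulty(toy_records, num_bins=2)
for bin_index, accuracy in profile.items():
    print(f"bin {bin_index}: accuracy {accuracy:.2f}")
```

A falling accuracy curve from the easy bins to the hard bins is the kind of profile that continuous difficulty ratings make visible. Finer bins give more detail at the cost of noisier estimates per bin.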