Are LLMs Better than Reported? Detecting Label Errors and Mitigating Their Effect on Model Performance Paper • 2410.18889 • Published 13 days ago • 15
Fine-Grained Prediction of Reading Comprehension from Eye Movements Paper • 2410.04484 • Published about 1 month ago • 1
GLEE: A Unified Framework and Benchmark for Language-based Economic Environments Paper • 2410.05254 • Published 29 days ago • 80