Revisiting Hierarchical Text Classification: Inference and Metrics Paper • 2410.01305 • Published Oct 2, 2024
Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning Paper • 2502.06533 • Published 18 days ago • 18