Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning? • Paper • 2502.19361 • Published 2 days ago • 19
TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding • Paper • 2502.19400 • Published 2 days ago • 34
GHOST 2.0: generative high-fidelity one shot transfer of heads • Paper • 2502.18417 • Published 3 days ago • 56