microsoft/Phi-4-multimodal-instruct • Automatic Speech Recognition • Updated about 4 hours ago • 7.35k • 513
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution • Paper • 2502.18449 • Published 3 days ago • 54
Text2World: Benchmarking Large Language Models for Symbolic World Model Generation • Paper • 2502.13092 • Published 10 days ago • 12
MM-RLHF: The Next Step Forward in Multimodal LLM Alignment • Paper • 2502.10391 • Published 14 days ago • 30
huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated-v2 • Text Generation • Updated 13 days ago • 6.59k • 118