Towards Best Practices for Open Datasets for LLM Training • Paper • 2501.08365 • Published 20 days ago • 52
bluepen5805/DeepSeek-R1-Distill-Qwen-14B-Japanese-gguf • Text Generation • Updated 6 days ago • 15.1k • 31
TinySwallow • Collection • Compact Japanese models trained with "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models" • 5 items • Updated 4 days ago • 12
EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents • Paper • 2501.11858 • Published 13 days ago • 5
UI-TARS: Pioneering Automated GUI Interaction with Native Agents • Paper • 2501.12326 • Published 13 days ago • 48