DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper ā¢ 2501.12948 ā¢ Published 4 days ago ā¢ 192
view article Article MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era By MiniMax-AI ā¢ 11 days ago ā¢ 40
UnifiedCrawl: Aggregated Common Crawl for Affordable Adaptation of LLMs on Low-Resource Languages Paper ā¢ 2411.14343 ā¢ Published Nov 21, 2024 ā¢ 7
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF Text Generation ā¢ Updated Oct 25, 2024 ā¢ 334k ā¢ 2.01k
view reply Interesting, but how does this approach generalize to arbitrary user query / document domains? Would you need to train a separate network for each domain / dataset?
view article Article GaLore: Advancing Large Model Training on Consumer-grade Hardware Mar 20, 2024 ā¢ 26
Qwen2-VL Collection Vision-language model series based on Qwen2 ā¢ 16 items ā¢ Updated Dec 6, 2024 ā¢ 193