dreamorg/Meta-Llama-3.1-70B-Instruct_1220_llama3.3_bright_12k_v3_reasoning_query_seed_doc_pos_random_neg Viewer • Updated 8 days ago • 18.2k • 21
dreamorg/Meta-Llama-3.1-70B-Instruct_1220_llama3.3_bright_12k_v3_reasoning_query_seed_doc_pos_random_neg Viewer • Updated 8 days ago • 18.2k • 21
dreamorg/Meta-Llama-3.1-70B-Instruct_1130_bright_original_query_seed_doc_pos_random_neg Viewer • Updated 8 days ago • 12.2k • 48
dreamorg/Meta-Llama-3.1-70B-Instruct_1130_bright_original_query_seed_doc_pos_random_neg Viewer • Updated 8 days ago • 12.2k • 48
dreamorg/Meta-Llama-3.1-70B-Instruct_1206_bright_12k_v6_reasoning_query_seed_doc_pos_random_neg Viewer • Updated 8 days ago • 11.6k • 76
dreamorg/Meta-Llama-3.1-70B-Instruct_1206_bright_12k_v6_reasoning_query_seed_doc_pos_random_neg Viewer • Updated 8 days ago • 11.6k • 76
dreamorg/Meta-Llama-3.1-70B-Instruct_1202_bright_120k_reasoning_query_seed_doc_pos_random_neg Viewer • Updated 8 days ago • 111k • 53
dreamorg/Meta-Llama-3.1-70B-Instruct_1202_bright_120k_reasoning_query_seed_doc_pos_random_neg Viewer • Updated 8 days ago • 111k • 53
dreamorg/Meta-Llama-3.1-70B-Instruct_1129_bright_reasoning_query_seed_doc_pos_random_neg Viewer • Updated 8 days ago • 12.2k • 55
dreamorg/Meta-Llama-3.1-70B-Instruct_1129_bright_reasoning_query_seed_doc_pos_random_neg Viewer • Updated 8 days ago • 12.2k • 55
dreamorg/Meta-Llama-3.1-70B-Instruct_1217_msmacro_v6_2k_bright_v3_12k_reasoning_seed_doc_pos_random_neg Viewer • Updated 8 days ago • 13.9k • 41
dreamorg/Meta-Llama-3.1-70B-Instruct_1217_msmacro_v6_2k_bright_v3_12k_reasoning_seed_doc_pos_random_neg Viewer • Updated 8 days ago • 13.9k • 41
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore Paper • 2407.12854 • Published Jul 9, 2024 • 30
Language models scale reliably with over-training and on downstream tasks Paper • 2403.08540 • Published Mar 13, 2024 • 15
Vision-Flan: Scaling Human-Labeled Tasks in Visual Instruction Tuning Paper • 2402.11690 • Published Feb 18, 2024 • 9
VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use Paper • 2308.06595 • Published Aug 12, 2023 • 6