Compressed Chain of Thought: Efficient Reasoning Through Dense Representations Paper • 2412.13171 • Published 7 days ago • 30
Apollo: An Exploration of Video Understanding in Large Multimodal Models Paper • 2412.10360 • Published 11 days ago • 130
LAION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural Annotations Paper • 2412.08580 • Published 13 days ago • 44
ProcessBench: Identifying Process Errors in Mathematical Reasoning Paper • 2412.06559 • Published 15 days ago • 68
Maya: An Instruction Finetuned Multilingual Multimodal Model Paper • 2412.07112 • Published 15 days ago • 25
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction Paper • 2412.04454 • Published 19 days ago • 48
Evaluating Language Models as Synthetic Data Generators Paper • 2412.03679 • Published 20 days ago • 43
INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge Paper • 2411.19799 • Published 25 days ago • 10
SeQwen at the Financial Misinformation Detection Challenge Task: Sequential Learning for Claim Verification and Explanation Generation in Financial Domains Paper • 2412.00549 • Published 24 days ago • 1
The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use Paper • 2411.10323 • Published Nov 15 • 31
1-800-SHARED-TASKS @ NLU of Devanagari Script Languages: Detection of Language, Hate Speech, and Targets using LLMs Paper • 2411.06850 • Published Nov 11 • 3
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation Paper • 2408.02545 • Published Aug 5 • 35
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training Paper • 2411.15124 • Published Nov 22 • 55
Article BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S*⚡ By xhluca • Jul 9 • 41
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning Paper • 2410.02884 • Published Oct 3 • 52
Pangea Collection A Fully Open Multilingual Multimodal LLM for 39 Languages • 18 items • Updated Nov 2 • 17
M-RewardBench: Evaluating Reward Models in Multilingual Settings Paper • 2410.15522 • Published Oct 20 • 11
Large Language Model Unlearning via Embedding-Corrupted Prompts Paper • 2406.07933 • Published Jun 12 • 7