view article Article ColPali: Efficient Document Retrieval with Vision Language Models π By manu β’ Jul 5, 2024 β’ 208
Towards Best Practices for Open Datasets for LLM Training Paper β’ 2501.08365 β’ Published Jan 14 β’ 55
Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning Paper β’ 2411.18203 β’ Published Nov 27, 2024 β’ 34
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning Paper β’ 2410.02884 β’ Published Oct 3, 2024 β’ 54
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B Paper β’ 2406.07394 β’ Published Jun 11, 2024 β’ 27
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi β’ 13 items β’ Updated Sep 18, 2024 β’ 227
view article Article ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models By yuchenlin β’ Jul 27, 2024 β’ 31
Seamless: Multilingual Expressive and Streaming Speech Translation Paper β’ 2312.05187 β’ Published Dec 8, 2023 β’ 14