BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks Paper • 2412.04626 • Published Dec 5, 2024 • 13
BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks Paper • 2412.04626 • Published Dec 5, 2024 • 13
BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks Paper • 2412.04626 • Published Dec 5, 2024 • 13
WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks? Paper • 2403.07718 • Published Mar 12, 2024 • 2
WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks Paper • 2407.05291 • Published Jul 7, 2024 • 2
WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks Paper • 2407.05291 • Published Jul 7, 2024 • 2
WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks Paper • 2407.05291 • Published Jul 7, 2024 • 2
Reducing hallucination in structured outputs via Retrieval-Augmented Generation Paper • 2404.08189 • Published Apr 12, 2024 • 1
Capture the Flag: Uncovering Data Insights with Large Language Models Paper • 2312.13876 • Published Dec 21, 2023 • 1
Multi-View Causal Representation Learning with Partial Observability Paper • 2311.04056 • Published Nov 7, 2023
XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference Paper • 2404.15420 • Published Apr 23, 2024 • 8
Regions of Reliability in the Evaluation of Multivariate Probabilistic Forecasts Paper • 2304.09836 • Published Apr 19, 2023
Capture the Flag: Uncovering Data Insights with Large Language Models Paper • 2312.13876 • Published Dec 21, 2023 • 1
Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting Paper • 2310.08278 • Published Oct 12, 2023 • 3