Maxwell-Jia's Collections
Daily arXiv
PAS: Data-Efficient Plug-and-Play Prompt Augmentation System
Paper
•
2407.06027
•
Published
•
9
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models
Paper
•
2407.09025
•
Published
•
134
Toto: Time Series Optimized Transformer for Observability
Paper
•
2407.07874
•
Published
•
32
SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers
Paper
•
2407.09413
•
Published
•
11
Paper
•
2407.10671
•
Published
•
161
OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces
Paper
•
2407.11895
•
Published
•
7
Scaling Granite Code Models to 128K Context
Paper
•
2407.13739
•
Published
•
20
Vision language models are blind
Paper
•
2407.06581
•
Published
•
83
Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data?
Paper
•
2407.16607
•
Published
•
23
MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains
Paper
•
2407.18961
•
Published
•
40
Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning
Paper
•
2407.18248
•
Published
•
32
SAM 2: Segment Anything in Images and Videos
Paper
•
2408.00714
•
Published
•
113
Gemma 2: Improving Open Language Models at a Practical Size
Paper
•
2408.00118
•
Published
•
76
ReLiK: Retrieve and LinK, Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget
Paper
•
2408.00103
•
Published
•
21
ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM
Paper
•
2408.12076
•
Published
•
12
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
Paper
•
2408.06292
•
Published
•
119
Discovering the Gems in Early Layers: Accelerating Long-Context LLMs with 1000x Input Token Reduction
Paper
•
2409.17422
•
Published
•
25
Enhancing Structured-Data Retrieval with GraphRAG: Soccer Data Case Study
Paper
•
2409.17580
•
Published
•
9
ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
Paper
•
2410.05080
•
Published
•
21
Cut Your Losses in Large-Vocabulary Language Models
Paper
•
2411.09009
•
Published
•
46
Open-Sora Plan: Open-Source Large Video Generation Model
Paper
•
2412.00131
•
Published
•
33
o1-Coder: an o1 Replication for Coding
Paper
•
2412.00154
•
Published
•
43
PaliGemma 2: A Family of Versatile VLMs for Transfer
Paper
•
2412.03555
•
Published
•
129
Chain-of-Retrieval Augmented Generation
Paper
•
2501.14342
•
Published
•
51
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate
Paper
•
2501.17703
•
Published
•
55
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs
Paper
•
2501.18585
•
Published
•
56
MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding
Paper
•
2501.18362
•
Published
•
21
Diverse Inference and Verification for Advanced Reasoning
Paper
•
2502.09955
•
Published
•
16
FoNE: Precise Single-Token Number Embeddings via Fourier Features
Paper
•
2502.09741
•
Published
•
11
Injecting Domain-Specific Knowledge into Large Language Models: A Comprehensive Survey
Paper
•
2502.10708
•
Published
•
4
SIFT: Grounding LLM Reasoning in Contexts via Stickers
Paper
•
2502.14922
•
Published
•
28
LightThinker: Thinking Step-by-Step Compression
Paper
•
2502.15589
•
Published
•
25