readings - a kernelpanic Collection

Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

kernelpanic 's Collections

readings

updated about 18 hours ago

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21 • 57
TableBench: A Comprehensive and Complex Benchmark for Table Question Answering

Paper • 2408.09174 • Published Aug 17 • 51
To Code, or Not To Code? Exploring Impact of Code in Pre-training

Paper • 2408.10914 • Published Aug 20 • 41
Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

Paper • 2408.11878 • Published Aug 20 • 52
Law of Vision Representation in MLLMs

Paper • 2408.16357 • Published Aug 29 • 92
CURLoRA: Stable LLM Continual Fine-Tuning and Catastrophic Forgetting Mitigation

Paper • 2408.14572 • Published Aug 26 • 7
SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding

Paper • 2408.15545 • Published Aug 28 • 34
LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture

Paper • 2409.02889 • Published Sep 4 • 55
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA

Paper • 2409.02897 • Published Sep 4 • 44
Attention Heads of Large Language Models: A Survey

Paper • 2409.03752 • Published Sep 5 • 88
Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing

Paper • 2409.01322 • Published Sep 2 • 94
Towards a Unified View of Preference Learning for Large Language Models: A Survey

Paper • 2409.02795 • Published Sep 4 • 71
Paper Copilot: A Self-Evolving and Efficient LLM System for Personalized Academic Assistance

Paper • 2409.04593 • Published Sep 6 • 23
ProteinBench: A Holistic Evaluation of Protein Foundation Models

Paper • 2409.06744 • Published Sep 10 • 7
Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18 • 138
Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 135
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models

Paper • 2409.16191 • Published Sep 24 • 41
Making Text Embedders Few-Shot Learners

Paper • 2409.15700 • Published Sep 24 • 29
Instruction Following without Instruction Tuning

Paper • 2409.14254 • Published Sep 21 • 27
TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices

Paper • 2410.00531 • Published Oct 1 • 29
From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging

Paper • 2410.01215 • Published Oct 2 • 30
Not All LLM Reasoners Are Created Equal

Paper • 2410.01748 • Published Oct 2 • 28
RATIONALYST: Pre-training Process-Supervision for Improving Reasoning

Paper • 2410.01044 • Published Oct 1 • 34
Training Language Models on Synthetic Edit Sequences Improves Code Synthesis

Paper • 2410.02749 • Published Oct 3 • 12
SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration

Paper • 2410.02367 • Published Oct 3 • 47
Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1 • 144
Selective Attention Improves Transformer

Paper • 2410.02703 • Published Oct 3 • 23
Agent S: An Open Agentic Framework that Uses Computers Like a Human

Paper • 2410.08164 • Published Oct 10 • 24
Toward General Instruction-Following Alignment for Retrieval-Augmented Generation

Paper • 2410.09584 • Published Oct 12 • 47
A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models

Paper • 2410.13841 • Published Oct 17 • 14
HumanEval-V: Evaluating Visual Understanding and Reasoning Abilities of Large Multimodal Models Through Coding Tasks

Paper • 2410.12381 • Published Oct 16 • 42
Revealing the Barriers of Language Agents in Planning

Paper • 2410.12409 • Published Oct 16 • 24
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published Oct 22 • 89
Why Does the Effective Context Length of LLMs Fall Short?

Paper • 2410.18745 • Published Oct 24 • 17
Robots Pre-train Robots: Manipulation-Centric Robotic Representation from Large-Scale Robot Dataset

Paper • 2410.22325 • Published Oct 29 • 10
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks

Paper • 2410.22391 • Published Oct 29 • 22
Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination

Paper • 2411.03823 • Published Nov 6 • 43
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5 • 63
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems

Paper • 2411.02959 • Published Nov 5 • 64
Let the Flows Tell: Solving Graph Combinatorial Optimization Problems with GFlowNets

Paper • 2305.17010 • Published May 26, 2023
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published Nov 7 • 111
Parameter-Efficient Fine-Tuning of Large Language Models for Unit Test Generation: An Empirical Study

Paper • 2411.02462 • Published Nov 4 • 9
Large Language Models Can Self-Improve in Long-context Reasoning

Paper • 2411.08147 • Published Nov 12 • 62
Cut Your Losses in Large-Vocabulary Language Models

Paper • 2411.09009 • Published Nov 13 • 43
ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?

Paper • 2411.06469 • Published Nov 10 • 17
SlimLM: An Efficient Small Language Model for On-Device Document Assistance

Paper • 2411.09944 • Published Nov 15 • 12
SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration

Paper • 2411.10958 • Published Nov 17 • 50
Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization

Paper • 2411.10442 • Published Nov 15 • 67
Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20 • 38
Natural Language Reinforcement Learning

Paper • 2411.14251 • Published Nov 21 • 26
Cautious Optimizers: Improving Training with One Line of Code

Paper • 2411.16085 • Published about 1 month ago • 15
Predicting Emergent Capabilities by Finetuning

Paper • 2411.16035 • Published about 1 month ago • 6
Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published 29 days ago • 47
o1-Coder: an o1 Replication for Coding

Paper • 2412.00154 • Published 26 days ago • 40
Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability

Paper • 2411.19943 • Published 25 days ago • 55
VisionZip: Longer is Better but Not Necessary in Vision Language Models

Paper • 2412.04467 • Published 19 days ago • 104
Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection

Paper • 2412.04455 • Published 19 days ago • 35
Personalized Multimodal Large Language Models: A Survey

Paper • 2412.02142 • Published 22 days ago • 12
Evaluating Language Models as Synthetic Data Generators

Paper • 2412.03679 • Published 20 days ago • 43
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published 18 days ago • 121
MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published 18 days ago • 45
EXAONE 3.5: Series of Large Language Models for Real-world Use Cases

Paper • 2412.04862 • Published 19 days ago • 48
Moto: Latent Motion Token as the Bridging Language for Robot Manipulation

Paper • 2412.04445 • Published 19 days ago • 21
Evaluating and Aligning CodeLLMs on Human Preference

Paper • 2412.05210 • Published 19 days ago • 47
POINTS1.5: Building a Vision-Language Model towards Real World Applications

Paper • 2412.08443 • Published 14 days ago • 38
Phi-4 Technical Report

Paper • 2412.08905 • Published 13 days ago • 92
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published 12 days ago • 90
GenEx: Generating an Explorable World

Paper • 2412.09624 • Published 12 days ago • 84
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Paper • 2412.14161 • Published 6 days ago • 43
Qwen2.5 Technical Report

Paper • 2412.15115 • Published 5 days ago • 325
LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks

Paper • 2412.15204 • Published 5 days ago • 30
How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published 6 days ago • 45
Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published 4 days ago • 29
SCOPE: Optimizing Key-Value Cache Compression in Long-context Generation

Paper • 2412.13649 • Published 7 days ago • 17
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Paper • 2412.17256 • Published 2 days ago • 31
RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response

Paper • 2412.14922 • Published 6 days ago • 65
Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published 2 days ago • 25
Revisiting In-Context Learning with Long Context Language Models

Paper • 2412.16926 • Published 3 days ago • 13
Outcome-Refining Process Supervision for Code Generation

Paper • 2412.15118 • Published 5 days ago • 12
DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought

Paper • 2412.17498 • Published 1 day ago • 10
NILE: Internal Consistency Alignment in Large Language Models

Paper • 2412.16686 • Published 4 days ago • 6
LearnLM: Improving Gemini for Learning

Paper • 2412.16429 • Published 4 days ago • 11
PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World

Paper • 2412.17589 • Published 1 day ago • 8

Collection guide
Browse collections

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs