Michael Barry

MichaelBarryUK

AI & ML interests

None yet

Organizations

None yet

MichaelBarryUK's activity

upvoted 12 papers 9 days ago

IHEval: Evaluating Language Models on Following the Instruction Hierarchy

Paper • 2502.08745 • Published 16 days ago • 18

I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models

Paper • 2502.10458 • Published 17 days ago • 27

Diverse Inference and Verification for Advanced Reasoning

Paper • 2502.09955 • Published 15 days ago • 16

Large Language Diffusion Models

Paper • 2502.09992 • Published 15 days ago • 95

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published 13 days ago • 135

Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through Options

Paper • 2502.12929 • Published 10 days ago • 7

Atom of Thoughts for Markov LLM Test-Time Scaling

Paper • 2502.12018 • Published 11 days ago • 12

HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading

Paper • 2502.12574 • Published 11 days ago • 10

Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity

Paper • 2502.13063 • Published 10 days ago • 63

upvoted 7 papers 11 days ago

One Example Shown, Many Concepts Known! Counterexample-Driven Conceptual Reasoning in Mathematical LLMs

Paper • 2502.10454 • Published 17 days ago • 7

Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems

Paper • 2502.11098 • Published 13 days ago • 11

System Message Generation for User Preferences using Open-Source Models

Paper • 2502.11330 • Published 12 days ago • 16

Building A Proof-Oriented Programmer That Is 64% Better Than GPT-4o Under Data Scarsity

Paper • 2502.11901 • Published 11 days ago • 6

CRANE: Reasoning with constrained LLM generation

Paper • 2502.09061 • Published 16 days ago • 18

How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training

Paper • 2502.11196 • Published 12 days ago • 21

SAFE-SQL: Self-Augmented In-Context Learning with Fine-grained Example Selection for Text-to-SQL

Paper • 2502.11438 • Published 12 days ago • 7

commented on a paper about 1 month ago

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Paper • 2408.07055 • Published Aug 13, 2024 • 66