Michael Barry

MichaelBarryUK

MichaelBarryUK's activity

upvoted a paper 1 day ago

BitNet a4.8: 4-bit Activations for 1-bit LLMs

Paper • 2411.04965 • Published 2 days ago • 51

upvoted 3 papers 2 days ago

Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

Paper • 2411.04996 • Published 2 days ago • 33

DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

Paper • 2411.04928 • Published 3 days ago • 32

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published 3 days ago • 78

upvoted a paper 6 days ago

Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation

Paper • 2411.00412 • Published 9 days ago • 9

upvoted a paper 10 days ago

CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation

Paper • 2410.23090 • Published 11 days ago • 52

upvoted a paper 13 days ago

Reflection-Bench: probing AI intelligence with reflection

Paper • 2410.16270 • Published 20 days ago • 5

upvoted a paper 16 days ago

Lightweight Neural App Control

Paper • 2410.17883 • Published 18 days ago • 8

upvoted a paper 23 days ago

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published 24 days ago • 86

upvoted a paper 30 days ago

Agent S: An Open Agentic Framework that Uses Computers Like a Human

Paper • 2410.08164 • Published about 1 month ago • 24

upvoted 10 papers about 1 month ago

Aria: An Open Multimodal Native Mixture-of-Experts Model

Paper • 2410.05993 • Published Oct 8 • 107

Differential Transformer

Paper • 2410.05258 • Published Oct 7 • 165

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1 • 143

RATIONALYST: Pre-training Process-Supervision for Improving Reasoning

Paper • 2410.01044 • Published Oct 1 • 34

LEOPARD : A Vision Language Model For Text-Rich Multi-Image Tasks

Paper • 2410.01744 • Published Oct 2 • 25

From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging

Paper • 2410.01215 • Published Oct 2 • 30

Law of the Weakest Link: Cross Capabilities of Large Language Models

Paper • 2409.19951 • Published Sep 30 • 53

TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices

Paper • 2410.00531 • Published Oct 1 • 28

Cottention: Linear Transformers With Cosine Attention

Paper • 2409.18747 • Published Sep 27 • 15

MinerU: An Open-Source Solution for Precise Document Content Extraction

Paper • 2409.18839 • Published Sep 27 • 25