1 20 6

Peter

Tempo14

AI & ML interests

None yet

Recent Activity

updated a collection 12 days ago

RAG

updated a collection 12 days ago

Reasoning

upvoted a paper 12 days ago

Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights

View all activity

Organizations

Tempo14's activity

upvoted 7 papers 12 days ago

Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights

Paper • 2502.09619 • Published 15 days ago • 31

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published 17 days ago • 45

LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!

Paper • 2502.07374 • Published 17 days ago • 35

upvoted an article 24 days ago

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

25 days ago

• 109

upvoted 4 papers 25 days ago

Scalable-Softmax Is Superior for Attention

Paper • 2501.19399 • Published 28 days ago • 20

s1: Simple test-time scaling

Paper • 2501.19393 • Published 28 days ago • 107

Large Language Models Think Too Fast To Explore Effectively

Paper • 2501.18009 • Published 30 days ago • 23

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published 29 days ago • 56

upvoted a paper about 1 month ago

O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning

Paper • 2501.12570 • Published Jan 22 • 24

upvoted an article 3 months ago

Article

Brain-Inspired Efficient Pruning: Exploiting Criticality in Spiking Neural Networks

•

Nov 22, 2024

• 1

upvoted a paper 3 months ago

LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models

Paper • 2411.09595 • Published Nov 14, 2024 • 72

upvoted an article 5 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

• 223

upvoted an article 6 months ago

Article

Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques 👐 📚

•

Aug 26, 2024

• 50

upvoted an article 7 months ago

Article

A failed experiment: Infini-Attention, and why we should keep trying?

Aug 14, 2024

• 59

upvoted an article 8 months ago

Article

Shape Rotation 101: An Intro to Einsum and Jax Transformers

•

Jun 22, 2024

• 3

upvoted a paper over 1 year ago

MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing

Paper • 2306.10012 • Published Jun 16, 2023 • 35