1 28 3

Jonathan LYS

jonathan-lys

jonathanlys01

AI & ML interests

None yet

Recent Activity

upvoted a paper 15 days ago

Distillation Scaling Laws

upvoted a paper 17 days ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

upvoted an article 30 days ago

How biased is Whisper ? Evaluating Whisper Models for Robustness to Diverse English Accents

View all activity

Organizations

jonathan-lys's activity

upvoted a paper 15 days ago

Distillation Scaling Laws

Paper • 2502.08606 • Published 16 days ago • 46

upvoted a paper 17 days ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published 21 days ago • 120

upvoted an article 30 days ago

Article

How biased is Whisper ? Evaluating Whisper Models for Robustness to Diverse English Accents

•

about 1 month ago

• 16

upvoted a paper 2 months ago

Parallelized Autoregressive Visual Generation

Paper • 2412.15119 • Published Dec 19, 2024 • 51

liked a Space 2 months ago

526

Scaling test-time compute

📈

Enhance math problem solving by scaling test-time compute

upvoted 3 papers 3 months ago

SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance

Paper • 2412.02687 • Published Dec 3, 2024 • 109

TinyFusion: Diffusion Transformers Learned Shallow

Paper • 2412.01199 • Published Dec 2, 2024 • 14

When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training

Paper • 2411.13476 • Published Nov 20, 2024 • 16

upvoted 3 papers 4 months ago

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Paper • 2410.22366 • Published Oct 28, 2024 • 78

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published Oct 22, 2024 • 89

Pixtral 12B

Paper • 2410.07073 • Published Oct 9, 2024 • 64

upvoted a paper 5 months ago

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1, 2024 • 145

upvoted an article 5 months ago

Article

wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR??

•

Sep 27, 2024

• 40

liked a Space 7 months ago

ASR Comparaison

🦀

upvoted 2 papers 7 months ago

SAM 2: Segment Anything in Images and Videos

Paper • 2408.00714 • Published Aug 1, 2024 • 113

The Llama 3 Herd of Models

Paper • 2407.21783 • Published Jul 31, 2024 • 114

upvoted 2 papers 9 months ago

Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

Paper • 2405.21060 • Published May 31, 2024 • 64

Phased Consistency Model

Paper • 2405.18407 • Published May 28, 2024 • 48

upvoted a paper 10 months ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 88

liked a model 10 months ago

Salesforce/xgen-mm-phi3-mini-instruct-r-v1

Image-Text-to-Text • Updated 26 days ago • 1.17k • 184