37 68 264

MoonRide

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 hours ago

Diffusion Autoencoders are Scalable Image Tokenizers

liked a model about 21 hours ago

FuseAI/FuseO1-DeepSeekR1-QwQ-SkyT1-32B-Preview

liked a model about 21 hours ago

bartowski/FuseO1-DeekSeekR1-QwQ-SkyT1-32B-Preview-GGUF

View all activity

Organizations

MoonRide's activity

upvoted a paper about 2 hours ago

Diffusion Autoencoders are Scalable Image Tokenizers

Paper • 2501.18593 • Published 4 days ago • 1

upvoted an article 2 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

6 days ago

• 570

upvoted a paper 7 days ago

Humanity's Last Exam

Paper • 2501.14249 • Published 10 days ago • 50

upvoted a paper 20 days ago

Titans: Learning to Memorize at Test Time

Paper • 2501.00663 • Published Dec 31, 2024 • 14

upvoted 3 papers 24 days ago

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Paper • 2501.04001 • Published 27 days ago • 42

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

Paper • 2501.03895 • Published 27 days ago • 48

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published 25 days ago • 87

upvoted a paper 29 days ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published Jan 1 • 99

upvoted a paper about 2 months ago

Are Your LLMs Capable of Stable Reasoning?

Paper • 2412.13147 • Published Dec 17, 2024 • 91

upvoted a collection about 2 months ago

Unsloth 4-bit Dynamic Quants

Collection

Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit • 13 items • Updated about 11 hours ago • 33

upvoted 3 papers 4 months ago

AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs

Paper • 2410.05295 • Published Oct 3, 2024 • 12

HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models

Paper • 2409.16191 • Published Sep 24, 2024 • 42

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 136

upvoted a paper 5 months ago

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 110

upvoted 5 papers 6 months ago

Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment

Paper • 2408.06266 • Published Aug 12, 2024 • 10

Scaling Exponents Across Parameterizations and Optimizers

Paper • 2407.05872 • Published Jul 8, 2024 • 1

Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning

Paper • 2407.15762 • Published Jul 22, 2024 • 10

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Paper • 2407.13623 • Published Jul 18, 2024 • 54

NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window?

Paper • 2407.11963 • Published Jul 16, 2024 • 44

upvoted a collection 6 months ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 644