Elijah Wilt

ooj

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

unsloth/DeepSeek-R1-GGUF

liked a model 3 days ago

deepseek-ai/DeepSeek-R1

upvoted a paper 4 days ago

LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models

View all activity

Organizations

None yet

ooj's activity

liked 2 models 3 days ago

unsloth/DeepSeek-R1-GGUF

Updated 4 days ago • 14.8k • 48

deepseek-ai/DeepSeek-R1

Text Generation • Updated about 24 hours ago • 44.6k • 2.14k

upvoted 13 papers 4 days ago

LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models

Paper • 2411.09595 • Published Nov 14, 2024 • 72

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 89

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 125

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published 16 days ago • 89

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 16 days ago • 245

Entropy-Guided Attention for Private LLMs

Paper • 2501.03489 • Published 17 days ago • 14

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published 15 days ago • 82

VideoRAG: Retrieval-Augmented Generation over Video Corpus

Paper • 2501.05874 • Published 14 days ago • 66

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published 13 days ago • 74

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 7 days ago • 93

upvoted a collection 5 days ago

Qwen2.5-Math

Collection

Math-specific model series based on Qwen2.5 • 11 items • Updated 10 days ago • 64

upvoted a paper 5 days ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published 11 days ago • 85

updated a collection 5 days ago

Specific Models

Collection

5 items • Updated 5 days ago

upvoted 2 papers 5 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 10 days ago • 268

Trusted Machine Learning Models Unlock Private Inference for Problems Currently Infeasible with Cryptography

Paper • 2501.08970 • Published 9 days ago • 6