-
MiniMax-01: Scaling Foundation Models with Lightning Attention
Paper • 2501.08313 • Published • 268 -
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking
Paper • 2501.04519 • Published • 245 -
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
Paper • 2412.13663 • Published • 125 -
Apollo: An Exploration of Video Understanding in Large Multimodal Models
Paper • 2412.10360 • Published • 139
Nguyễn Việt Anh
thehandsomefrog4825
AI & ML interests
None yet
Recent Activity
updated
a collection
1 day ago
Model 🖥️
upvoted
a
collection
1 day ago
DeepSeek R1 (All Versions)
updated
a collection
3 days ago
Model 🖥️
Organizations
None yet
Collections
17
models
None public yet
datasets
None public yet