Starstrek

Stars321123

Stars321

Organizations

None yet

Stars321123's activity

upvoted a collection about 19 hours ago

LLMs

Collection • 376 items • Updated about 4 hours ago • 26

upvoted a paper about 19 hours ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published 3 days ago • 59

upvoted a paper about 20 hours ago

Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Paper • 2501.12202 • Published 3 days ago • 22

upvoted a paper 2 days ago

Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks

Paper • 2501.11733 • Published 3 days ago • 24

upvoted 2 articles 7 days ago

The Large Language Model Course

Article • 8 days ago • 71

Introducing smolagents: simple agents that write actions in code.

Article • 24 days ago • 519

upvoted an article 8 days ago

Train 400x faster Static Embedding Models with Sentence Transformers

Article • 9 days ago • 120

upvoted a paper 9 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 9 days ago • 268

upvoted a paper 10 days ago

An Empirical Study of Autoregressive Pre-training from Videos

Paper • 2501.05453 • Published 14 days ago • 36

upvoted 2 papers 14 days ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published 15 days ago • 79

Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control

Paper • 2501.03847 • Published 17 days ago • 23

upvoted a paper 15 days ago

BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning

Paper • 2501.03226 • Published 17 days ago • 37

upvoted 2 papers 16 days ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published 20 days ago • 87

Personalized Graph-Based Retrieval for Large Language Models

Paper • 2501.02157 • Published 20 days ago • 28

upvoted a paper 20 days ago

A3: Android Agent Arena for Mobile GUI Agents

Paper • 2501.01149 • Published 22 days ago • 22

upvoted a collection 22 days ago

ModernBERT

Collection • Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 125

upvoted a collection 26 days ago

Gemma release

Collection • Groups the Gemma models released by the Google team. • 40 items • Updated Dec 13, 2024 • 329