罗杰斯

rojasdiego

https://rojasdiego.com

AI & ML interests

LLMs for Code Generation

rojasdiego's activity

upvoted a paper 2 days ago

The Curse of Depth in Large Language Models

Paper • 2502.05795 • Published 20 days ago • 34

liked a model 13 days ago

mistralai/Mistral-Small-24B-Base-2501

Text Generation • Updated 29 days ago • 24.9k • 224

upvoted a paper 27 days ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published 29 days ago • 56

upvoted a paper 29 days ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 108

liked 2 models about 1 month ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 5 days ago • 4.63M • 10.5k

deepseek-ai/DeepSeek-R1-Zero

Text Generation • Updated 5 days ago • 19.3k • 853

liked a dataset about 2 months ago

bigcode/the-stack-v2-train-smol-ids

Viewer • Updated Apr 23, 2024 • 40.1M • 949 • 31

liked a model about 2 months ago

numind/NuExtract-1.5

Text Generation • Updated Nov 18, 2024 • 35.3k • 199

updated a collection about 2 months ago

Code LLMs

Collection

6 items • Updated Jan 3 • 1

liked 2 models about 2 months ago

infly/OpenCoder-1.5B-Base

Text Generation • Updated Nov 11, 2024 • 14.2k • 21

infly/OpenCoder-8B-Instruct

Text Generation • Updated Nov 14, 2024 • 1.83k • 185

updated a collection about 2 months ago

CoT Models

Collection

2 items • Updated Jan 1

liked a model about 2 months ago

PowerInfer/SmallThinker-3B-Preview

Text Generation • Updated Jan 16 • 103k • 387

updated a collection about 2 months ago

Code LLMs

Collection

6 items • Updated Jan 3 • 1

liked a model about 2 months ago

Qwen/QwQ-32B-Preview

Text Generation • Updated Jan 12 • 257k • 1.63k

liked a model 2 months ago

deepseek-ai/DeepSeek-V3-Base

Updated 5 days ago • 557k • 1.58k

liked a model 3 months ago

meta-llama/Llama-3.3-70B-Instruct

Text Generation • Updated Dec 21, 2024 • 560k • 2.05k

upvoted a paper 4 months ago

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Paper • 2411.02337 • Published Nov 4, 2024 • 35

liked 2 models 4 months ago

tencent/Tencent-Hunyuan-Large

Text Generation • Updated Jan 19 • 240 • 568

deepseek-ai/DeepSeek-V2

Text Generation • Updated Jun 8, 2024 • 20.5k • 310