1 140 601

Motoki Wu

tokestermw

https://motoki.co

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

microsoft/Phi-4-multimodal-instruct

upvoted a paper 2 days ago

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

upvoted a paper 3 days ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

View all activity

Organizations

tokestermw's activity

liked a model 2 days ago

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • Updated about 5 hours ago • 7.35k • 514

upvoted a paper 2 days ago

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

Paper • 2502.16894 • Published 5 days ago • 22

upvoted a paper 3 days ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published 3 days ago • 54

liked a model 4 days ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Text Generation • Updated 5 days ago • 1.26M • • 955

upvoted 3 papers 4 days ago

Expect the Unexpected: FailSafe Long Context QA for Finance

Paper • 2502.06329 • Published 18 days ago • 124

InterFeedback: Unveiling Interactive Intelligence of Large Multimodal Models via Human Feedback

Paper • 2502.15027 • Published 8 days ago • 6

SIFT: Grounding LLM Reasoning in Contexts via Stickers

Paper • 2502.14922 • Published 9 days ago • 28

upvoted a collection 5 days ago

Sky-T1-7B

Collection

A series of 7B models trained with different recipes and the corresponding training data. • 8 items • Updated 15 days ago • 5

liked a model 7 days ago

Qwen/QwQ-32B-Preview

Text Generation • Updated Jan 12 • 257k • • 1.63k

liked a model 8 days ago

Open-Reasoner-Zero/Open-Reasoner-Zero-32B

Updated 9 days ago • 778 • 26

liked a Space 8 days ago

1.79k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted a collection 9 days ago

Process Reward Models

Collection

Model and Datasets for Qwen 2.5 Math PRM 7B • 6 items • Updated 10 days ago • 1

liked a model 10 days ago

perplexity-ai/r1-1776

Text Generation • Updated 2 days ago • 31.9k • • 1.9k

upvoted a paper 11 days ago

MM-RLHF: The Next Step Forward in Multimodal LLM Alignment

Paper • 2502.10391 • Published 14 days ago • 30

upvoted a paper 15 days ago

Distillation Scaling Laws

Paper • 2502.08606 • Published 16 days ago • 46

liked a Space 15 days ago

592

Open Deep-Research

🏆

OpenAI's Deep Research, but open

upvoted 2 papers 18 days ago

Agency Is Frame-Dependent

Paper • 2502.04403 • Published 23 days ago • 21

ARR: Question Answering with Large Language Models via Analyzing, Retrieving, and Reasoning

Paper • 2502.04689 • Published 22 days ago • 7

upvoted an article 18 days ago

Article

Open R1: Update #2

and 6 others •

18 days ago

• 191

upvoted a paper 19 days ago

Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models

Paper • 2502.04404 • Published 23 days ago • 22