26 25 12

Sherman Chann

152334H

https://152334H.github.io

152334H

AI & ML interests

None yet

Recent Activity

liked a model 5 days ago

deepseek-ai/DeepSeek-R1

liked a model 5 days ago

deepseek-ai/DeepSeek-R1-Zero

upvoted a paper 4 months ago

RATIONALYST: Pre-training Process-Supervision for Improving Reasoning

View all activity

Organizations

152334H's activity

liked 2 models 5 days ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 1 day ago • 69.6k • 2.29k

deepseek-ai/DeepSeek-R1-Zero

Text Generation • Updated 1 day ago • 4.48k • 405

upvoted a paper 4 months ago

RATIONALYST: Pre-training Process-Supervision for Improving Reasoning

Paper • 2410.01044 • Published Oct 1, 2024 • 35

updated a collection 4 months ago

mycollection1

Collection

1 item • Updated Oct 4, 2024

upvoted 2 papers 4 months ago

Not All LLM Reasoners Are Created Equal

Paper • 2410.01748 • Published Oct 2, 2024 • 28

Law of the Weakest Link: Cross Capabilities of Large Language Models

Paper • 2409.19951 • Published Sep 30, 2024 • 54

commented 2 papers 4 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 136 •

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 136 •

upvoted a paper 4 months ago

Seed-Music: A Unified Framework for High Quality and Controlled Music Generation

Paper • 2409.09214 • Published Sep 13, 2024 • 51

upvoted 5 papers 5 months ago

commented 2 papers 5 months ago

DeepSpeak Dataset v1.0

Paper • 2408.05366 • Published Aug 9, 2024 • 12 •

OpenResearcher: Unleashing AI for Accelerated Scientific Research

Paper • 2408.06941 • Published Aug 13, 2024 • 32 •

New activity in meta-llama/Llama-3.1-405B 6 months ago

8-kv-heads

#21 opened 6 months ago by

ArthurZ

upvoted 2 papers 6 months ago

ShieldGemma: Generative AI Content Moderation Based on Gemma

Paper • 2407.21772 • Published Jul 31, 2024 • 14

The Llama 3 Herd of Models

Paper • 2407.21783 • Published Jul 31, 2024 • 110