Unchun Yang's picture

Unchun Yang

ucyang

·

https://ucyang.com/

AI & ML interests

None yet

Recent Activity

liked a model about 1 hour ago

stepfun-ai/stepvideo-t2v-turbo

liked a model about 1 hour ago

stepfun-ai/stepvideo-t2v

liked a dataset 1 day ago

nlp-course/supervised-finetuning_quiz_student_responses

View all activity

Organizations

ucyang's activity

upvoted 2 articles 4 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

22 days ago

• 756

Article

Open-R1: Update #1

By

and 7 others •

17 days ago

• 282

upvoted 2 collections 4 days ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 11 items • Updated 8 days ago • 70

OpenR1-Math

Dataset and SFT model distilled from DeepSeek-R1. Check out our blog post for more details: https://huggingface.co./blog/open-r1/update-2 • 3 items • Updated 5 days ago • 6

upvoted an article 4 days ago

Article

Open R1: Update #2

By

and 6 others •

9 days ago

• 176

upvoted a collection 4 days ago

Tools for learning AI

This is a collection of tools on the hub that teachers and students can use to learn AI! • 9 items • Updated 2 days ago • 55

upvoted 2 collections 7 days ago

Nomic Embed

Open Source Long Context Text Embedders • 8 items • Updated Feb 14, 2024 • 20

Nomic Embed v2

Multilingual Embedding Models • 4 items • Updated 3 days ago • 11

upvoted a paper 8 days ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published 12 days ago • 108

upvoted a collection 12 days ago

Hibiki fr-en

Hibiki is a model for streaming speech translation , which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated 12 days ago • 48

upvoted a collection 13 days ago

Mistral-Small-24B-2501 (All Versions)

A collection of Mistral's new Small 2501 models including GGUF, 4-bit and more! • 9 items • Updated 15 days ago • 5

upvoted a paper 13 days ago

Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models

Paper • 2402.14207 • Published Feb 22, 2024 • 8

upvoted an article 14 days ago

Article

Open-source DeepResearch – Freeing our search agents

15 days ago

• 1.03k

upvoted a paper 15 days ago

Preference Leakage: A Contamination Problem in LLM-as-a-judge

Paper • 2502.01534 • Published 16 days ago • 37

upvoted 2 collections 17 days ago

Reasoning Datasets

Distilled synthetic Reasoning datasets • 7 items • Updated 17 days ago • 53

SFTvsRL Models & Data

This collection contains 4 initial checkpoints for https://github.com/LeslieTrue/SFTvsRL and necessary data for V-IRL training. • 5 items • Updated 14 days ago • 8

upvoted a paper 17 days ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 22 days ago • 106

upvoted a paper 19 days ago

DeepSeek-V3 Technical Report

Paper • 2412.19437 • Published Dec 27, 2024 • 51

upvoted a collection 22 days ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 3 items • Updated 23 days ago • 349

upvoted a collection 24 days ago

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 24 days ago • 100