Quentin Tardif

ntnq

AI & ML interests

None yet

Organizations

ntnq's activity

upvoted a paper about 14 hours ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published 3 days ago • 48

upvoted an article 1 day ago

Open-R1: Update #1

Article • 2 days ago • 179

upvoted an article 3 days ago

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

Article • 3 days ago • 22

upvoted 2 papers 6 days ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 6 days ago • 88

Optimizing Large Language Model Training Using FP4 Quantization

Paper • 2501.17116 • Published 6 days ago • 29

upvoted an article 7 days ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • 7 days ago • 587

liked a Space 7 days ago

Running on Zero

287

🤪

Magic Face

Transform Your Face Into Legendary Characters!

upvoted a paper 12 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 12 days ago • 284

upvoted a collection 13 days ago

DeepSeek-R1

Collection • 8 items • Updated 14 days ago • 361

upvoted a paper 14 days ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 18 days ago • 104

upvoted an article 19 days ago

Run ComfyUI workflows for free on Spaces

Article • Jan 14, 2024 • 49

liked a model 26 days ago

microsoft/phi-4

Text Generation • Updated 26 days ago • 375k • 1.65k

upvoted an article 26 days ago

Fine-tune a SmolLM on domain-specific synthetic data from a LLM

Article • Jan 3 • 32

upvoted a paper 27 days ago

Test-time Computing: from System-1 Thinking to System-2 Thinking

Paper • 2501.02497 • Published 29 days ago • 41

upvoted a paper 28 days ago

2 OLMo 2 Furious

Paper • 2501.00656 • Published Dec 31, 2024 • 16

liked a Space about 1 month ago

Running

1.28k

🐢

Qwen2.5 Coder Artifacts

upvoted a paper about 1 month ago

HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation

Paper • 2412.21199 • Published Dec 30, 2024 • 13

liked a model about 1 month ago

bigcode/starpii

Token Classification • Updated Jul 24, 2023 • 643 • 116

upvoted a paper about 1 month ago

SelfCodeAlign: Self-Alignment for Code Generation

Paper • 2410.24198 • Published Oct 31, 2024 • 23

liked a Space about 1 month ago

Running

⚡

Train LLMs