Rookie

Rookied

iknocho

AI & ML interests

None yet

Recent Activity

liked a dataset 1 day ago

facebook/Wildchat-RIP-Filtered-by-70b-Llama

upvoted an article 3 days ago

Open Source AI Agents | Github/Repo List | [2025]

liked a Space 8 days ago

vidore/vidore-leaderboard

View all activity

Organizations

Rookied's activity

upvoted an article 3 days ago

Article

Open Source AI Agents | Github/Repo List | [2025]

•

7 days ago

• 22

upvoted an article 10 days ago

Article

Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥

11 days ago

• 89

upvoted an article 11 days ago

Article

Fine-tuning SmolLM with Group Relative Policy Optimization (GRPO) by following the Methodologies

•

11 days ago

• 17

upvoted an article 17 days ago

Article

Open R1: Update #2

and 6 others •

18 days ago

• 191

upvoted an article 18 days ago

Article

Reasoning at the Forefront of Advanced AI Models : Mistral-Small-24B-Base-2501

•

20 days ago

• 3

upvoted an article 22 days ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

Jan 15

• 151

upvoted an article 23 days ago

Article

Open-source DeepResearch – Freeing our search agents

25 days ago

• 1.11k

upvoted an article 24 days ago

Article

Open-R1: Update #1

and 7 others •

27 days ago

• 289

upvoted 3 articles 25 days ago

Article

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

•

28 days ago

• 40

Article

🦸🏻#9: Does AI Remember? The Role of Memory in Agentic Workflows

•

26 days ago

• 14

Article

The AI tools for Art Newsletter - Issue 1

29 days ago

• 67

upvoted a paper 29 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 334

upvoted 2 articles 29 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 782

Article

Welcome to Inference Providers on the Hub 🔥

Jan 28

• 400

upvoted 2 articles about 1 month ago

Article

Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO)

•

Jan 19

• 14

Article

Hugging Face and FriendliAI partner to supercharge model deployment on the Hub

Jan 22

• 36

upvoted an article about 2 months ago

Article

Finetuning Falcon 7b in a hybrid distributed fashion

•

Dec 31, 2024

• 5

upvoted a paper 2 months ago

Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs

Paper • 2402.14740 • Published Feb 22, 2024 • 13

upvoted 2 articles 3 months ago

Article

Building a MusicGen API to Generate Custom Music Tracks Locally

•

Dec 4, 2024

• 2

Article

Optimizing Deep Learning Training Techniques

•

Dec 3, 2024

• 2