Vinh Nguyen

vinhnx90

https://vinhnx.github.io

AI & ML interests

Learn by doing

Recent Activity

liked a model about 6 hours ago

lmstudio-community/DeepSeek-R1-Distill-Qwen-14B-GGUF

liked a Space about 17 hours ago

open-r1/README

upvoted an article about 17 hours ago

Open-R1: Update #1

View all activity

Organizations

None yet

vinhnx90's activity

upvoted an article about 17 hours ago

Article

Open-R1: Update #1

•

2 days ago

• 178

upvoted a paper 4 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 12 days ago • 284

upvoted an article 6 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

7 days ago

• 587

upvoted 2 articles 12 days ago

Article

Fine-tune ModernBERT for RAG with Synthetic Data

•

14 days ago

• 29

Article

Exploring Synthetic Data Generation with DataDreamer

•

13 days ago

• 6

upvoted a collection 12 days ago

llama.vim

Collection

upvoted a collection 13 days ago

DeepSeek-R1

Collection

8 items • Updated 14 days ago • 361

upvoted an article 14 days ago

Article

A Beginner-Friendly PyTorch Tutorial: Build and Train Your First Model

•

14 days ago

• 4

upvoted a collection 14 days ago

DeepSeek R1 (All Versions)

Collection

DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated about 21 hours ago • 137

upvoted an article 15 days ago

Article

Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO)

•

15 days ago

• 13

upvoted a paper 15 days ago

Vintern-1B: An Efficient Multimodal Large Language Model for Vietnamese

Paper • 2408.12480 • Published Aug 22, 2024 • 21

upvoted an article 15 days ago

Article

How to generate text: using different decoding methods for language generation with Transformers

Mar 1, 2020

• 145

upvoted 8 articles 16 days ago

Article

Decoding Strategies in Large Language Models

•

Oct 29, 2024

• 39

Article

Diving into MiniMax01 405B MoE

•

19 days ago

• 17

Article

Code a simple RAG from scratch

•

Oct 29, 2024

• 19

Article

They Said It Couldn’t Be Done

•

Dec 5, 2024

• 79

Article

RLHF 101: A Technical Dive into RLHF

•

Dec 11, 2024

• 5

Article

Building an AI-powered search engine from scratch

•

Dec 12, 2024

• 9

Article

🦸🏻#2: Your Go-To Vocabulary to Navigate the World of AI Agents and Agentic Workflows

•

Dec 28, 2024

• 10

Article

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

•

Jan 2

• 39