ldwang

ftgreat

AI & ML interests

LLM, MLLM, Infra

Recent Activity

updated a dataset 3 days ago

BAAI/OpenSeek-Pretrain-Data-Examples

published a dataset 3 days ago

BAAI/OpenSeek-Pretrain-Data-Examples

ldwang's activity

upvoted an article 4 days ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

17 days ago • 7

upvoted a paper 4 days ago

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 147

upvoted a collection 10 days ago

DeepSeek-R1

8 items • Updated Jan 21 • 545

upvoted an article 17 days ago

Article

Large-scale Near-deduplication Behind BigCode

May 16, 2023 • 21

upvoted a paper about 2 months ago

Free Process Rewards without Process Labels

Paper • 2412.01981 • Published Dec 2, 2024 • 32

upvoted a collection about 2 months ago

OpenCoder Datasets

OpenCoder datasets! • 6 items • Updated Nov 15, 2024 • 39

upvoted a paper about 2 months ago

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published Nov 7, 2024 • 115

upvoted 2 collections about 2 months ago

OpenCoder Model

OpenCoder Models • 9 items • Updated Nov 19, 2024 • 10

MiscModels

5 items • Updated 5 days ago • 1

upvoted a paper about 2 months ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25, 2024 • 93

upvoted an article about 2 months ago

Article

Low Latency CPU Based Educational Value Classifier With Generic Educational Value

Jun 12, 2024 • 9

upvoted a collection 2 months ago

Qwen2.5

Qwen2.5 language models: pretrained and instruction-tuned models in 7 sizes (0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B). • 46 items • Updated 3 days ago • 535

upvoted an article 2 months ago

Article

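LLM Data Engineering 3: Data Collection Magic, Methods for Obtaining Top-Tier Training Data (original title: LLM数据工程3——数据收集魔法：获取顶级训练数据的方法)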

Jun 4, 2024 • 17

upvoted 3 collections 2 months ago

Datasets built with ⚗️ distilabel

This collection contains some datasets generated and/or labelled using https://github.com/argilla-io/distilabel • 8 items • Updated Dec 11, 2024 • 12

Synthetic Data Generator

A collection of tools and datasets related to no-code synthetic data generation. • 21 items • Updated 18 days ago • 7

Scaling Test-Time Compute with Open Models

Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated Jan 6 • 23

upvoted a paper 2 months ago

Solving math word problems with process- and outcome-based feedback

Paper • 2211.14275 • Published Nov 25, 2022 • 8

upvoted 3 collections 2 months ago

MiscBlogs

3 items • Updated 4 days ago • 1

MiscTools

Misc tools for LLM & VLM. • 6 items • Updated Dec 23, 2024 • 1

MiscDatasets

4 items • Updated Jan 5 • 1