Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2409.17115

about 12 hours ago

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18 • 143
Orion-14B: Open-source Multilingual Large Language Models

Paper • 2401.12246 • Published Jan 20 • 11
MambaByte: Token-free Selective State Space Model

Paper • 2401.13660 • Published Jan 24 • 50
MM-LLMs: Recent Advances in MultiModal Large Language Models

Paper • 2401.13601 • Published Jan 24 • 44

about 20 hours ago

LinFusion: 1 GPU, 1 Minute, 16K Image

Paper • 2409.02097 • Published Sep 3 • 31
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

Paper • 2409.11406 • Published Sep 17 • 25
Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27 • 121
Segment Anything with Multiple Modalities

Paper • 2408.09085 • Published Aug 17 • 21

🫐 ProX Projects

Collection for: "Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale"

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25 • 59
gair-prox/FineWeb-pro

Viewer • Updated Sep 26 • 63.1M • 3.51k • 14
gair-prox/open-web-math-pro

Viewer • Updated Sep 26 • 2.58M • 2.87k • 8
gair-prox/RedPajama-pro

Viewer • Updated Sep 26 • 10.2M • 1.27k • 4

ProX Refining Models

Adapted small language models used to generate data refining programs

gair-prox/web-doc-refining-lm

Text Generation • Updated 27 days ago • 54 • 4
gair-prox/web-chunk-refining-lm

Text Generation • Updated 27 days ago • 35 • 4
gair-prox/math-doc-refining-lm

Text Generation • Updated 27 days ago • 19 • 2
gair-prox/math-chunk-refining-lm

Text Generation • Updated 27 days ago • 27

📑Trending Papers - September 9⃣️

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18 • 125
Attention Heads of Large Language Models: A Survey

Paper • 2409.03752 • Published Sep 5 • 87
Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency

Paper • 2409.02634 • Published Sep 4 • 87
OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17 • 106

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25 • 59

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25 • 59

ProX Math Models

base models trained on ProX curated openwebmath-pro.

gair-prox/Mistral-7B-ProXMath

Text Generation • Updated Sep 28 • 49 • 3
gair-prox/TinyLlama-1.1B-ProXMath

Updated 27 days ago • 8 • 2
gair-prox/Llama-2-7B-ProXMath

Text Generation • Updated 27 days ago • 35 • 1
gair-prox/CodeLlama-7B-ProXMath

Updated 27 days ago • 18 • 1

a collection of pre-training corpora refined by ProX

gair-prox/FineWeb-pro

Viewer • Updated Sep 26 • 63.1M • 3.51k • 14
gair-prox/open-web-math-pro

Viewer • Updated Sep 26 • 2.58M • 2.87k • 8
gair-prox/RedPajama-pro

Viewer • Updated Sep 26 • 10.2M • 1.27k • 4
gair-prox/c4-pro

Viewer • Updated Sep 26 • 40.1M • 1.58k • 4

Agentic-ly agentic

Automated Design of Agentic Systems

Paper • 2408.08435 • Published Aug 15 • 38
On the limits of agency in agent-based models

Paper • 2409.10568 • Published Sep 14 • 12
On the Diagram of Thought

Paper • 2409.10038 • Published Sep 16 • 11
DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?

Paper • 2409.07703 • Published Sep 12 • 66

Previous
1
2
Next

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs