Daniel van Strien's picture

Daniel van Strien PRO

davanstrien

·

https://danielvanstrien.xyz/

AI & ML interests

Machine Learning Librarian

Recent Activity

updated a dataset 19 minutes ago

data-is-better-together/fineweb-c-progress

updated a dataset 31 minutes ago

librarian-bots/model_cards_with_metadata

updated a dataset 32 minutes ago

davanstrien/grpo-completions

View all activity

Organizations

davanstrien's activity

upvoted 3 collections 1 day ago

rank1

rank1 is the first test-time compute reasoning model in IR • 15 items • Updated about 22 hours ago • 3

OWLS: Scaling Laws for Speech Recognition and Translation

🦉 A suite of Whisper-style models from 250M to 18B parameters. Trained on up to 360K hours of data. • 6 items • Updated 3 days ago • 3

Granite 3.2 Language Models

3 items • Updated 2 days ago • 8

upvoted 2 papers 3 days ago

Minions: Cost-efficient Collaboration Between On-device and Cloud Language Models

Paper • 2502.15964 • Published 7 days ago • 1

"Actionable Help" in Crises: A Novel Dataset and Resource-Efficient Models for Identifying Request and Offer Social Media Posts

Paper • 2502.16839 • Published 4 days ago • 1

upvoted a collection 3 days ago

Slam

All resources for SpeechLMs from "Slamming: Training a Speech Language Model on One GPU in a Day". We provide tokeniser, lm, and datasets • 6 items • Updated 3 days ago • 12

upvoted a paper 3 days ago

Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models

Paper • 2502.17387 • Published 4 days ago • 3

upvoted 5 collections 3 days ago

Embeddings

1 item • Updated Dec 18, 2023 • 1

Finetuned models

3 items • Updated Dec 18, 2023 • 1

Acoustic models

3 items • Updated Dec 18, 2023 • 1

Text models

3 items • Updated Dec 18, 2023 • 1

KB-Whisper

Whisper models trained on over 50,000 hours of Swedish speech data. • 5 items • Updated 14 days ago • 4

upvoted a collection 8 days ago

ModernGLiClass

GLiClass with ModernBERT backbone • 2 items • Updated 9 days ago • 6

upvoted a collection 9 days ago

Open Image Preferences

Containing all artifacts for the Stable Diffusion 3.5L vs Flux Dev image preference community sprint. • 14 items • Updated Dec 19, 2024 • 9

upvoted a collection 10 days ago

Domain Classifiers

4 items • Updated 17 days ago • 2

upvoted a collection 11 days ago

hub-tldr

Creating a smol model for tl;dr-ing the hub • 4 items • Updated 11 days ago • 2

upvoted an article 13 days ago

Article

Faster fine-tuning using TRL & Unsloth

Jan 10, 2024

• 52

upvoted a collection 13 days ago

Maths reasoning

Maths reasoning datasets found using https://huggingface.co./spaces/librarian-bots/huggingface-datasets-semantic-search • 14 items • Updated 14 days ago • 2

upvoted a paper 14 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 24 days ago • 195

upvoted an article 15 days ago

Article

From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub

17 days ago

• 49