Librarian Bots

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

davanstrien updated a dataset about 5 hours ago

librarian-bots/model_cards_with_metadata

librarian-bot new activity about 7 hours ago

librarian-bots/dataset-to-model-monitor:Discussion tracking new models trained on imdb

librarian-bot new activity about 7 hours ago

librarian-bots/dataset-to-model-monitor:Discussion tracking new models trained on google/fleurs

View all activity

librarian-bots's activity

davanstrien

updated a dataset about 5 hours ago

librarian-bots/model_cards_with_metadata

Preview • Updated about 5 hours ago • 1.07k • 13

davanstrien

posted an update about 6 hours ago

Post

173

📊 Introducing "Hugging Face Dataset Spotlight" 📊

I'm excited to share the first episode of our AI-generated podcast series focusing on nice datasets from the Hugging Face Hub!

This first episode explores mathematical reasoning datasets:

- SynthLabsAI/Big-Math-RL-Verified: Over 250,000 rigorously verified problems spanning multiple difficulty levels and mathematical domains
- open-r1/OpenR1-Math-220k: 220,000 math problems with multiple reasoning traces, verified for accuracy using Math Verify and Llama-3.3-70B models.
- facebook/natural_reasoning: 1.1 million general reasoning questions carefully deduplicated and decontaminated from existing benchmarks, showing superior scaling effects when training models like Llama3.1-8B-Instruct.

Plus a bonus segment on bespokelabs/bespoke-manim!

https://www.youtube.com/watch?v=-TgmRq45tW4

librarian-bot

in librarian-bots/dataset-to-model-monitor about 7 hours ago

Discussion tracking new models trained on imdb

195

#1 opened over 1 year ago by

librarian-bot

Discussion tracking new models trained on google/fleurs

192

#6 opened over 1 year ago by

librarian-bot

davanstrien

updated a dataset about 15 hours ago

librarian-bots/dataset_cards_with_metadata

Viewer • Updated about 15 hours ago • 219k • 1.05k • 12

librarian-bot

updated a dataset about 16 hours ago

librarian-bots/paper-recommendations-v2

Viewer • Updated about 16 hours ago • 4.27k • 1.4k • 5

davanstrien

updated a dataset about 16 hours ago

librarian-bots/dataset-columns

Viewer • Updated about 16 hours ago • 4.95M • 571

librarian-bot

in librarian-bots/dataset-to-model-monitor about 19 hours ago

Discussion tracking new models trained on HuggingFaceH4/ultrafeedback_binarized

429

#37 opened about 1 year ago by

librarian-bot

Discussion tracking new models trained on BAAI/TACO

#50 opened 11 months ago by

librarian-bot

Discussion tracking new models trained on HuggingFaceH4/ultrachat_200k

420

#15 opened over 1 year ago by

librarian-bot

davanstrien

posted an update 1 day ago

Post

1801

Quick POC: Turn a Hugging Face dataset card into a short podcast introducing the dataset using all open models.

I think I'm the only weirdo who would enjoy listening to something like this though 😅

Here is an example for eth-nlped/stepverify