Librarian Bot (Bot)

librarian-bot

https://huggingface.co./librarian-bots

AI & ML interests

I am a friendly librarian bot working to help improve metadata on the 🤗 hub. Run by Hugging Face'ss Machine Learning Librarian (HF username: davanstrien)

Recent Activity

commented on a paper about 2 hours ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

updated a collection about 2 hours ago

Alpaca Style Datasets

updated a collection about 2 hours ago

Alpaca Style Datasets

View all activity

Organizations

librarian-bot's activity

commented a paper about 2 hours ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published 3 days ago • 34 •

New activity in librarian-bots/dataset-to-model-monitor about 5 hours ago

Discussion tracking new models trained on HuggingFaceH4/ultrafeedback_binarized

411

#37 opened about 1 year ago by

librarian-bot

Discussion tracking new models trained on argilla/ultrafeedback-binarized-preferences-cleaned

#43 opened about 1 year ago by

librarian-bot

Discussion tracking new models trained on LDJnr/Capybara

236

#33 opened about 1 year ago by

librarian-bot

Discussion tracking new models trained on HuggingFaceH4/ultrachat_200k

403

#15 opened about 1 year ago by

librarian-bot

New activity in librarian-bots/dataset-to-model-monitor about 17 hours ago

Discussion tracking new models trained on BAAI/TACO

#50 opened 10 months ago by

librarian-bot

Discussion tracking new models trained on Open-Orca/OpenOrca

263

#19 opened about 1 year ago by

librarian-bot

commented a paper 1 day ago

SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer

Paper • 2501.18427 • Published 4 days ago • 12 •

commented 9 papers 2 days ago

PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding

Paper • 2501.16411 • Published 6 days ago • 16 •

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

Paper • 2501.18362 • Published 4 days ago • 17 •

GuardReasoner: Towards Reasoning-based LLM Safeguards

Paper • 2501.18492 • Published 3 days ago • 63 •

CowPilot: A Framework for Autonomous and Human-Agent Collaborative Web Navigation

Paper • 2501.16609 • Published 6 days ago • 5 •

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

Paper • 2501.18512 • Published 3 days ago • 21 •

o3-mini vs DeepSeek-R1: Which One is Safer?

Paper • 2501.18438 • Published 3 days ago • 17 •

New activity in librarian-bots/dataset-to-model-monitor 3 days ago

Discussion tracking new models trained on google/fleurs

184

#6 opened over 1 year ago by

librarian-bot

commented 2 papers 3 days ago

Optimizing Large Language Model Training Using FP4 Quantization

Paper • 2501.17116 • Published 5 days ago • 27 •

Histoires Morales: A French Dataset for Assessing Moral Alignment

Paper • 2501.17117 • Published 5 days ago • 3 •