Ivan Fioravanti PRO

ivanfioravanti

Recent Activity

new activity 14 days ago

ivanfioravanti/Qwen2.5-3B-italian-wine-fp16: Upload folder using huggingface_hub

updated a model 14 days ago

ivanfioravanti/Qwen2.5-3B-italian-wine-fp16

published a model 14 days ago

ivanfioravanti/Qwen2.5-3B-italian-wine-fp16

ivanfioravanti's activity

upvoted a collection about 2 months ago

DolphinLabeled Datasets

Collection

Eric Hartford has added labels to help you filter datasets, for your pleasure. • 5 items • Updated Jan 6 • 14

upvoted an article about 2 months ago

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

Article • Jan 2 • 40

upvoted a paper about 2 months ago

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Paper • 2412.18619 • Published Dec 16, 2024 • 55

upvoted 3 papers 2 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 134

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 346

No More Adam: Learning Rate Scaling at Initialization is All You Need

Paper • 2412.11768 • Published Dec 16, 2024 • 41

upvoted an article 3 months ago

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

Article • Dec 4, 2024 • 77

upvoted an article 4 months ago

Releasing the largest multilingual open pretraining dataset

Article • and 2 others • Nov 13, 2024 • 99

upvoted an article 10 months ago

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

Article • Apr 29, 2024 • 29

upvoted an article 11 months ago

RAG Empowerment: Cohere C4AI Command-R and Transformers Unveiled

Article • Apr 7, 2024 • 10