Edd

Erland

AI & ML interests

None yet

Organizations

None yet

Erland's activity

liked a Space 5 days ago

1.79k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a model 7 days ago

CohereForAI/aya-expanse-8b

Text Generation • Updated Dec 6, 2024 • 34.7k • 335

updated a dataset 9 days ago

Erland/alpaca-cleaned-1000

Viewer • Updated 9 days ago • 1.02k • 54

updated a model 22 days ago

Erland/test_grpo_2

Updated 22 days ago

published a model 22 days ago

Erland/test_grpo_2

Updated 22 days ago

updated a model 24 days ago

Erland/test_grpo

Updated 24 days ago

published a model 24 days ago

Erland/test_grpo

Updated 24 days ago

upvoted a collection 26 days ago

Mistral-Small-24B-2501 (All Versions)

Collection

A collection of Mistral's new Small 2501 models including GGUF, 4-bit and more! • 9 items • Updated 1 day ago • 5

updated a model 26 days ago

Erland/Mistral-Small-24B-Base-ChatML-2501-bnb-4bit

Text Generation • Updated 26 days ago • 98 • 2

published a model 26 days ago

Erland/Mistral-Small-24B-Base-ChatML-2501-bnb-4bit

Text Generation • Updated 26 days ago • 98 • 2

updated a model 29 days ago

Erland/Mistral-Small-24B-Base-2501-bnb-4bit

Text Generation • Updated 29 days ago • 61

published a model 29 days ago

Erland/Mistral-Small-24B-Base-2501-bnb-4bit

Text Generation • Updated 29 days ago • 61

updated a model 29 days ago

Erland/Mistral-Small-24B-Base-2501

Updated 29 days ago

published a model 29 days ago

Erland/Mistral-Small-24B-Base-2501

Updated 29 days ago

updated a model about 1 month ago

Erland/test_lora_cpt

Updated Jan 22

published a model about 1 month ago

Erland/test_lora_cpt

Updated Jan 22

updated a model about 1 month ago

Erland/test_push_lora

Updated Jan 21

published a model about 1 month ago

Erland/test_push_lora

Updated Jan 21

liked a model about 1 month ago

unsloth/DeepSeek-R1-Distill-Qwen-7B-GGUF

Updated Jan 25 • 312k • 76

upvoted a collection about 1 month ago

DeepSeek R1 (All Versions)

Collection

DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated 1 day ago • 202