2 5 25

Mert Ege

mertege

mertege

AI & ML interests

None yet

Recent Activity

liked a Space 9 days ago

nanotron/ultrascale-playbook

liked a model 10 days ago

ALLaM-AI/ALLaM-7B-Instruct-preview

upvoted a paper about 1 month ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

View all activity

Organizations

mertege's activity

liked a Space 9 days ago

1.79k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a model 10 days ago

ALLaM-AI/ALLaM-7B-Instruct-preview

Text Generation • Updated 11 days ago • 4.1k • 55

upvoted a paper about 1 month ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 335

liked 2 models about 1 month ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • Updated 5 days ago • 1.26M • • 1.2k

deepseek-ai/DeepSeek-R1

Text Generation • Updated 5 days ago • 4.63M • • 10.5k

upvoted a paper 2 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 347

New activity in kashif/gkd_openassistant-guanaco 4 months ago

Chat template on GKD Trainer

#1 opened 4 months ago by

mertege

liked a dataset 5 months ago

abdoelsayed/Open-ArabicaQA

Preview • Updated Mar 27, 2024 • 226 • 4

liked a dataset 6 months ago

BAAI/Infinity-Instruct

Viewer • Updated 4 days ago • 20.4M • 5.34k • 595

liked a model 6 months ago

maywell/Qwen2-7B-Multilingual-RP

Text Generation • Updated Jun 25, 2024 • 3.25k • 54

liked a dataset 6 months ago

macadeliccc/opus_samantha

Viewer • Updated Jun 21, 2024 • 3.19k • 95 • 21

liked 3 models 6 months ago

liked a Space 6 months ago

136

Open Arabic LLM Leaderboard

🏆

Track, rank and evaluate open Arabic LLMs and chatbots

upvoted an article 7 months ago

Article

Fit More and Train Faster With ZeRO via DeepSpeed and FairScale

Jan 19, 2021

• 4

liked a model 7 months ago

haoranxu/ALMA-13B-Pretrain

Text Generation • Updated Oct 5, 2024 • 1.63k • 9

liked a dataset 8 months ago

mlfoundations/dclm-baseline-1.0

Preview • Updated Jul 22, 2024 • 237k • 204

upvoted a paper 8 months ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25, 2024 • 93

liked a Space 8 months ago

Magpie

🐦

Generate and rate instruction-response pairs