Spaces: open-llm-leaderboard/open_llm_leaderboard

ssmits/Qwen2.5-95B-Instruct not running

#962

by ssmits - opened about 3 hours ago

ssmits

about 3 hours ago

Hi HF team,

I made ssmits/Qwen2.5-95B-Instruct specifically to check whether a model close to 100B parameters could beat the scores of Qwen2.5-72B-Instruct.
Unfortunately, it doesn't seem to run, or at least I don't see the results showing up. The architecture is exactly the same; it only has ~25 extra layers.

Cheers,
Stijn

clefourrier

Open LLM Leaderboard org about 3 hours ago

Hi @ssmits !
Can you follow the steps in the FAQ and give us the link to the request file?

ssmits

about 2 hours ago

Thank you for your swift response. I checked every step in the FAQ and I think I've followed them correctly. I just found the entry in the request dataset:
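Model: ssmits/Qwen2.5-95B-Instruct
Revision: 9c0e7df57a4fcf4d364efd916a0fc0abdd2d20a3
Precision: bfloat16
Params (B): 94.648
Architecture: Qwen2ForCausalLM
Weight type: Original
Status: RUNNING
Submitted: "2024-09-26T19:13:02"
Model type: 💬 : 💬 chat models (RLHF, DPO, IFT, ...)
Job ID: 8938706
Job started: 2024-09-26T19:13:20.797580
Chat template: true
Sender: ssmits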

Apparently it has been running for 6 days. Is this normal for a model of this size?
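For anyone wanting to check this themselves, here is a minimal sketch of working out how long an entry has been in the RUNNING state. The values are copied from the request entry quoted above; the dictionary key names, the `running_for` helper, and the "now" timestamp are assumptions made up for illustration, not the leaderboard's confirmed schema or tooling.

```python
from datetime import datetime, timedelta

# Values copied from the request entry in this thread. The key names are
# illustrative assumptions, not a confirmed schema of the request dataset.
request = {
    "model": "ssmits/Qwen2.5-95B-Instruct",
    "status": "RUNNING",
    "job_start_time": "2024-09-26T19:13:20.797580",
}


def running_for(entry, now):
    """Return the time elapsed since job start if the entry is RUNNING, else None."""
    if entry["status"] != "RUNNING":
        return None
    started = datetime.fromisoformat(entry["job_start_time"])
    return now - started


# Hypothetical "now", picked exactly six days after the job start.
now = datetime(2024, 10, 2, 19, 13, 20, 797580)
elapsed = running_for(request, now)
print(f"{request['model']} has been running for {elapsed.days} days")
```

In practice `now` would come from `datetime.now()`, and the entry would be read from the request file rather than typed in by hand.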

ssmits changed discussion status to closed about 2 hours ago

ssmits changed discussion status to open about 2 hours ago
