Alina Lozovskaya
alozowski
AI & ML interests
NLP in all aspects
Organizations
alozowski's activity
model evaluation failed
2
#929 opened 7 days ago
by
FuJhen
model evaluation failed
7
#895 opened 28 days ago
by
thomas-yanxin
Not results found in LLM Benchmark and no in running evaluation queue
5
#938 opened 3 days ago
by
xinchen9
not show in Open-llm-leaderboard
#18 opened 2 days ago
by
legolasyiu
Model fail, re-eval request 😊
8
#885 opened about 1 month ago
by
dnhkng
How to calculate GPQA score?
4
#928 opened 7 days ago
by
JJaeuk
🚩 Report: Not working
1
#939 opened 3 days ago
by
Lyte
Submitted model not found in any queue
1
#937 opened 4 days ago
by
bedio
Why failed
1
#936 opened 4 days ago
by
DZgas
Feature Request: add error details summary to request file when a model fails
2
#935 opened 4 days ago
by
CombinHorizon
Unable to submit model, due to "Unknown model size" (vilm/Quyen-Pro-Max-v0.1)
1
#934 opened 4 days ago
by
CombinHorizon
FAILED MODELS
2
#933 opened 5 days ago
by
MaziyarPanahi
Failed models
1
#932 opened 5 days ago
by
ThiloteE
[BUG] in the evaluation
1
#931 opened 6 days ago
by
DeepMount00
Failed model
1
#930 opened 6 days ago
by
legolasyiu
failed
6
#59 opened 6 days ago
by
legolasyiu
Regarding evaluation code version.
1
#58 opened 7 days ago
by
bedio
Renaming Fireball-Alpaca-Llama3.1.01-8B-Philos.
3
#16 opened 5 days ago
by
legolasyiu
Not show models in LLM leaderboard
1
#17 opened 4 days ago
by
legolasyiu
EpistemeAI/Athene-codegemma-2-7b-it-alpaca-v1.3 Benchmark disappered
2
#927 opened 8 days ago
by
legolasyiu
bump-up-huggingface-hub
5
#926 opened 8 days ago
by
alozowski
fix-adapters
6
#925 opened 8 days ago
by
alozowski
manage-dependencies
9
#923 opened 9 days ago
by
alozowski
Model not showing up on Voting panel after Submitting
10
#919 opened 13 days ago
by
alvations
removing model under evaluation.
2
#922 opened 9 days ago
by
bedio
Why are there two different experiment results for GPT-2 on the leaderboard?
1
#1 opened 10 days ago
by
simwit
How to add task to the leaderboard?
2
#921 opened 11 days ago
by
alvations
check-submit
5
#920 opened 11 days ago
by
alozowski
Missing Llama 3.1 405B
1
#15 opened 13 days ago
by
lukestanley
Changed model - EpistemeAI/Athena-gemma-2-2b-it
6
#917 opened 14 days ago
by
legolasyiu
Model evaluation failed
1
#916 opened 15 days ago
by
CoolSpring
bump-up-gradio
5
#918 opened 14 days ago
by
alozowski
Running Evaluation Queue appears to be stuck
1
#915 opened 16 days ago
by
Gryphe
Model evaluation failed for 4bit model
7
#902 opened 23 days ago
by
vihangd
Can't login error
2
#914 opened 16 days ago
by
legolasyiu
Upload added_IVF548_Flat_nprobe_1_HOUSHANG_v2.index
4
#913 opened 16 days ago
by
Huschang
Upload HOUSHANG.pth
5
#912 opened 16 days ago
by
Huschang
IFEval reproduction problem
8
#911 opened 17 days ago
by
LamTungTran
Still pending
6
#900 opened 25 days ago
by
legolasyiu
Incomplete model
1
#909 opened 19 days ago
by
MaziyarPanahi
bump-up-transformers
5
#910 opened 18 days ago
by
alozowski
leaderboard should be more curated
7
#908 opened 21 days ago
by
ehartford
Question: same model with very different scores
2
#904 opened 23 days ago
by
Yuma42
Failed model (anthracite-org/magnum-v2.5-12b-kto)
1
#905 opened 22 days ago
by
CombinHorizon
Phi-3.5 fine-tuned failed
1
#907 opened 21 days ago
by
MaziyarPanahi
Gated models
1
#903 opened 23 days ago
by
djstrong
add-model-generation
5
#906 opened 21 days ago
by
alozowski
phi-3-small-128k MATH Lvl 5 is 0
1
#897 opened 26 days ago
by
huu-ontocord
Model evaluations failed
4
#884 opened about 1 month ago
by
DavidGF
Incorrect ifeval benchmark
5
#879 opened about 1 month ago
by
DavidGF
all failed tests
1
#57 opened 24 days ago
by
legolasyiu
Model Failed: StableProse
3
#894 opened 28 days ago
by
nlpguy