Commit History

Update README.md
1dc36e7
verified

eliebak HF staff committed on

Delete eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B/main/aime24
b980c45
verified

edbeeching HF staff committed on

Upload ./data/eval/Llama-8B-Instruct-OT114k-linear-1e-5-adamw_torch-4k/math_500/6fa5af8a2cb06891041c06cda616db6aadf2bcc1/results_2025-01-30T18-29-34.579225.json with huggingface_hub
7d8dca8
verified

eliebak HF staff committed on

Update app.py
9b14f93
verified

edbeeching HF staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Llama-70B/main/math_500/results_2025-01-29T16-38-54.088382.json with huggingface_hub
88b8ff5
verified

edbeeching HF staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B/main/math_500/results_2025-01-29T16-35-05.004956.json with huggingface_hub
4ab7102
verified

edbeeching HF staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B/main/math_500/results_2025-01-29T16-21-19.161811.json with huggingface_hub
41cc0ab
verified

edbeeching HF staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Llama-8B/main/math_500/results_2025-01-29T16-19-05.697532.json with huggingface_hub
7c6ca2a
verified

edbeeching HF staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B/main/math_500/results_2025-01-29T16-17-35.586793.json with huggingface_hub
3b48d21
verified

edbeeching HF staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B/main/math_500/results_2025-01-29T16-16-01.453401.json with huggingface_hub
da4d084
verified

edbeeching HF staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B/main/math_500/results_2025-01-29T15-52-00.573631.json with huggingface_hub
d140837
verified

edbeeching HF staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B/main/aime24/results_2025-01-29T15-50-22.917041.json with huggingface_hub
2802f4f
verified

edbeeching HF staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B/main/math_500/results_2025-01-29T15-42-59.056415.json with huggingface_hub
d2290ef
verified

edbeeching HF staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B/main/math_500/results_2025-01-29T15-33-18.290562.json with huggingface_hub
edcad84
verified

edbeeching HF staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B/main/aime24/results_2025-01-29T15-41-23.502842.json with huggingface_hub
b0cc1eb
verified

edbeeching HF staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B/main/aime24/results_2025-01-29T15-31-36.985431.json with huggingface_hub
7f184b4
verified

edbeeching HF staff committed on

Delete eval_results
82185cf
verified

edbeeching HF staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B/main/math_500/results_2025-01-24T21-17-55.827086.json with huggingface_hub
38a75ac
verified

edbeeching HF staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B/main/aime24/results_2025-01-24T21-14-32.206445.json with huggingface_hub
b902323
verified

edbeeching HF staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B/main/math_500/results_2025-01-24T21-14-25.999625.json with huggingface_hub
3f56ae7
verified

edbeeching HF staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B/main/aime24/results_2025-01-24T21-12-48.188891.json with huggingface_hub
a5cc180
verified

edbeeching HF staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B/main/math_500/results_2025-01-24T20-57-06.444163.json with huggingface_hub
c3b4156
verified

edbeeching HF staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B/main/math_500/results_2025-01-24T20-45-32.875811.json with huggingface_hub
cd84348
verified

edbeeching HF staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B/main/math_500/results_2025-01-24T20-28-49.837445.json with huggingface_hub
806f288
verified

edbeeching HF staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B/main/math_500/results_2025-01-24T20-16-40.303811.json with huggingface_hub
64e18b7
verified

edbeeching HF staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B/main/aime24/results_2025-01-24T20-16-15.533746.json with huggingface_hub
69fd962
verified

edbeeching HF staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B/main/math_500/results_2025-01-24T20-12-51.523364.json with huggingface_hub
91ce8e6
verified

edbeeching HF staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B/main/aime24/results_2025-01-24T20-12-27.254461.json with huggingface_hub
b1e0442
verified

edbeeching HF staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B/main/math_500/results_2025-01-24T20-09-47.638338.json with huggingface_hub
9873408
verified

edbeeching HF staff committed on

Upload eval_results/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B/main/aime24/results_2025-01-24T20-09-23.001757.json with huggingface_hub
7ca694a
verified

edbeeching HF staff committed on

Upload eval_results/Qwen/Qwen2.5-Math-7B-Instruct/main/mini_math/results_2025-01-23T15-02-00.811603.json with huggingface_hub
ea3a1b8
verified

edbeeching HF staff committed on

Upload eval_results/Qwen/Qwen2.5-Math-7B-Instruct/main/aime24/results_2025-01-23T13-22-02.959452.json with huggingface_hub
e3ba967
verified

edbeeching HF staff committed on

Upload eval_results/Qwen/Qwen2.5-Math-7B-Instruct/main/math_500/results_2025-01-23T13-14-33.659457.json with huggingface_hub
b2277ae
verified

edbeeching HF staff committed on

Upload eval_results/Qwen/Qwen2.5-Math-7B-Instruct/main/mini_math/results_2025-01-23T12-18-33.129675.json with huggingface_hub
9e56168
verified

edbeeching HF staff committed on

Upload eval_results/Qwen/Qwen2.5-Math-72B-Instruct/main/mini_math/results_2025-01-23T11-06-46.178701.json with huggingface_hub
02ca583
verified

edbeeching HF staff committed on

Upload eval_results/Qwen/Qwen2.5-Math-1.5B-Instruct/main/mini_math/results_2025-01-23T09-04-03.274525.json with huggingface_hub
2633aab
verified

edbeeching HF staff committed on

Upload eval_results/Qwen/Qwen2.5-Math-1.5B-Instruct/main/mini_math/results_2025-01-23T08-39-01.024495.json with huggingface_hub
c06cbaf
verified

edbeeching HF staff committed on

Upload eval_results/Qwen/Qwen2.5-Math-1.5B-Instruct/main/mini_math/results_2025-01-22T16-09-57.737882.json with huggingface_hub
56a41e1
verified

edbeeching HF staff committed on

Upload eval_results/Qwen/Qwen2.5-Math-1.5B-Instruct/main/mini_math/results_2025-01-22T16-03-02.199265.json with huggingface_hub
d7a71e9
verified

edbeeching HF staff committed on

Upload eval_results/Qwen/Qwen2.5-Math-1.5B-Instruct/main/mini_math/results_2025-01-22T09-52-48.258956.json with huggingface_hub
3800128
verified

edbeeching HF staff committed on

Upload eval_results/Qwen/Qwen2.5-Math-1.5B-Instruct/main/mini_math/results_2025-01-22T10-02-23.802646.json with huggingface_hub
2e0e2c9
verified

edbeeching HF staff committed on

Upload eval_results/meta-llama/Llama-3.2-3B-Instruct/main/math/results_2024-10-06T16-58-55.587787.json with huggingface_hub
04f3d33
verified

lewtun HF staff committed on

Upload eval_results/meta-llama/Llama-3.2-1B-Instruct/main/math/results_2024-10-06T12-42-55.468838.json with huggingface_hub
a2d5010
verified

lewtun HF staff committed on

Upload eval_results/meta-llama/Llama-2-70b-chat-hf/main/alpaca_eval/results_2024-10-04T08-50-36.json with huggingface_hub
2cbd9c4
verified

lewtun HF staff committed on

Upload eval_results/meta-llama/Llama-2-70b-chat-hf/main/mixeval_hard/results_2024-10-04T08-18-18.json with huggingface_hub
15e429d
verified

lewtun HF staff committed on

Upload eval_results/HuggingFaceH4/zephyr-7b-beta/main/alpaca_eval/results_2024-10-04T08-06-22.json with huggingface_hub
23e3ed7
verified

lewtun HF staff committed on

Upload eval_results/mistralai/Mistral-7B-Instruct-v0.1/main/ifeval/results_2024-10-04T08-02-52.262386.json with huggingface_hub
eb38ba0
verified

lewtun HF staff committed on

Upload eval_results/mistralai/Mistral-7B-Instruct-v0.1/main/alpaca_eval/results_2024-10-04T08-00-59.json with huggingface_hub
2cfc3a4
verified

lewtun HF staff committed on

Upload eval_results/HuggingFaceH4/zephyr-7b-beta/main/mixeval_hard/results_2024-10-04T07-54-40.json with huggingface_hub
da9d439
verified

lewtun HF staff committed on

Upload eval_results/mistralai/Mistral-7B-Instruct-v0.1/main/mixeval_hard/results_2024-10-04T07-53-05.json with huggingface_hub
333bb2c
verified

lewtun HF staff committed on