Upload eval_results/HuggingFaceH4/mistral-7b-dpo/v33.11/winogrande/results_2024-03-21T23-52-16.566112.json with huggingface_hub 4a01c53 verified edbeeching HF staff commited on Mar 21, 2024
Upload eval_results/HuggingFaceH4/mistral-7b-dpo/v33.11/mmlu/results_2024-03-21T08-25-17.094972.json with huggingface_hub 148fe0c verified edbeeching HF staff commited on Mar 21, 2024
Upload eval_results/HuggingFaceH4/mistral-7b-dpo/v33.11/gsm8k/results_2024-03-20T23-43-15.016787.json with huggingface_hub de172da verified edbeeching HF staff commited on Mar 20, 2024
Upload eval_results/HuggingFaceH4/mistral-7b-dpo/v33.11/hellaswag/results_2024-03-20T23-31-21.436584.json with huggingface_hub 50246ae verified edbeeching HF staff commited on Mar 20, 2024
Upload eval_results/HuggingFaceH4/mistral-7b-dpo/v33.11/arc/results_2024-03-20T23-24-19.155896.json with huggingface_hub afe87f6 verified edbeeching HF staff commited on Mar 20, 2024
Upload eval_results/HuggingFaceH4/mistral-7b-dpo/v33.11/truthfulqa/results_2024-03-20T23-24-03.079431.json with huggingface_hub bd27b14 verified edbeeching HF staff commited on Mar 20, 2024
Upload eval_results/HuggingFaceH4/mistral-7b-dpo/v33.11/winogrande/results_2024-03-20T23-23-36.786604.json with huggingface_hub cf896c3 verified edbeeching HF staff commited on Mar 20, 2024
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-sft/v1.0/ifeval/results_2024-03-20T10-25-32.851161.json with huggingface_hub afa0a21 verified lewtun HF staff commited on Mar 20, 2024
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-sft/v1.0/bbh/results_2024-03-20T10-20-40.831195.json with huggingface_hub 4d642d8 verified lewtun HF staff commited on Mar 20, 2024
Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v1.5/bbh/results_2024-03-19T09-27-11.722655.json with huggingface_hub acad715 verified lewtun HF staff commited on Mar 19, 2024
Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v1.4/bbh/results_2024-03-19T09-26-27.067508.json with huggingface_hub d64ecd4 verified lewtun HF staff commited on Mar 19, 2024
Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v1.0/bbh/results_2024-03-19T09-26-07.598400.json with huggingface_hub fdb6355 verified lewtun HF staff commited on Mar 19, 2024
Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v1.2/bbh/results_2024-03-19T09-26-04.565494.json with huggingface_hub 92d255d verified lewtun HF staff commited on Mar 19, 2024
Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v1.3/bbh/results_2024-03-19T09-25-53.215602.json with huggingface_hub fe6a135 verified lewtun HF staff commited on Mar 19, 2024
Upload eval_results/mistralai/Mixtral-8x7B-Instruct-v0.1/main/bbh/results_2024-03-18T20-58-12.014656.json with huggingface_hub 1956f9c verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/HuggingFaceH4/zephyr-7b-beta/main/gsm8k/results_2024-03-18T20-52-36.143317.json with huggingface_hub 406cb35 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/deepseek-ai/deepseek-llm-67b-chat/main/bbh/results_2024-03-18T20-51-01.093533.json with huggingface_hub a866518 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v0.2/gsm8k/results_2024-03-18T20-49-24.740849.json with huggingface_hub 4ae95c5 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/google/gemma-7b-it/main/gsm8k/results_2024-03-18T20-47-15.598662.json with huggingface_hub e7751e6 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/NousResearch/Nous-Hermes-2-Yi-34B/main/bbh/results_2024-03-18T20-43-57.392460.json with huggingface_hub ae13434 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/Qwen/Qwen1.5-72B-Chat/main/bbh/results_2024-03-18T20-40-36.554904.json with huggingface_hub af53d82 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/google/gemma-2b-it/main/gsm8k/results_2024-03-18T20-39-56.154693.json with huggingface_hub 40e84db verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO/main/bbh/results_2024-03-18T20-39-32.747348.json with huggingface_hub d30acf6 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/google/gemma-7b-it/main/bbh/results_2024-03-18T20-38-30.588009.json with huggingface_hub 4c6f00d verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/google/gemma-2b-it/main/bbh/results_2024-03-18T20-36-40.384973.json with huggingface_hub 589bc06 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/HuggingFaceH4/zephyr-7b-beta/main/bbh/results_2024-03-18T20-36-18.052319.json with huggingface_hub 6cb7ce4 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/HuggingFaceH4/zephyr-7b-beta-ift/v0.2/bbh/results_2024-03-18T20-33-46.216888.json with huggingface_hub ba09d8b verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/Qwen/Qwen1.5-14B-Chat/main/bbh/results_2024-03-18T20-23-46.729702.json with huggingface_hub 737bf1f verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/Qwen/Qwen1.5-4B-Chat/main/bbh/results_2024-03-18T20-20-24.322898.json with huggingface_hub 1b733bf verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/Qwen/Qwen1.5-1.8B-Chat/main/bbh/results_2024-03-18T20-11-24.511185.json with huggingface_hub 754f9ed verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/teknium/OpenHermes-2.5-Mistral-7B/main/bbh/results_2024-03-18T19-49-31.908303.json with huggingface_hub 40f3905 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/bbh/results_2024-03-18T19-43-00.075213.json with huggingface_hub 7add925 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/ifeval/results_2024-03-18T19-36-03.767073.json with huggingface_hub 73b2107 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/mmlu/results_2024-03-18T17-01-47.545823.json with huggingface_hub b9d64e1 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/gsm8k/results_2024-03-18T17-01-30.856762.json with huggingface_hub 29394c9 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/hellaswag/results_2024-03-18T16-55-05.213121.json with huggingface_hub 8e02df0 verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/arc/results_2024-03-18T16-50-06.159327.json with huggingface_hub 1a1c5de verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/truthfulqa/results_2024-03-18T16-49-56.225132.json with huggingface_hub 3c8fa5b verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/Qwen/Qwen1.5-0.5B-Chat/main/winogrande/results_2024-03-18T16-49-32.523482.json with huggingface_hub 64a7a0d verified lewtun HF staff commited on Mar 18, 2024
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.13/gsm8k/results_2024-03-18T16-42-34.222418.json with huggingface_hub 221b580 verified edbeeching HF staff commited on Mar 18, 2024
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.13/ifeval/results_2024-03-18T16-40-04.435577.json with huggingface_hub e9088c8 verified edbeeching HF staff commited on Mar 18, 2024
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.9/gsm8k/results_2024-03-18T15-21-24.552464.json with huggingface_hub 437b3d5 verified edbeeching HF staff commited on Mar 18, 2024
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.9/ifeval/results_2024-03-18T15-18-41.904972.json with huggingface_hub 06c723e verified edbeeching HF staff commited on Mar 18, 2024
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.6/gsm8k/results_2024-03-16T22-45-24.270002.json with huggingface_hub 47e1f74 verified edbeeching HF staff commited on Mar 16, 2024
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.7/gsm8k/results_2024-03-16T22-43-25.759701.json with huggingface_hub 63ae265 verified edbeeching HF staff commited on Mar 16, 2024
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.8/gsm8k/results_2024-03-16T22-43-20.193012.json with huggingface_hub f7627c3 verified edbeeching HF staff commited on Mar 16, 2024
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.6/ifeval/results_2024-03-16T22-42-33.074718.json with huggingface_hub 2836f47 verified edbeeching HF staff commited on Mar 16, 2024
Upload eval_results/HuggingFaceH4/qwen-1.5-1.8b-dpo/v0.7/ifeval/results_2024-03-16T22-41-15.459095.json with huggingface_hub 3374a12 verified edbeeching HF staff commited on Mar 16, 2024