The Hallucinations Leaderboard, an Open Effort to Measure Hallucinations in Large Language Models Jan 29
pminervini/Mistral-7B-v0.1_bs_2_lr_1e-04_collator_completion_gas_8_ms_1024_lrank_32_ws_64 Updated Apr 22
pminervini/Llama-2-7b-hf_bs_2_lr_3e-04_collator_completion_gas_16_ms_1024_lrank_32_ws_32 Updated Apr 22
pminervini/Llama-2-7b-hf_bs_2_lr_5e-05_collator_completion_gas_8_ms_1024_lrank_32_ws_64 Updated Apr 22
pminervini/Mistral-7B-v0.1_bs_4_lr_5e-05_collator_completion_gas_8_ms_1024_lrank_64_ws_32 Updated Apr 22
pminervini/Mistral-7B-v0.1_bs_4_lr_1e-04_collator_completion_gas_8_ms_1024_lrank_64_ws_64 Updated Apr 22
pminervini/Llama-2-7b-hf_bs_2_lr_5e-05_collator_completion_gas_32_ms_1024_lrank_64_ws_32 Updated Apr 22
pminervini/Llama-2-7b-hf_bs_2_lr_5e-05_collator_completion_gas_8_ms_1024_lrank_32_ws_32 Updated Apr 22
pminervini/Llama-2-7b-hf_bs_4_lr_5e-04_collator_completion_gas_16_ms_1024_lrank_64_ws_64 Updated Apr 22