PolarisEvals/llm_dataset_completness_2stage_justification_score • Updated Jun 13, 2024 • 54.3k • 51
PolarisEvals/shikib_dataset_completeness_2stage_unittest_debug • Updated Jun 12, 2024 • 100 • 55
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts_completeness_2stage_unittest_response • Updated Jun 12, 2024 • 5.47k • 44
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts_completeness_2stage_unittest • Updated Jun 11, 2024 • 912 • 47
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts_filtering_debug • Updated Jun 11, 2024 • 100 • 46
PolarisEvals/trainingcriteria_fct_iter2_fctO16_all_3K_unique_prompts • Updated Jun 11, 2024 • 912 • 44
PolarisEvals/training_criteria_dpo_distill_completeness_2stage_questions_filtering_debug • Updated Jun 5, 2024 • 100 • 49
PolarisEvals/training_criteria_dpo_distill_completeness_2stage_questions • Updated Jun 5, 2024 • 982 • 56
PolarisEvals/training_criteria_dpo_distill_completeness_2stage_gpt-4-0613_outputs_json_True_debug • Updated Jun 4, 2024 • 100 • 56
PolarisEvals/training_criteria_dpo_distill_relevance_gpt-4-0613_outputs_json_True_debug • Updated Jun 3, 2024 • 100 • 61
PolarisEvals/synqa_hudson_300_samples_relevance_gpt-4-0613_outputs_json_True_debug • Updated May 30, 2024 • 100 • 43
PolarisEvals/synqa_hudson_300_samples_completeness_gpt-4-0613_outputs_json_True_debug • Updated May 30, 2024 • 100 • 43
PolarisEvals/synqa_hudson_300_samples_clarity_gpt-4-0613_outputs_json_True_debug • Updated May 30, 2024 • 100 • 44