paraschopra/llama-31-8b-instruct-120k-correct-plus-real-low-lr_MERGED Text Generation • Updated Oct 25, 2024 • 10
paraschopra/llama-31-8b-instruct-120k-correct-plus-real-low-lr Text Generation • Updated Oct 23, 2024 • 13
paraschopra/llama-31-8b-instruct-500-samples-min_p-no-system-prompt Text Generation • Updated Oct 22, 2024 • 9
paraschopra/llama-31-8b-instruct-regenerated-100-cot-no-system-prompt_MERGED_SLERP Text Generation • Updated Oct 21, 2024 • 11
paraschopra/llama-31-8b-instruct-regenerated-100-cot-long-system-prompt Text Generation • Updated Oct 21, 2024 • 14
paraschopra/llama-31-8b-instruct-regenerated-100-cot-no-system-prompt Text Generation • Updated Oct 21, 2024 • 12
paraschopra/llama-31-8b-instruct-regenerated-100-cot_MERGED_SLERP Text Generation • Updated Oct 21, 2024 • 10
paraschopra/llama-31-8b-base-all-24k-correct_MERGED_SLERP Text Generation • Updated Oct 18, 2024 • 10
paraschopra/llama-31-8b-base-all-24k-correct_MERGED_LINEAR Text Generation • Updated Oct 18, 2024 • 13
paraschopra/llama-31-8b-base-120k_minp_plus_train_500_v2_ADAPTOR_MERGED Text Generation • Updated Oct 17, 2024 • 11