JunxiongWang/llama3_mamba_0_5_dpo_ep1

Text Generation

alignment-handbook

Generated from Trainer

text-generation-inference

Inference Endpoints


Junxiong Wang committed on Jul 20, 2024

Commit bcdc659 · 1 Parent(s): c73f7ee

add models

Files changed (1)

README.md +6 -4

README.md CHANGED

@@ -1,21 +1,23 @@
 ---
-base_model: /data/junxiong/sft/llama3_0_5_sft_open_not_openhermes_progressive_train_largest_dataset_2e-05/
 tags:
 - alignment-handbook
 - generated_from_trainer
 datasets:
 - HuggingFaceH4/ultrafeedback_binarized
 model-index:
-- name: llama3_0_5_dpo_open_not_openhermes_progressive_train_largest_dataset_2e-05_ep1
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# llama3_0_5_dpo_open_not_openhermes_progressive_train_largest_dataset_2e-05_ep1
-This model is a fine-tuned version of [/data/junxiong/sft/llama3_0_5_sft_open_not_openhermes_progressive_train_largest_dataset_2e-05/](https://huggingface.co//data/junxiong/sft/llama3_0_5_sft_open_not_openhermes_progressive_train_largest_dataset_2e-05/) on the HuggingFaceH4/ultrafeedback_binarized dataset.
 ## Model description

 ---
+base_model: JunxiongWang/llama3_mamba_0_5_sft
 tags:
 - alignment-handbook
 - generated_from_trainer
 datasets:
 - HuggingFaceH4/ultrafeedback_binarized
 model-index:
+- name: JunxiongWang/llama3_mamba_0_5_dpo_ep1
   results: []
 ---
+Please check [here](https://github.com/jxiw/MambaInLlama/tree/main) for details.
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# JunxiongWang/llama3_mamba_0_5_dpo_ep1
+This model is a fine-tuned version of [JunxiongWang/llama3_mamba_0_5_sft](https://huggingface.co/JunxiongWang/llama3_mamba_0_5_sft) on the HuggingFaceH4/ultrafeedback_binarized dataset.
 ## Model description