Junxiong Wang commited on
Commit
bcdc659
·
1 Parent(s): c73f7ee

add models

Browse files
Files changed (1) hide show
  1. README.md +6 -4
README.md CHANGED
@@ -1,21 +1,23 @@
1
  ---
2
- base_model: /data/junxiong/sft/llama3_0_5_sft_open_not_openhermes_progressive_train_largest_dataset_2e-05/
3
  tags:
4
  - alignment-handbook
5
  - generated_from_trainer
6
  datasets:
7
  - HuggingFaceH4/ultrafeedback_binarized
8
  model-index:
9
- - name: llama3_0_5_dpo_open_not_openhermes_progressive_train_largest_dataset_2e-05_ep1
10
  results: []
11
  ---
12
 
 
 
13
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
  should probably proofread and complete it, then remove this comment. -->
15
 
16
- # llama3_0_5_dpo_open_not_openhermes_progressive_train_largest_dataset_2e-05_ep1
17
 
18
- This model is a fine-tuned version of [/data/junxiong/sft/llama3_0_5_sft_open_not_openhermes_progressive_train_largest_dataset_2e-05/](https://huggingface.co//data/junxiong/sft/llama3_0_5_sft_open_not_openhermes_progressive_train_largest_dataset_2e-05/) on the HuggingFaceH4/ultrafeedback_binarized dataset.
19
 
20
  ## Model description
21
 
 
1
  ---
2
+ base_model: JunxiongWang/llama3_mamba_0_5_sft
3
  tags:
4
  - alignment-handbook
5
  - generated_from_trainer
6
  datasets:
7
  - HuggingFaceH4/ultrafeedback_binarized
8
  model-index:
9
+ - name: JunxiongWang/llama3_mamba_0_5_dpo_ep1
10
  results: []
11
  ---
12
 
13
+ Please check [here](https://github.com/jxiw/MambaInLlama/tree/main) for details.
14
+
15
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
16
  should probably proofread and complete it, then remove this comment. -->
17
 
18
+ # JunxiongWang/llama3_mamba_0_5_dpo_ep1
19
 
20
+ This model is a fine-tuned version of [JunxiongWang/llama3_mamba_0_5_sft](https://huggingface.co/JunxiongWang/llama3_mamba_0_5_sft) on the HuggingFaceH4/ultrafeedback_binarized dataset.
21
 
22
  ## Model description
23