smohammadi committed on
Commit
f4e85c6
1 Parent(s): c4e5ade

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +49 -0
README.md ADDED
@@ -0,0 +1,49 @@
+ ---
+ base_model: meta-llama/Llama-3.2-3B-Instruct
+ datasets:
+ - tatsu-lab/alpaca
+ language: en
+ tags:
+ - torchtune
+ ---
+
+ # my_cool_model
+
+ This model is a finetuned version of [meta-llama/Llama-3.2-3B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct) on the [tatsu-lab/alpaca](https://huggingface.co/tatsu-lab/alpaca) dataset.
+
+ # Model description
+
+ More information needed
+
+ # Training and evaluation results
+
+ More information needed
+
+ # Training procedure
+
+ This model was trained using the [torchtune](https://github.com/pytorch/torchtune) library using the following command:
+
+ ```bash
+ ppo_full_finetune_single_device.py \
+ --config ./target/7B_full_ppo_low_memory_single_device.yaml \
+ device=cuda \
+ metric_logger._component_=torchtune.utils.metric_logging.WandBLogger \
+ metric_logger.project=torchtune_ppo \
+ forward_batch_size=2 \
+ batch_size=64 \
+ ppo_batch_size=32 \
+ gradient_accumulation_steps=16 \
+ compile=True \
+ optimizer._component_=bitsandbytes.optim.PagedAdamW \
+ optimizer.lr=3e-4
+ ```
+
+
+
+ # Framework versions
+
+ - torchtune
+ - torchao 0.5.0
+ - datasets 2.20.0
+ - sentencepiece 0.2.0
+