Model save
README.md
CHANGED
@@ -5,14 +5,10 @@ tags:
 - trl
 - sft
 - generated_from_trainer
-- flash_attention_2
-- QLoRA
 base_model: tiiuae/falcon-7b
 model-index:
 - name: falcon7b-linear-equations
 results: []
-datasets:
-- Menouar/LinearEquations
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -48,7 +44,7 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant
 - lr_scheduler_warmup_ratio: 0.03
--
+- num_epochs: 3
 
 ### Training results
 
adapter_model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:9596d56721cd4fff95ffbef94801ad476222f877115706d310cc6fad60b08e68
 size 1044418808
runs/Jan30_14-19-33_db0b26526538/events.out.tfevents.1706624382.db0b26526538.1078.0
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:bf80c2b9d20b8bc8c926fa7f5ab080bb9fe6d0854822f91d9ca8f84309e1b75f
+size 16938
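The two LFS-tracked files above are stored in the repository as pointer files with `version`, `oid` and `size` lines. A minimal sketch of how a downloaded file could be checked against such a pointer, assuming the file is available locally; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names for illustration, not part of Git LFS or any Hugging Face library:

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid line has the form '<algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and SHA-256 digest named by the pointer."""
    p = Path(path)
    if p.stat().st_size != pointer["size"]:
        return False
    h = hashlib.sha256()
    with p.open("rb") as f:
        # Read in 1 MiB chunks so large adapter files are not loaded into memory at once.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest() == pointer["digest"]


# Pointer text copied from the new side of the events.out.tfevents diff above.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:bf80c2b9d20b8bc8c926fa7f5ab080bb9fe6d0854822f91d9ca8f84309e1b75f
size 16938
"""
pointer = parse_lfs_pointer(POINTER)
```

Calling `matches_pointer(local_path, pointer)` with the path of the downloaded event file would then report whether its size and digest agree with the pointer; the size check runs first because it is cheap and rules out most mismatches before hashing.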