chameleon-lizard committed
Commit 7e48234 · verified · 1 Parent(s): 77c7453

Added README

Files changed (1)
  1. README.md +34 -3
README.md CHANGED
@@ -1,3 +1,34 @@
- ---
- license: apache-2.0
- ---
+ ---
+ license: apache-2.0
+ datasets:
+ - SubMaroon/DTF_Comments_Responses_Counts
+ language:
+ - ru
+ base_model:
+ - unsloth/Qwen2.5-7B
+ pipeline_tag: text-generation
+ ---
+
+ A continually pretrained version of the unsloth/Qwen2.5-7B model, trained with Unsloth's low-rank adaptation (LoRA) on a dataset of [DTF](dtf.ru) posts.
+
+ For pretraining, posts from [SubMaroon/DTF_Comments_Responses_Counts](https://huggingface.co/datasets/SubMaroon/DTF_Comments_Responses_Counts) were selected, deduplicated with a simple `df.unique`, and filtered by length to keep only posts with 1000 < x < 128000 tokens.
+
+ Hyperparameters:
+
+ ```
+ num_train_epochs=2
+ train_batch_size=8
+ gradient_accumulation_steps=16
+ gradient_checkpointing=False
+ optim="adamw_8bit"
+ weight_decay=4e-2
+ bf16=True
+ learning_rate=5e-5
+ lr_scheduler_type="cosine"
+ packing=True
+ seed=42
+ ```
+
+ [Wandb](https://wandb.ai/a_okshus/DTF_comments/runs/fr5hfq6g?nw=nwusera_okshus)
+
+ [GitHub: TODO]()
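
Review note: the README describes the preprocessing only in one sentence (deduplicate, then keep posts with 1000 < x < 128000 tokens). Since the training code is not linked yet, here is a minimal, dependency-free sketch of what that selection step could look like. It is not the author's implementation. The function names `select_posts` and `count_tokens_whitespace` are hypothetical, and the whitespace token counter is only a stand-in; the actual run presumably counted tokens with the model's tokenizer.

```python
from typing import Callable, Iterable, List


def count_tokens_whitespace(text: str) -> int:
    """Stand-in token counter (assumption): counts whitespace-separated words.

    A real pipeline would likely use the base model's tokenizer instead.
    """
    return len(text.split())


def select_posts(
    posts: Iterable[str],
    count_tokens: Callable[[str], int] = count_tokens_whitespace,
    min_tokens: int = 1000,
    max_tokens: int = 128000,
) -> List[str]:
    """Drop exact duplicates, then keep posts with min_tokens < n < max_tokens."""
    # dict.fromkeys keeps the first occurrence of each post and preserves order.
    unique_posts = list(dict.fromkeys(posts))
    # Both bounds are strict, matching the "1000 < x < 128000" wording.
    return [p for p in unique_posts if min_tokens < count_tokens(p) < max_tokens]
```

Passing a different `count_tokens` callable (for example, one that wraps a real tokenizer) swaps the length measure without touching the selection logic.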