Femboyuwu2000 commited on
Commit
521cbeb
1 Parent(s): 1a4bf79

Femboyuwu2000/bloomz-1b1-vn-chat

Browse files
README.md CHANGED
@@ -39,12 +39,14 @@ The following hyperparameters were used during training:
39
  - train_batch_size: 2
40
  - eval_batch_size: 8
41
  - seed: 42
42
- - gradient_accumulation_steps: 4
43
- - total_train_batch_size: 8
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: cosine
46
- - training_steps: 1000
47
- - mixed_precision_training: Native AMP
 
 
 
 
48
 
49
  ### Framework versions
50
 
 
39
  - train_batch_size: 2
40
  - eval_batch_size: 8
41
  - seed: 42
 
 
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: cosine
44
+ - lr_scheduler_warmup_ratio: 0.03
45
+ - training_steps: 60
46
+
47
+ ### Training results
48
+
49
+
50
 
51
  ### Framework versions
52
 
wandb/debug-internal.log CHANGED
@@ -108,3 +108,4 @@ subprocess.TimeoutExpired: Command '['conda', 'env', 'export']' timed out after
108
  2024-04-12 07:28:13,812 DEBUG SenderThread:216 [sender.py:send_request():406] send_request: summary_record
109
  2024-04-12 07:28:13,812 INFO SenderThread:216 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
110
  2024-04-12 07:28:14,043 INFO Thread-12 :216 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240412_072736-gyl3aena/files/wandb-summary.json
 
 
108
  2024-04-12 07:28:13,812 DEBUG SenderThread:216 [sender.py:send_request():406] send_request: summary_record
109
  2024-04-12 07:28:13,812 INFO SenderThread:216 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
110
  2024-04-12 07:28:14,043 INFO Thread-12 :216 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240412_072736-gyl3aena/files/wandb-summary.json
111
+ 2024-04-12 07:28:18,814 DEBUG HandlerThread:216 [handler.py:handle_request():146] handle_request: status_report
wandb/run-20240412_072736-gyl3aena/logs/debug-internal.log CHANGED
@@ -108,3 +108,4 @@ subprocess.TimeoutExpired: Command '['conda', 'env', 'export']' timed out after
108
  2024-04-12 07:28:13,812 DEBUG SenderThread:216 [sender.py:send_request():406] send_request: summary_record
109
  2024-04-12 07:28:13,812 INFO SenderThread:216 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
110
  2024-04-12 07:28:14,043 INFO Thread-12 :216 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240412_072736-gyl3aena/files/wandb-summary.json
 
 
108
  2024-04-12 07:28:13,812 DEBUG SenderThread:216 [sender.py:send_request():406] send_request: summary_record
109
  2024-04-12 07:28:13,812 INFO SenderThread:216 [sender.py:_save_file():1390] saving file wandb-summary.json with policy end
110
  2024-04-12 07:28:14,043 INFO Thread-12 :216 [dir_watcher.py:_on_file_modified():288] file/dir modified: /kaggle/working/wandb/run-20240412_072736-gyl3aena/files/wandb-summary.json
111
+ 2024-04-12 07:28:18,814 DEBUG HandlerThread:216 [handler.py:handle_request():146] handle_request: status_report