vwxyzjn commited on
Commit
b7bfe11
·
verified ·
1 Parent(s): e146e04

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -151,7 +151,7 @@ See the Falcon 180B model card for an example of this.
151
  ## Hyperparamters
152
 
153
  DPO:
154
- - **Learning Rate**: 5 × 10⁻⁷ (8B), 2.0e-7 (70B), 2.0e-7 (405B)
155
  - **Learning Rate Schedule**: Linear
156
  - **Batch Size (effective)**: 32 (8B), 128 (70B), 256(405B)
157
  - **KL Penalty Coefficient**: 5
 
151
  ## Hyperparamters
152
 
153
  DPO:
154
+ - **Learning Rate**: 5 × 10⁻⁷ (8B), 2.0e-7 (70B, 405B)
155
  - **Learning Rate Schedule**: Linear
156
  - **Batch Size (effective)**: 32 (8B), 128 (70B), 256(405B)
157
  - **KL Penalty Coefficient**: 5