hamishivi committed on
Commit 2dab518 · verified · 1 Parent(s): 10509a2

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -153,7 +153,7 @@ See the Falcon 180B model card for an example of this.
  DPO:
  - **Learning Rate**: 5 × 10⁻⁷ (8B), 2.0e-7 (70B, 405B)
  - **Learning Rate Schedule**: Linear
- - **Batch Size (effective)**: 32 (8B), 128 (70B), 256(405B)
+ - **Batch Size (effective)**: 128 (8B), 128 (70B), 256(405B)
  - **KL Penalty Coefficient**: 5
  - **Warm-up Ratio**: 0.1
  - **Max Sequence Length**: 2,048
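
The changed line reports an *effective* batch size. This is commonly understood as the number of examples contributing to one optimizer step across all devices and gradient-accumulation steps. The sketch below illustrates that arithmetic. The function names, the per-device batch size of 1, and the device count of 8 are hypothetical values chosen for illustration; the README does not state the actual hardware layout or accumulation settings.

```python
def effective_batch_size(per_device_batch_size: int,
                         gradient_accumulation_steps: int,
                         num_devices: int) -> int:
    """Examples consumed per optimizer step across all devices."""
    return per_device_batch_size * gradient_accumulation_steps * num_devices


def accumulation_steps_for(target_effective: int,
                           per_device_batch_size: int,
                           num_devices: int) -> int:
    """Gradient-accumulation steps needed to reach a target effective batch size."""
    per_forward_pass = per_device_batch_size * num_devices
    if target_effective % per_forward_pass != 0:
        raise ValueError(
            f"target {target_effective} is not a multiple of "
            f"{per_forward_pass} examples per forward pass"
        )
    return target_effective // per_forward_pass


# Hypothetical layout: 8 devices, 1 example per device per forward pass.
# Reaching the updated 8B value of 128 would then take 16 accumulation steps.
steps = accumulation_steps_for(target_effective=128,
                               per_device_batch_size=1,
                               num_devices=8)
print(steps, effective_batch_size(1, steps, 8))
```

Under the same hypothetical layout, the previously listed 8B value of 32 would correspond to 4 accumulation steps, so the correction changes the implied accumulation setting by a factor of four.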