kangdawei commited on
Commit
f9607e1
·
verified ·
1 Parent(s): 71968b0

Training in progress, step 450

Browse files
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ca9ac9b7cf788e2ca85dd3771f807df3fbc70bfb097391951c60f8172f5a8a78
3
  size 3554214752
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0d94c38d1037e5f95ae18ba29df7dcf4790ccf610274dc856f289adcb124f2b3
3
  size 3554214752
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e89971657aaba95c248c0f26b1e40b995ef0c95ebe2e7cbe8583a92efecf7372
3
- size 351535981
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:258ed499daa896f9252b0187eca18ef9dd254a54c29b374013d999d42c2847bd
3
+ size 373789979
reward_plots/advantage_plot_step_400.png ADDED
reward_plots/advantage_plot_step_410.png ADDED
reward_plots/advantage_plot_step_420.png ADDED
reward_plots/advantage_plot_step_430.png ADDED
reward_plots/advantage_plot_step_440.png ADDED
reward_plots/reward_comparison_step_400.png ADDED
reward_plots/reward_comparison_step_410.png ADDED
reward_plots/reward_comparison_step_420.png ADDED
reward_plots/reward_comparison_step_430.png ADDED
reward_plots/reward_comparison_step_440.png ADDED
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7d2df5c310919447c5191424f28635aa0563964fd2324fa4296051eb0058c372
3
  size 8504
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:eeeb1d3e2e07bbcb3f52d1585f334319bee057838a5a9b3c97a3734919cf1404
3
  size 8504