kangdawei commited on
Commit
b835560
·
verified ·
1 Parent(s): 87bd1fb

Training in progress, step 400

Browse files
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:60013fe7d1d4781636a042f0dee13e46431f7387b2b66c33f26f6504774fff6d
3
  size 3554214752
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:dfb32defe67a1bf746506764f27ca89af65a2d080badcf26c9d62676bedfe196
3
  size 3554214752
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2ab0bfd655b1e510d461da48edde13b76c1394e9d8a2ac86d03cbe41ba226b31
3
- size 223268759
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f6f1a0f1a2fc53284027595e903dc5aaaa60248643bda0387bc062a391aa8763
3
+ size 242104661
reward_plots/advantage_plot_step_350.png ADDED
reward_plots/advantage_plot_step_360.png ADDED
reward_plots/advantage_plot_step_370.png ADDED
reward_plots/advantage_plot_step_380.png ADDED
reward_plots/advantage_plot_step_390.png ADDED
reward_plots/reward_comparison_step_350.png ADDED
reward_plots/reward_comparison_step_360.png ADDED
reward_plots/reward_comparison_step_370.png ADDED
reward_plots/reward_comparison_step_380.png ADDED
reward_plots/reward_comparison_step_390.png ADDED