kangdawei commited on
Commit
87bcc7c
·
verified ·
1 Parent(s): 9353bab

Training in progress, step 150

Browse files
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:773c0a566c1e6add5b7a6899194cd4f171ee0e08ec0295b3e23e2e5d59426cb0
3
  size 3554214752
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:322077610c9bdb9d414492ab3e5c04da436e67c5a0c925c9296c6b44b093c759
3
  size 3554214752
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b40065eb6d330d2ae6805956a0d90c3ef0e3239ed74a0e3bbb91205984fc361e
3
- size 86002529
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:13b12e988c7f82da5a3582da570201393d2f5b9a7b51816f8070b90efbf012d4
3
+ size 128894049
reward_plots/advantage_plot_step_100.png ADDED
reward_plots/advantage_plot_step_110.png ADDED
reward_plots/advantage_plot_step_120.png ADDED
reward_plots/advantage_plot_step_130.png ADDED
reward_plots/advantage_plot_step_140.png ADDED
reward_plots/reward_comparison_step_100.png ADDED
reward_plots/reward_comparison_step_110.png ADDED
reward_plots/reward_comparison_step_120.png ADDED
reward_plots/reward_comparison_step_130.png ADDED
reward_plots/reward_comparison_step_140.png ADDED