kangdawei commited on
Commit
38bcf97
·
verified ·
1 Parent(s): b835560

Training in progress, step 500

Browse files
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:dfb32defe67a1bf746506764f27ca89af65a2d080badcf26c9d62676bedfe196
3
  size 3554214752
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f10d42fed7fcb69070a0212fcd360eaa357731fb7c627f319b72dc47b7626fe4
3
  size 3554214752
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f6f1a0f1a2fc53284027595e903dc5aaaa60248643bda0387bc062a391aa8763
3
- size 242104661
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9783635808d02f6cd8aa625971711b1f2a053bbecfd8b26e1357800670478f7d
3
+ size 284384172
reward_plots/advantage_plot_step_400.png ADDED
reward_plots/advantage_plot_step_410.png ADDED
reward_plots/advantage_plot_step_420.png ADDED
reward_plots/advantage_plot_step_430.png ADDED
reward_plots/advantage_plot_step_440.png ADDED
reward_plots/advantage_plot_step_450.png ADDED
reward_plots/advantage_plot_step_460.png ADDED
reward_plots/advantage_plot_step_470.png ADDED
reward_plots/advantage_plot_step_480.png ADDED
reward_plots/advantage_plot_step_490.png ADDED
reward_plots/reward_comparison_step_400.png ADDED
reward_plots/reward_comparison_step_410.png ADDED
reward_plots/reward_comparison_step_420.png ADDED
reward_plots/reward_comparison_step_430.png ADDED
reward_plots/reward_comparison_step_440.png ADDED
reward_plots/reward_comparison_step_450.png ADDED
reward_plots/reward_comparison_step_460.png ADDED
reward_plots/reward_comparison_step_470.png ADDED
reward_plots/reward_comparison_step_480.png ADDED
reward_plots/reward_comparison_step_490.png ADDED