kangdawei commited on
Commit
4f585b0
·
verified ·
1 Parent(s): 64465dc

Training in progress, step 500

Browse files
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:813b0dd03877615b6f2f84ed5b82b88b3d24d51aafcff4d7cf65a907d6325661
3
  size 3554214752
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1d6fbfdb985f0963b7756a2b147ba2ea0e03076bd8c5c8bd8ec9b3626d7ff79e
3
  size 3554214752
reward_data/all_rewards.csv CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a96469d6a1046dd951cc1167049ebcee436de885d302366eb201b0ed86745f04
3
- size 250628068
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f3cfb130e217256af8ede1e221b11bc1fda6425330daf8871266a8f2863d6d38
3
+ size 301803451
reward_plots/advantage_plot_step_400.png ADDED
reward_plots/advantage_plot_step_410.png ADDED
reward_plots/advantage_plot_step_420.png ADDED
reward_plots/advantage_plot_step_430.png ADDED
reward_plots/advantage_plot_step_440.png ADDED
reward_plots/advantage_plot_step_450.png ADDED
reward_plots/advantage_plot_step_460.png ADDED
reward_plots/advantage_plot_step_470.png ADDED
reward_plots/advantage_plot_step_480.png ADDED
reward_plots/advantage_plot_step_490.png ADDED
reward_plots/reward_comparison_step_400.png ADDED
reward_plots/reward_comparison_step_410.png ADDED
reward_plots/reward_comparison_step_420.png ADDED
reward_plots/reward_comparison_step_430.png ADDED
reward_plots/reward_comparison_step_440.png ADDED
reward_plots/reward_comparison_step_450.png ADDED
reward_plots/reward_comparison_step_460.png ADDED
reward_plots/reward_comparison_step_470.png ADDED
reward_plots/reward_comparison_step_480.png ADDED
reward_plots/reward_comparison_step_490.png ADDED