Text Classification
Transformers
PyTorch
English
llama
text-generation-inference
saumyamalik commited on
Commit
00536ee
·
verified ·
1 Parent(s): 59082c9

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -17,7 +17,7 @@ library_name: transformers
17
  Llama-3.1-Tulu-3-8B-RL-RM-RB2 is one of 7 sets of reward models (RMs) released with Reward Bench 2.
18
  We have released a large set of 70 total reward model checkpoints that we used to develop the benchmark and correlate it with downstream PPO / Best-of-N performance.
19
 
20
- [Models](https://huggingface.co/collections/allenai/reward-bench-2-683d2612a4b3e38a3e53bb51) | [Code](https://github.com/allenai/reward-bench) | [Eval. Dataset v2](https://huggingface.co/datasets/allenai/reward-bench-2) | [Results v2](https://huggingface.co/datasets/allenai/reward-bench-2-results) | [Paper](https://github.com/allenai/reward-bench/blob/main/paper-v2.pdf)
21
 
22
 
23
  ## Model Details
 
17
  Llama-3.1-Tulu-3-8B-RL-RM-RB2 is one of 7 sets of reward models (RMs) released with Reward Bench 2.
18
  We have released a large set of 70 total reward model checkpoints that we used to develop the benchmark and correlate it with downstream PPO / Best-of-N performance.
19
 
20
+ [Models](https://huggingface.co/collections/allenai/reward-bench-2-683d2612a4b3e38a3e53bb51) | [Code](https://github.com/allenai/reward-bench) | [Eval. Dataset v2](https://huggingface.co/datasets/allenai/reward-bench-2) | [Results v2](https://huggingface.co/datasets/allenai/reward-bench-2-results) | [Paper](https://arxiv.org/abs/2506.01937)
21
 
22
 
23
  ## Model Details