Update README.md
Browse files
README.md
CHANGED
|
@@ -30,6 +30,10 @@ pipeline_tag: text-generation
|
|
| 30 |
- Space Demo: [https://huggingface.co/spaces/llm-blender/LLM-Blender](https://huggingface.co/spaces/llm-blender/LLM-Blender)
|
| 31 |
|
| 32 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 33 |
## Introduction
|
| 34 |
|
| 35 |
Pairwise Reward Model (PairRM) takes an instruction and a **pair** of output candidates as the input,
|
|
|
|
| 30 |
- Space Demo: [https://huggingface.co/spaces/llm-blender/LLM-Blender](https://huggingface.co/spaces/llm-blender/LLM-Blender)
|
| 31 |
|
| 32 |
|
| 33 |
+
## News
|
| 34 |
+
|
| 35 |
+
- Check out our results on AlpacaEval leaderboard: [Twitter](https://x.com/billyuchenlin/status/1732198787354067380?s=20) [Leaderboard](https://tatsu-lab.github.io/alpaca_eval/)
|
| 36 |
+
|
| 37 |
## Introduction
|
| 38 |
|
| 39 |
Pairwise Reward Model (PairRM) takes an instruction and a **pair** of output candidates as the input,
|