Spaces:
Sleeping
Sleeping
Update app.py
Browse files
app.py
CHANGED
|
@@ -102,7 +102,23 @@ This method introduces sampling control along three key dimensions:
|
|
| 102 |
Among these, the novel **reasoning depth `H`** plays a critical role: by sampling outputs at different depths of partially completed reasoning chains, the model creates multiple sets of "fragmented thoughts + solutions," which are then jointly evaluated to select the most trustworthy outcome.
|
| 103 |
""")
|
| 104 |
gr.Image("figs/frac-frame.png", label="Framework", show_label=False, elem_id="my-img")
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 105 |
gr.Image("figs/single.png", label="Framework", show_label=False, elem_id="my-img")
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 106 |
gr.Image("figs/combine.png", label="Framework", show_label=False, elem_id="my-img")
|
| 107 |
gr.Markdown(
|
| 108 |
"""
|
|
|
|
| 102 |
Among these, the novel **reasoning depth `H`** plays a critical role: by sampling outputs at different depths of partially completed reasoning chains, the model creates multiple sets of "fragmented thoughts + solutions," which are then jointly evaluated to select the most trustworthy outcome.
|
| 103 |
""")
|
| 104 |
gr.Image("figs/frac-frame.png", label="Framework", show_label=False, elem_id="my-img")
|
| 105 |
+
gr.Markdown(
|
| 106 |
+
"""
|
| 107 |
+
### 🔍 Scaling Analysis of *n*, *m*, and *H* in DeepSeek-R1 Models
|
| 108 |
+
|
| 109 |
+
A detailed test-time scaling analysis on the DeepSeek-R1 series reveals the individual impact of the three sampling dimensions: `n` (number of reasoning paths), `m` (number of answers per path), and `H` (depth-wise reasoning samples).
|
| 110 |
+
|
| 111 |
+
Across multiple reasoning benchmarks, the results show that increasing **`H` — sampling across reasoning depths — yields the highest cost-effectiveness**. That is, sampling more intermediate answers along the depth of a single reasoning path leads to **greater accuracy improvements with fewer additional tokens**, compared to simply increasing the number of paths (`n`) or answers (`m`).
|
| 112 |
+
""")
|
| 113 |
gr.Image("figs/single.png", label="Framework", show_label=False, elem_id="my-img")
|
| 114 |
+
gr.Markdown(
|
| 115 |
+
"""
|
| 116 |
+
### 🔄 Joint Sampling of *n*, *m*, and *H* for Enhanced Accuracy
|
| 117 |
+
|
| 118 |
+
In practical scenarios, the sampling dimensions `n`, `m`, and `H` can be **jointly optimized** rather than tuned in isolation. By **dynamically allocating the sampling budget across these dimensions**, the model can significantly enhance its reasoning accuracy.
|
| 119 |
+
|
| 120 |
+
This joint sampling strategy leverages the complementary strengths of each dimension—diversity (`n`), redundancy (`m`), and depth-awareness (`H`)—to achieve robust performance under a fixed token budget.
|
| 121 |
+
""")
|
| 122 |
gr.Image("figs/combine.png", label="Framework", show_label=False, elem_id="my-img")
|
| 123 |
gr.Markdown(
|
| 124 |
"""
|