Upload folder using huggingface_hub
Browse files- index.html +2 -2
index.html
CHANGED
|
@@ -552,7 +552,7 @@
|
|
| 552 |
</div>
|
| 553 |
<div class="stage-detail"><div class="stage-detail-inner">
|
| 554 |
<p>Uses <strong>causal attention</strong>: each audio frame only attends to itself and past frames — never future ones. This is what makes streaming possible.</p>
|
| 555 |
-
<p><strong>Sliding window attention</strong> (window = 750, ~
|
| 556 |
<div class="key-point">Traditional encoders (like Whisper's) use bidirectional attention — every frame sees every other frame. That gives better accuracy but requires full audio upfront. Causal attention trades a small amount of accuracy for the ability to stream.</div>
|
| 557 |
</div></div>
|
| 558 |
</div>
|
|
@@ -689,7 +689,7 @@
|
|
| 689 |
|
| 690 |
<div class="window-viz">
|
| 691 |
<div class="window-info">
|
| 692 |
-
<span class="wi-label">Encoder window: <span class="wi-value">750 frames</span> (~
|
| 693 |
<span class="wi-label">Position: <span class="wi-value" id="window-pos">frame 8</span></span>
|
| 694 |
</div>
|
| 695 |
|
|
|
|
| 552 |
</div>
|
| 553 |
<div class="stage-detail"><div class="stage-detail-inner">
|
| 554 |
<p>Uses <strong>causal attention</strong>: each audio frame only attends to itself and past frames — never future ones. This is what makes streaming possible.</p>
|
| 555 |
+
<p><strong>Sliding window attention</strong> (window = 750, ~15s of audio) bounds memory regardless of audio length.</p>
|
| 556 |
<div class="key-point">Traditional encoders (like Whisper's) use bidirectional attention — every frame sees every other frame. That gives better accuracy but requires full audio upfront. Causal attention trades a small amount of accuracy for the ability to stream.</div>
|
| 557 |
</div></div>
|
| 558 |
</div>
|
|
|
|
| 689 |
|
| 690 |
<div class="window-viz">
|
| 691 |
<div class="window-info">
|
| 692 |
+
<span class="wi-label">Encoder window: <span class="wi-value">750 frames</span> (~15s)</span>
|
| 693 |
<span class="wi-label">Position: <span class="wi-value" id="window-pos">frame 8</span></span>
|
| 694 |
</div>
|
| 695 |
|