shreyask commited on
Commit
9b16ef9
·
verified ·
1 Parent(s): 2e7453f

Upload folder using huggingface_hub

Browse files
Files changed (1) hide show
  1. index.html +2 -2
index.html CHANGED
@@ -552,7 +552,7 @@
552
  </div>
553
  <div class="stage-detail"><div class="stage-detail-inner">
554
  <p>Uses <strong>causal attention</strong>: each audio frame only attends to itself and past frames — never future ones. This is what makes streaming possible.</p>
555
- <p><strong>Sliding window attention</strong> (window = 750, ~60s of audio) bounds memory regardless of audio length.</p>
556
  <div class="key-point">Traditional encoders (like Whisper's) use bidirectional attention — every frame sees every other frame. That gives better accuracy but requires full audio upfront. Causal attention trades a small amount of accuracy for the ability to stream.</div>
557
  </div></div>
558
  </div>
@@ -689,7 +689,7 @@
689
 
690
  <div class="window-viz">
691
  <div class="window-info">
692
- <span class="wi-label">Encoder window: <span class="wi-value">750 frames</span> (~60s)</span>
693
  <span class="wi-label">Position: <span class="wi-value" id="window-pos">frame 8</span></span>
694
  </div>
695
 
 
552
  </div>
553
  <div class="stage-detail"><div class="stage-detail-inner">
554
  <p>Uses <strong>causal attention</strong>: each audio frame only attends to itself and past frames — never future ones. This is what makes streaming possible.</p>
555
+ <p><strong>Sliding window attention</strong> (window = 750, ~15s of audio) bounds memory regardless of audio length.</p>
556
  <div class="key-point">Traditional encoders (like Whisper's) use bidirectional attention — every frame sees every other frame. That gives better accuracy but requires full audio upfront. Causal attention trades a small amount of accuracy for the ability to stream.</div>
557
  </div></div>
558
  </div>
 
689
 
690
  <div class="window-viz">
691
  <div class="window-info">
692
+ <span class="wi-label">Encoder window: <span class="wi-value">750 frames</span> (~15s)</span>
693
  <span class="wi-label">Position: <span class="wi-value" id="window-pos">frame 8</span></span>
694
  </div>
695