---
license: apache-2.0
library_name: torch
base_model:
- microsoft/wavlm-large
pipeline_tag: audio-to-audio
---

# ⚡ FocalCodec

A low-bitrate single-codebook 16 / 24 kHz speech codec based on [focal modulation](https://arxiv.org/abs/2203.11926).

This repository contains the **50 Hz causal checkpoint with a codebook size of 65536** trained on **Libri-Light**, as described in the preprints.
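As a rough sanity check on "low-bitrate", the nominal bitrate implied by these numbers can be worked out directly. The sketch below is not part of the FocalCodec API; `bitrate_bps` is a hypothetical helper, and it assumes each token is stored as a fixed-length index with no entropy coding.

```python
import math


def bitrate_bps(token_rate_hz: float, codebook_size: int, num_codebooks: int = 1) -> float:
    """Nominal bitrate in bits per second for a token-based codec.

    Assumes fixed-length coding: each token costs log2(codebook_size) bits.
    """
    bits_per_token = math.log2(codebook_size)
    return token_rate_hz * bits_per_token * num_codebooks


# 50 tokens per second, a single codebook with 65536 entries (16 bits per token)
print(bitrate_bps(50, 65536))  # 800.0
```

Under that assumption, this checkpoint works out to 800 bits per second, or 0.8 kbps.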

- 📜 **Preprints**:

    - [FocalCodec: Low-Bitrate Speech Coding via Focal Modulation Networks](https://arxiv.org/abs/2502.04465)

    - [FocalCodec-Stream: Streaming Low-Bitrate Speech Coding via Causal Distillation](https://arxiv.org/abs/2509.16195)

- 🌐 **Project Page**: https://lucadellalib.github.io/focalcodec-web/

- 💾 **GitHub**: https://github.com/lucadellalib/focalcodec

<img src="focalcodec-stream.png" width="700">

---------------------------------------------------------------------------------------------------------

## ▶️ Quickstart

See the README in the GitHub repository: https://github.com/lucadellalib/focalcodec

---------------------------------------------------------------------------------------------------------

## @ Citing

```
@article{dellalibera2025focalcodec,
    title   = {{FocalCodec}: Low-Bitrate Speech Coding via Focal Modulation Networks},
    author  = {Luca {Della Libera} and Francesco Paissan and Cem Subakan and Mirco Ravanelli},
    journal = {arXiv preprint arXiv:2502.04465},
    year    = {2025},
}

@article{dellalibera2025focalcodecstream,
    title   = {{FocalCodec-Stream}: Streaming Low-Bitrate Speech Coding via Causal Distillation},
    author  = {Luca {Della Libera} and Cem Subakan and Mirco Ravanelli},
    journal = {arXiv preprint arXiv:2509.16195},
    year    = {2025},
}
```

---------------------------------------------------------------------------------------------------------

## 📧 Contact

[luca.dellalib@gmail.com](mailto:luca.dellalib@gmail.com)

---------------------------------------------------------------------------------------------------------