Add license, pipeline tag, and library name metadata
Browse filesThis PR updates the model card by adding essential metadata:
- The `license: apache-2.0` for proper attribution and legal clarity, verified via the linked PyPI project page.
- The `pipeline_tag: audio-to-audio` to correctly categorize the model and improve discoverability on the Hugging Face Hub.
- The `library_name: dualcodec`, as evidenced by the programmatic usage shown in the README (e.g., `import dualcodec`), enabling automated code snippets on the Hub.
README.md
CHANGED
|
@@ -1,4 +1,8 @@
|
|
| 1 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
| 2 |
|
| 3 |
# DualCodec: A Low-Frame-Rate, Semantically-Enhanced Neural Audio Codec for Speech Generation
|
| 4 |
|
|
@@ -194,7 +198,10 @@ data.segment_speech.segment_length=24000
|
|
| 194 |
|
| 195 |
```
|
| 196 |
|
|
|
|
| 197 |
|
|
|
|
|
|
|
| 198 |
|
| 199 |
## Finetuning DualCodec
|
| 200 |
1. Install other necessary components for training:
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
+
pipeline_tag: audio-to-audio
|
| 4 |
+
library_name: dualcodec
|
| 5 |
+
---
|
| 6 |
|
| 7 |
# DualCodec: A Low-Frame-Rate, Semantically-Enhanced Neural Audio Codec for Speech Generation
|
| 8 |
|
|
|
|
| 198 |
|
| 199 |
```
|
| 200 |
|
| 201 |
+
Below are example training loss curves for a DualCodec 25hz model:
|
| 202 |
|
| 203 |
+

|
| 204 |
+

|
| 205 |
|
| 206 |
## Finetuning DualCodec
|
| 207 |
1. Install other necessary components for training:
|