FireRedTeam/FireRedChat-punc (punctuation-restoration)

FireRedTeam committed on Sep 22, 2025

Commit d9f70c4 (verified)

1 Parent(s): 158770a

Update README.md

Files changed (1)

README.md +45 -3

@@ -4,9 +4,51 @@ language:
 - zh
 base_model:
 - hfl/chinese-lert-base
 ---
-## FireRedChat-punc
-This is a chinese-lert-base model finetuned for punctuation restoration, release mainly for [FireRedASR](https://github.com/FireRedTeam/FireRedASR) postprocessing.
-Model restores the following punctuations -- [， 。 ？ ！]

 - zh
 base_model:
 - hfl/chinese-lert-base
+tags:
+- punctuation-restoration
 ---
+<div align="center">
+<h1>FireRedChat-punc</h1>
+</div>
+<div align="center">
+  <a href="https://fireredteam.github.io/demos/firered_chat/">Demo</a> •
+  <a href="https://arxiv.org/pdf/2509.06502">FireRedChat Paper</a> •
+  <a href="https://huggingface.co/FireRedTeam">Hugging Face</a>
+</div>
+## Description
+FireRedChat-punc is a fine-tuned `hfl/chinese-lert-base` model designed for punctuation restoration, primarily for post-processing in [FireRedASR](https://github.com/FireRedTeam/FireRedASR).
+The model restores the following punctuation marks: [， 。 ？ ！]. It supports both Chinese and English text, enhancing the readability of transcribed text.
+## Roadmap
+- [x] 2025/09
+  - [x] Release the fine-tuned punctuation restoration model.
+## Usage
+The RedPost source code is available on [GitHub](https://github.com/FireRedTeam/FireRedChat/tree/main/fireredasr-server/server/redpost).
+Below is an example of how to use the FireRedChat-punc model for punctuation restoration:
+```python
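+import re
+
+from redpost import RedPost, RedPostConfig
+
+# Path to the downloaded FireRedChat-punc model directory
+punc_model_dir = "FireRedChat-punc"
+post_config = RedPostConfig(
+    use_gpu=True,
+    sentence_max_length=30
+)
+post_model = RedPost.from_pretrained(punc_model_dir, post_config)
+
+# Unpunctuated input such as an ASR transcript (placeholder sample, not from the original README)
+text = "今天天气怎么样我们去公园散步吧"
+batch_post_results = post_model.process([text], ["text"])
+text = "".join([r["punc_text"] for r in batch_post_results])
+# Strip unknown-token placeholders from the punctuated output
+text = re.sub(r"<unk>|<UNK>|\[unk\]|\[UNK\]", "", text)
+print(text)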
+```
+## License
+The model and source code are licensed under the Apache-2.0 license.
+### Acknowledgment
+- Base model: `hfl/chinese-lert-base` (license: apache-2.0)
+- Designed for integration with [FireRedASR](https://github.com/FireRedTeam/FireRedASR).
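
The last step of the README's usage example strips unknown-token placeholders from the punctuated text with a regular expression. A minimal standalone sketch of that step, using only the standard library, is shown here. The names `UNK_PATTERN` and `strip_unk` and the sample string are illustrative assumptions, not part of RedPost.

```python
import re

# Same alternation as the cleanup line in the README example, as a raw string.
# UNK_PATTERN and strip_unk are hypothetical names used only for this sketch.
UNK_PATTERN = re.compile(r"<unk>|<UNK>|\[unk\]|\[UNK\]")


def strip_unk(text: str) -> str:
    """Remove unknown-token placeholders from punctuated output."""
    return UNK_PATTERN.sub("", text)


if __name__ == "__main__":
    # Made-up sample imitating model output that contains placeholders.
    sample = "今天天气很好，<unk>我们去公园吧。[UNK]"
    print(strip_unk(sample))
```

Compiling the pattern once avoids re-parsing it on every call, which may matter when many transcripts are post-processed in a loop.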