--- license: apache-2.0 language: - zh base_model: - hfl/chinese-lert-base tags: - punctuation-restoration ---

FireRedChat-punc

DemoFireRedChat PaperHuggingface
## Descriptions FireRedChat-punc is a fine-tuned `hfl/chinese-lert-base` model designed for punctuation restoration, primarily for post-processing in [FireRedASR](https://github.com/FireRedTeam/FireRedASR). The model restores the following punctuation marks: [, 。 ? !]. It supports both Chinese and English text, enhancing the readability of transcribed text. ## Roadmap - [x] 2025/09 - [x] Release the fine-tuned punctuation restoration model. ## Usage RedPost source code [github](https://github.com/FireRedTeam/FireRedChat/tree/main/fireredasr-server/server/redpost) Below is an example of how to use the FireRedChat-punc model for punctuation restoration: ```bash git clone https://huggingface.co/hfl/chinese-lert-base FireRedChat-punc/chinese-lert-base ``` ```python import os from redpost import RedPost, RedPostConfig punc_model_dir = "./FireRedChat-punc" post_config = RedPostConfig( use_gpu=True, sentence_max_length=30 ) post_model = RedPost.from_pretrained(punc_model_dir, post_config) batch_post_results = post_model.process([text], ["text"]) text = "".join([r["punc_text"] for r in batch_post_results]) text = re.sub("||\[unk\]|\[UNK\]", "", text) print(text) ``` ## Use with FireRedASR This punctuation restoration model can be used together with FireRedASR, refer to [fireredasr-server](https://github.com/FireRedTeam/FireRedChat/blob/main/fireredasr-server/README.md) for setup instructions. https://github.com/FireRedTeam/FireRedChat/tree/main/fireredasr-server ## License The model and source code are licensed under the Apache-2.0 license. ### Acknowledgment - Base model: `hfl/chinese-lert-base` (license: apache-2.0) - Designed for integration with [FireRedASR](https://github.com/FireRedTeam/FireRedASR).