Homie0609
/

MatchTime

@@ -1,14 +1,39 @@
 ---
-license: cc-by-sa-4.0
 datasets:
 - Homie0609/MatchTime
 language:
 - en
 tags:
 - sports
 - soccer
 ---
 ## Requirements
 - Python >= 3.8 (Recommend to use [Anaconda](https://www.anaconda.com/download/#linux) or [Miniconda](https://docs.conda.io/en/latest/miniconda.html))
 - [PyTorch >= 2.0.0](https://pytorch.org/) (If use A100)
@@ -53,6 +78,8 @@ with the format of features is adjusted by
 ```
 python ./features/preprocess.py directory_path_of_feature
 ```
 After preparing the data and features, you can pre-train (or finetune) with the following terminal command (Check hyper-parameters at the bottom of *train.py*):
 ```
 python train.py
@@ -134,4 +161,29 @@ python ./evaluation/scoer_single.py --csv_path ./inference_result/sample.csv
 python ./evaluation/scoer_group.py
 # for gpt score (need OpenAI API Key)
 python ./evaluation/scoer_gpt.py ./inference_result/sample.csv
-```

 ---
 datasets:
 - Homie0609/MatchTime
 language:
 - en
+license: cc-by-sa-4.0
 tags:
 - sports
 - soccer
+pipeline_tag: video-text-to-text
+library_name: transformers
 ---
+# Commentary Generation for Soccer Highlights
+This repository contains the code and model for **Commentary Generation for Soccer Highlights**, as presented in our paper:
+**[Commentary Generation for Soccer Highlights](https://huggingface.co/papers/2508.07543)**
+## Abstract
+Automated soccer commentary generation has evolved from template-based systems to advanced neural architectures, aiming to produce real-time descriptions of sports events. While frameworks like SoccerNet-Caption laid foundational work, their inability to achieve fine-grained alignment between video content and commentary remains a significant challenge. Recent efforts such as MatchTime, with its MatchVoice model, address this issue through coarse and fine-grained alignment techniques, achieving improved temporal synchronization. In this paper, we extend MatchVoice to commentary generation for soccer highlights using the GOAL dataset, which emphasizes short clips over entire games. We conduct extensive experiments to reproduce the original MatchTime results and evaluate our setup, highlighting the impact of different training configurations and hardware limitations. Furthermore, we explore the effect of varying window sizes on zero-shot performance. While MatchVoice exhibits promising generalization capabilities, our findings suggest the need for integrating techniques from broader video-language domains to further enhance performance.
+<div align="center">
+[\u25b6\ufe0fDemo Video (YouTube)](https://www.youtube.com/watch?v=E3RxHR-M6y0) [\u25b6\ufe0fDemo Video (bilibili)](https://www.bilibili.com/video/BV1L4421U76m) \u00b7 [\ud83c\udfe0Project Page](https://haoningwu3639.github.io/MatchTime/) \u00b7 [\ud83d\udcbbCode](https://github.com/Homie0609/MatchTime) \u00b7 [\ud83d\udcddOriginal Paper (MatchTime)](https://arxiv.org/abs/2406.18530/) \u00b7 [\ud83d\udccaDataset](https://drive.google.com/drive/folders/14tb6lV2nlTxn3VygwAPdmtKm7v0Ss8wG) \u00b7 [\ud83d\udce5Checkpoint](https://huggingface.co/Homie0609/MatchVoice)
+</div>
+<div align="center">
+   <img src="https://github.com/Homie0609/MatchTime/raw/main/assets/teaser.png">
+</div>
+<div align="center">
+   <img src="https://github.com/Homie0609/MatchTime/raw/main/assets/commentary.png">
+</div>
 ## Requirements
 - Python >= 3.8 (Recommend to use [Anaconda](https://www.anaconda.com/download/#linux) or [Miniconda](https://docs.conda.io/en/latest/miniconda.html))
 - [PyTorch >= 2.0.0](https://pytorch.org/) (If use A100)
 ```
 python ./features/preprocess.py directory_path_of_feature
 ```
+Above example gives the format of Baidu feature, in our experiments we also used ResNET_PCA_512, C3D_PCA_512 from official website. If you want to use [CLIP](https://github.com/openai/CLIP)(2 FPS) or [InternVideo](https://github.com/OpenGVLab/InternVideo/tree/main/InternVideo1)(1FPS) feature. You can follow their official website to extract feature or contact us for features.
 After preparing the data and features, you can pre-train (or finetune) with the following terminal command (Check hyper-parameters at the bottom of *train.py*):
 ```
 python train.py
 python ./evaluation/scoer_group.py
 # for gpt score (need OpenAI API Key)
 python ./evaluation/scoer_gpt.py ./inference_result/sample.csv
+```
+## Citation
+If you use this code for your research or project, please cite:
+```bibtex
+@article{rao2024matchtimeautomaticsoccergame,
+         title={MatchTime: Towards Automatic Soccer Game Commentary Generation},
+         author={Jiayuan Rao and Haoning Wu and Chang Liu and Yanfeng Wang and Weidi Xie},
+         year={2024},
+         journal={arXiv preprint arXiv:2406.18530},
+      }
+@article{rao2024commentary,
+  title={Commentary Generation for Soccer Highlights},
+  author={Rao, Jiayuan and Wu, Haoning and Liu, Chang and Wang, Yanfeng and Xie, Weidi},
+  journal={arXiv preprint arXiv:2508.07543},
+  year={2024},
+}
+```
+## Acknowledgements
+Many thanks to the code bases from [Video-LLaMA](https://github.com/DAMO-NLP-SG/Video-LLaMA) and source data from [SoccerNet-Caption](https://arxiv.org/abs/2304.04565).
+## Contact
+If you have any questions, please feel free to contact jy_rao@sjtu.edu.cn or haoningwu3639@gmail.com.