BadToBest
/

EchoMimicV2

Model card Files Files and versions

xet

Community

lymhust commited on Jan 6

Commit

8648078

verified ·

1 Parent(s): e96d2f2

Update README.md

Browse files

Files changed (1) hide show

README.md +26 -6

README.md CHANGED Viewed

@@ -31,7 +31,9 @@ Terminal Technology Department, Alipay, Ant Group.
 * EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation. [GitHub](https://github.com/antgroup/echomimic_v2)
 ## &#x1F4E3; Updates
-* [2024.11.27] 🔥 Thanks [AiMotionStudio](https://www.youtube.com/@AiMotionStudio) for the [installation tutorial](https://www.youtube.com/watch?v=2ab6U1-nVTQ).
 * [2024.11.22] 🔥 [GradioUI](https://github.com/antgroup/echomimic_v2/blob/main/app.py) is now available. Thanks @gluttony-10 for the contribution.
 * [2024.11.22] 🔥 [ComfyUI](https://github.com/smthemex/ComfyUI_EchoMimic) is now available. Thanks @smthemex for the contribution.
 * [2024.11.21] 🔥 We release the EMTD dataset list and processing scripts.
@@ -132,14 +134,26 @@ Terminal Technology Department, Alipay, Ant Group.
 </tr>
 </table>
-## ⚒️ Installation
 ### Download the Codes
 ```bash
   git clone https://github.com/antgroup/echomimic_v2
   cd echomimic_v2
 ```
 ### Python Environment Setup
 - Tested System Environment: Centos 7.2/Ubuntu 22.04, Cuda >= 11.7
@@ -206,6 +220,11 @@ Run the python inference script:
 python infer.py --config='./configs/prompts/infer.yaml'
 ```
 ### EMTD Dataset
 Download dataset:
 ```bash
@@ -219,6 +238,7 @@ Process dataset:
 ```bash
 python ./EMTD_dataset/preprocess.py
 ```
 ## 📝 Release Plans
@@ -228,7 +248,8 @@ python ./EMTD_dataset/preprocess.py
 |    ✅    | Pretrained models trained on English and Mandarin Chinese on HuggingFace | 21st Nov, 2024 |
 |    ✅    | Pretrained models trained on English and Mandarin Chinese on ModelScope   | 21st Nov, 2024 |
 |    ✅    | EMTD dataset list and processing scripts                | 21st Nov, 2024 |
-|    🚀    | Accelerated models to be released                                        | TBD |
 |    🚀    | Online Demo on ModelScope to be released            | TBD |
 |    🚀    | Online Demo on HuggingFace to be released         | TBD |
@@ -248,13 +269,12 @@ If we missed any open-source projects or related articles, we would like to comp
 If you find our work useful for your research, please consider citing the paper :
 ```
-@misc{meng2024echomimic,
   title={EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation},
   author={Rang Meng, Xingyu Zhang, Yuming Li, Chenguang Ma},
   year={2024},
   eprint={2411.10061},
-  archivePrefix={arXiv},
-  primaryClass={cs.CV}
 }
 ```

 * EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation. [GitHub](https://github.com/antgroup/echomimic_v2)
 ## &#x1F4E3; Updates
+* [2025.01.03] 🚀🔥 **One Minute is All You Need to Generate Video**. [Accelerated EchoMimicV2](https://github.com/antgroup/echomimic_v2/blob/main/infer_acc.py) are released. The inference speed can be improved by 9x (from ~7mins/120frames to ~50s/120frames on A100 GPU).
+* [2024.12.16] 🔥 [RefImg-Pose Alignment Demo](https://github.com/antgroup/echomimic_v2/blob/main/demo.ipynb) is now available, which involves aligning reference image, extracting pose from driving video, and generating video.
+* [2024.11.27] 🔥 [Installation tutorial](https://www.youtube.com/watch?v=2ab6U1-nVTQ) is now available. Thanks [AiMotionStudio](https://www.youtube.com/@AiMotionStudio) for the contribution.
 * [2024.11.22] 🔥 [GradioUI](https://github.com/antgroup/echomimic_v2/blob/main/app.py) is now available. Thanks @gluttony-10 for the contribution.
 * [2024.11.22] 🔥 [ComfyUI](https://github.com/smthemex/ComfyUI_EchoMimic) is now available. Thanks @smthemex for the contribution.
 * [2024.11.21] 🔥 We release the EMTD dataset list and processing scripts.
 </tr>
 </table>
+## ⚒️ Automatic Installation
 ### Download the Codes
 ```bash
   git clone https://github.com/antgroup/echomimic_v2
   cd echomimic_v2
 ```
+### Automatic Setup
+- CUDA >= 11.7, Python == 3.10
+```bash
+   sh linux_setup.sh
+```
+## ⚒️ Manual Installation
+### Download the Codes
+```bash
+  git clone https://github.com/antgroup/echomimic_v2
+  cd echomimic_v2
+```
 ### Python Environment Setup
 - Tested System Environment: Centos 7.2/Ubuntu 22.04, Cuda >= 11.7
 python infer.py --config='./configs/prompts/infer.yaml'
 ```
+Run the python inference script for accelerated version. Make sure to check out the configuration for accelerated inference:
+```bash
+python infer_acc.py --config='./configs/prompts/infer_acc.yaml'
+```
 ### EMTD Dataset
 Download dataset:
 ```bash
 ```bash
 python ./EMTD_dataset/preprocess.py
 ```
+Make sure to check out the [discussions](https://github.com/antgroup/echomimic_v2/discussions) to learn how to start the inference.
 ## 📝 Release Plans
 |    ✅    | Pretrained models trained on English and Mandarin Chinese on HuggingFace | 21st Nov, 2024 |
 |    ✅    | Pretrained models trained on English and Mandarin Chinese on ModelScope   | 21st Nov, 2024 |
 |    ✅    | EMTD dataset list and processing scripts                | 21st Nov, 2024 |
+|    ✅    | Jupyter demo with pose and reference image alignmnet                | 16st Dec, 2024 |
+|    ✅    | Accelerated models                                        | 3st Jan, 2025 |
 |    🚀    | Online Demo on ModelScope to be released            | TBD |
 |    🚀    | Online Demo on HuggingFace to be released         | TBD |
 If you find our work useful for your research, please consider citing the paper :
 ```
+@misc{meng2024echomimicv2,
   title={EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation},
   author={Rang Meng, Xingyu Zhang, Yuming Li, Chenguang Ma},
   year={2024},
   eprint={2411.10061},
+  archivePrefix={arXiv}
 }
 ```