worstcoder
/

Wan

Model card Files Files and versions

Wan / README.md

worstcoder's picture

Create README.md

4498433 verified 5 days ago

|

history blame contribute delete

1.07 kB

	---
	license: apache-2.0
	base_model:
	- Wan-AI/Wan2.1-T2V-1.3B
	- Wan-AI/Wan2.1-T2V-14B
	pipeline_tag: text-to-video
	---
	# rCM: Score-Regularized Continuous-Time Consistency Model
	[Paper](https://arxiv.org/abs/2510.08431) \| [Website](https://research.nvidia.com/labs/dir/rcm) \| [Code](https://github.com/NVlabs/rcm)

	This repo holds converted Wan official checkpoints in rCM/TurboDiffusion style.

	Specifically, rCM equivalently replaces the Conv3d layer in the original Wan with a Linear layer for patch embedding, facilitating further optimization. The layer weight is directly reshaped without value change, e.g., from shape [5120, 16, 1, 2, 2] (Conv3d) to shape [5120, 64] (Linear).

	## Citation

	```
	@article{zheng2025rcm,
	title={Large Scale Diffusion Distillation via Score-Regularized Continuous-Time Consistency},
	author={Zheng, Kaiwen and Wang, Yuji and Ma, Qianli and Chen, Huayu and Zhang, Jintao and Balaji, Yogesh and Chen, Jianfei and Liu, Ming-Yu and Zhu, Jun and Zhang, Qinsheng},
	journal={arXiv preprint arXiv:2510.08431},
	year={2025}
	}
	```