Seeing Voices: Generating A-Roll Video from Audio with Mirage Paper β’ 2506.08279 β’ Published Jun 9 β’ 27
EdgeFusion: On-Device Text-to-Image Generation Paper β’ 2404.11925 β’ Published Apr 18, 2024 β’ 23
LD-Pruner: Efficient Pruning of Latent Diffusion Models using Task-Agnostic Insights Paper β’ 2404.11936 β’ Published Apr 18, 2024 β’ 1
LatentSwap: An Efficient Latent Code Mapping Framework for Face Swapping Paper β’ 2402.18351 β’ Published Feb 28, 2024 β’ 2
Shortened LLaMA: A Simple Depth Pruning for Large Language Models Paper β’ 2402.02834 β’ Published Feb 5, 2024 β’ 17
On Architectural Compression of Text-to-Image Diffusion Models Paper β’ 2305.15798 β’ Published May 25, 2023 β’ 4
A Unified Compression Framework for Efficient Speech-Driven Talking-Face Generation Paper β’ 2304.00471 β’ Published Apr 2, 2023 β’ 1