AIBA: Attention-based Instrument Band Alignment for Text-to-Audio Diffusion Paper • 2509.20891 • Published Sep 25, 2025
Jamendo-QA: A Large-Scale Music Question Answering Dataset Paper • 2509.15662 • Published Sep 19, 2025
CAT: Contrastive Adapter Training for Personalized Image Generation Paper • 2404.07554 • Published Apr 11, 2024
Improving Text Generation on Images with Synthetic Captions Paper • 2406.00505 • Published Jun 1, 2024