Diffusers
Safetensors

SceneDesigner: Controllable Multi-Object Image Generation with 9-DoF Pose Manipulation

Zhenyuan Qin* · Xincheng Shuai* · Henghui Ding

Fudan University

HuggingFace

⚙️ Quick Start

1. Installation

  1. Install Python environment (recommended to use uv)

    uv sync
    

    Or alternatively:

    pip install -r requirements.txt
    
  2. Install Blender environment

    cd render
    python install.py
    

    If the automatic installation script fails, you can install manually:

    • First download Blender and extract it to the ./render directory
    • Then locate the Blender Python path and install the Python dependencies for Blender, for example:
    cd render
    blender-4.2.8-linux-x64/4.2/python/bin/python3.11 -m pip install -r blender_requirements.txt
    

2. Download Checkpoints

  1. Download the SceneDesigner weights to the checkpoints directory
  2. Download the Stable Diffusion 3.5 base model weights to the checkpoints directory

3. Run Demo

Launch the Gradio app:

python app.py \
  --blender_path render/blender/blender \
  --device cuda:0 \
  --port 7861 
  • Adjust the 9D pose of the cube in the Cube Controls panel
  • Enter text prompts in the Generation Config panel and click the Generate Images button to create images

✒️ Citation

If you find our work useful for your research and applications, please kindly cite using this BibTeX:

@inproceedings{SceneDesigner,
        title={SceneDesigner: Controllable Multi-Object Image Generation with 9-DoF Pose Manipulation},
        author={Qin, Zhenyuan and Shuai, Xincheng and Ding, Henghui},
        booktitle={NeurIPS},
        year={2025}
      }
Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including FudanCVL/SceneDesigner

Paper for FudanCVL/SceneDesigner