SceneDesigner: Controllable Multi-Object Image Generation with 9-DoF Pose Manipulation

Zhenyuan Qin^* · Xincheng Shuai^* · Henghui Ding ^†

Fudan University

⚙️ Quick Start

1. Installation

Install Python environment (recommended to use uv)
```
uv sync
```
Or alternatively:
```
pip install -r requirements.txt
```
Install Blender environment
```
cd render
python install.py
```
If the automatic installation script fails, you can install manually:
- First download Blender and extract it to the ./render directory
- Then locate the Blender Python path and install the Python dependencies for Blender, for example:
```
cd render
blender-4.2.8-linux-x64/4.2/python/bin/python3.11 -m pip install -r blender_requirements.txt
```

2. Download Checkpoints

Download the SceneDesigner weights to the checkpoints directory
Download the Stable Diffusion 3.5 base model weights to the checkpoints directory

3. Run Demo

Launch the Gradio app:

python app.py \
  --blender_path render/blender/blender \
  --device cuda:0 \
  --port 7861

Adjust the 9D pose of the cube in the Cube Controls panel
Enter text prompts in the Generation Config panel and click the Generate Images button to create images

✒️ Citation

If you find our work useful for your research and applications, please kindly cite using this BibTeX:

@inproceedings{SceneDesigner,
        title={SceneDesigner: Controllable Multi-Object Image Generation with 9-DoF Pose Manipulation},
        author={Qin, Zhenyuan and Shuai, Xincheng and Ding, Henghui},
        booktitle={NeurIPS},
        year={2025}
      }

Downloads last month: -

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including FudanCVL/SceneDesigner

SceneDesigner

Collection

2 items • Updated Dec 18, 2025 • 1

Paper for FudanCVL/SceneDesigner

SceneDesigner: Controllable Multi-Object Image Generation with 9-DoF Pose Manipulation

Paper • 2511.16666 • Published Nov 20, 2025