E3Diff: SAR-to-Optical Image Translation

๐Ÿ† 1st Place - CVPR PBVS2025 Multi-modal Aerial View Image Challenge

Model Description

E3Diff is an efficient end-to-end diffusion model for one-step SAR-to-Optical translation.

Key Features

  • Real-time inference: 0.17s per 256x256 image on A6000
  • High quality: 35% FID improvement over previous SOTA
  • One-step sampling: Unlike traditional diffusion (1000 steps)

Usage

from huggingface_hub import hf_hub_download
import torch

# Download weights
weights = hf_hub_download(repo_id="Dhenenjay/E3Diff-SAR2Optical", filename="I700000_E719_gen.pth")

Citation

@ARTICLE{10767752,
  author={Qin, Jiang and Zou, Bin and Li, Haolin and Zhang, Lamei},
  journal={IEEE Geoscience and Remote Sensing Letters}, 
  title={Efficient End-to-End Diffusion Model for One-step SAR-to-Optical Translation}, 
  year={2024},
  doi={10.1109/LGRS.2024.3506566}
}

Links

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Space using Dhenenjay/E3Diff-SAR2Optical 1