Overview of our Data Augmentation and Knowledge Distillation (DAKD) pipeline. The diffusion-based SAR-JointNet learns to generate SAR images and their soft labels for data augmentation and knowledge distillation to our student SAROSS-Net.
Oil spills in the ocean pose severe environmental risks, making early detection essential. Synthetic aperture radar (SAR) based oil spill segmentation offers robust monitoring under various conditions but faces challenges due to the limited labeled data and inherent speckle noise in SAR imagery. To address these issues, we propose (i) a diffusion-based Data Augmentation and Knowledge Distillation (DAKD) pipeline and (ii) a novel SAR oil spill segmentation network, called SAROSS-Net.
In our DAKD pipeline, we present a diffusion-based SAR-JointNet that learns to generate realistic SAR images and their labels for segmentation, by effectively modeling joint distribution with balancing two modalities. The DAKD pipeline augments the training dataset and distills knowledge from SAR-JointNet by utilizing generated soft labels (pixel-wise probability maps) to supervise our SAROSS-Net. The SAROSS-Net is designed to selectively transfer high-frequency features from noisy SAR images, by employing novel Context-Aware Feature Transfer blocks along skip connections.
We demonstrate our SAR-JointNet can generate realistic SAR images and well-aligned segmentation labels, providing the augmented data to train SAROSS-Net with enhanced generalizability. Our SAROSS-Net trained with the DAKD pipeline significantly outperforms existing SAR oil spill segmentation methods with large margins.
Network Architecture: The left figure shows the overall architecture of our SAR-JointNet, which is designed for joint modality fusion and SAR image generation. The right figure illustrates the SAR Oil Spill Segmentation Network (SAROSS-Net), optimized for precise oil spill segmentation tasks using SAR images.
Quantiity comparison of generated SAR images between our SAR-JointNet and other diffusion-based generative models such as DDPM and Joint-Net.
TOP: Oil spill segmentation performance comparison on OSD
dataset and SOS dataset (ALOS & Sentinel-1A satellites) with other segmentation model such as CBD-Net, DeepLabV3+, and Segformer.
BOTTOM: Oil spill segmentation performance comparison on OSD
dataset showing IoU (%) for 5 classes and mIoU (%) with other segmentation model such as CBD-Net, DeepLabV3+, and Segformer.
Joint generation results: Qualitative comparison of SAR images and corresponding segmentation masks generated by: (a) DDPM, (b) JointNet, and (c) SAR-JointNet (ours) to the samples from (d) the Original OSD dataset
Qualitative comparison of segmentation results on the OSD dataset. Each row shows a SAR input, four segmentation results by CBD-Net, SegFormer, DeepLabV3+ and our SAROSS-Net, and a ground truth segmentation mask, respectively.
@misc{moon2024dakddataaugmentationknowledge,
title={DAKD: Data Augmentation and Knowledge Distillation using Diffusion Models for SAR Oil Spill Segmentation},
author={Jaeho Moon and Jeonghwan Yun and Jaehyun Kim and Jaehyup Lee and Munchurl Kim},
year={2024},
eprint={2412.08116},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2412.08116},
}