DIS (IS-Net)


Dichotomous Image Segmentation with intermediate supervision strategy

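The Dichotomous Image Segmentation (DIS) paper proposes IS-Net, which has three components: a ground-truth (GT) encoder, an image segmentation component, and an intermediate supervision strategy.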

IS-Net Architecture

Stage 1: Self-supervised GT Encoder Training

$L_{gt} = \sum_{d=1}^{D} \lambda_{d}^{gt} BCE(F_{gt}(\theta_{gt}, G)_d, G)$

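Here $G$ is the ground-truth mask, $F_{gt}(\theta_{gt}, G)_d$ is the $d$-th side output of the GT encoder, and $\lambda_{d}^{gt}$ weights each output. The GT encoder is frozen after this stage.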
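The Stage 1 objective can be sketched in NumPy as a weighted sum of binary cross-entropy terms, one per side output. This is a minimal illustration, not the authors' code: the names `bce` and `gt_encoder_loss` are hypothetical, the side outputs are assumed to be sigmoid probabilities already resized to the mask resolution, and the default weights of 1.0 are an assumption.

```python
import numpy as np


def bce(pred, target, eps=1e-7):
    """Mean binary cross-entropy between probabilities and a binary mask."""
    # Clip to avoid log(0) when a prediction is exactly 0 or 1.
    pred = np.clip(np.asarray(pred, dtype=float), eps, 1.0 - eps)
    target = np.asarray(target, dtype=float)
    return float(-np.mean(target * np.log(pred) + (1.0 - target) * np.log(1.0 - pred)))


def gt_encoder_loss(side_outputs, gt_mask, weights=None):
    """L_gt: weighted sum of BCE over the D side outputs of the GT encoder.

    side_outputs: list of D probability maps, each with the shape of gt_mask
                  (assumed already upsampled; hypothetical interface).
    weights:      per-output lambda_d^{gt}; defaults to 1.0 (an assumption).
    """
    if weights is None:
        weights = [1.0] * len(side_outputs)
    return sum(w * bce(p, gt_mask) for w, p in zip(weights, side_outputs))
```

Because the GT encoder both reads and reconstructs $G$, a loss near zero here indicates that its side outputs reproduce the mask, which is when the encoder would be frozen.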

Feature Consistency Loss (intermediate supervision):

$L_{fs} = \sum_{d=1}^{D} \lambda_{d}^{fs} \|f_{d}^{I} - f_{d}^{G}\|^2$
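As a rough sketch, the feature consistency term compares each intermediate feature map of the segmentation network ($f_{d}^{I}$) with the matching feature map from the frozen GT encoder ($f_{d}^{G}$). The function name `feature_sync_loss` and the default weights are assumptions. The squared norm is implemented as a plain sum of squared differences; a mean-based variant (MSE) would differ only by a constant factor per level.

```python
import numpy as np


def feature_sync_loss(image_feats, gt_feats, weights=None):
    """L_fs: weighted sum of squared L2 distances between paired feature maps.

    image_feats: list of D arrays f_d^I from the segmentation network.
    gt_feats:    list of D arrays f_d^G from the frozen GT encoder,
                 each with the same shape as its image_feats partner.
    weights:     per-level lambda_d^{fs}; defaults to 1.0 (an assumption).
    """
    if len(image_feats) != len(gt_feats):
        raise ValueError("expected the same number of feature levels")
    if weights is None:
        weights = [1.0] * len(image_feats)
    total = 0.0
    for w, f_i, f_g in zip(weights, image_feats, gt_feats):
        diff = np.asarray(f_i, dtype=float) - np.asarray(f_g, dtype=float)
        total += w * float(np.sum(diff ** 2))  # squared L2 norm of the difference
    return total
```

In an autodiff framework the GT features would be treated as constants, since the GT encoder is frozen and only the segmentation network receives gradients from this term.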

Segmentation loss on the side outputs:

$L_{sg} = \sum_{d=1}^{D} \lambda_{d}^{sg} BCE(F_{sg}(\theta_{sg}, I)_d, G)$

Total loss: $L = L_{fs} + L_{sg}$
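Putting the two terms together, the training objective for the segmentation network could look like the sketch below. It is an illustration under stated assumptions rather than the reference implementation: `is_net_total_loss` is a hypothetical name, side outputs are assumed to be sigmoid probabilities at mask resolution, all weights default to 1.0, and the same depth $D$ is used for both sums as in the formulas above.

```python
import numpy as np


def bce(pred, target, eps=1e-7):
    """Mean binary cross-entropy between probabilities and a binary mask."""
    pred = np.clip(np.asarray(pred, dtype=float), eps, 1.0 - eps)
    target = np.asarray(target, dtype=float)
    return float(-np.mean(target * np.log(pred) + (1.0 - target) * np.log(1.0 - pred)))


def is_net_total_loss(side_outputs, gt_mask, image_feats, gt_feats,
                      w_sg=None, w_fs=None):
    """Return (L, L_fs, L_sg) with L = L_fs + L_sg.

    side_outputs: D probability maps from the segmentation network.
    image_feats:  D intermediate feature maps f_d^I.
    gt_feats:     D feature maps f_d^G from the frozen GT encoder.
    """
    depth = len(side_outputs)
    if not (len(image_feats) == len(gt_feats) == depth):
        raise ValueError("expected D side outputs and D feature pairs")
    w_sg = [1.0] * depth if w_sg is None else w_sg  # lambda_d^{sg}, assumed 1.0
    w_fs = [1.0] * depth if w_fs is None else w_fs  # lambda_d^{fs}, assumed 1.0

    # Segmentation term: BCE of every side output against the mask.
    l_sg = sum(w * bce(p, gt_mask) for w, p in zip(w_sg, side_outputs))

    # Feature consistency term: squared L2 distance per feature level.
    l_fs = sum(
        w * float(np.sum((np.asarray(f_i, dtype=float) - np.asarray(f_g, dtype=float)) ** 2))
        for w, f_i, f_g in zip(w_fs, image_feats, gt_feats)
    )
    return l_fs + l_sg, l_fs, l_sg
```

Returning the two terms alongside the total makes it easy to log them separately and see whether the feature alignment or the mask prediction dominates training.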

DIS Results

HCE Algorithm