SMAC
ReDWeb-S: a large-scale challenging dataset for RGB-D Salient Object Detection.
Citing our work
If you find our work helpful, please cite:
```
@misc{liu2020learning,
  title={Learning Selective Mutual Attention and Contrast for RGB-D Saliency Detection},
  author={Nian Liu and Ni Zhang and Ling Shao and Junwei Han},
  year={2020},
  eprint={2010.05537},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}
```
The Proposed RGB-D Salient Object Detection Dataset
ReDWeb-S
We construct ReDWeb-S, a new large-scale and challenging dataset with 3,179 images in total, covering various real-world scenes and paired with high-quality depth maps. We split the dataset into a training set of 2,179 RGB-D image pairs and a testing set of the remaining 1,000 pairs.
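The training/testing split above can be loaded with a small helper that pairs each RGB image with its depth map and ground-truth mask. The directory layout and file extensions below are an assumption for illustration (the released archive may organize files differently):

```python
import os


def list_redwebs_pairs(root, split="trainset"):
    """Pair RGB images, depth maps, and GT masks for one ReDWeb-S split.

    Assumed (hypothetical) layout:
        root/<split>/RGB/*.jpg
        root/<split>/depth/*.png
        root/<split>/GT/*.png
    """
    rgb_dir = os.path.join(root, split, "RGB")
    # Match files across subfolders by their shared base name.
    names = sorted(os.path.splitext(f)[0] for f in os.listdir(rgb_dir))
    return [
        (
            os.path.join(rgb_dir, name + ".jpg"),
            os.path.join(root, split, "depth", name + ".png"),
            os.path.join(root, split, "GT", name + ".png"),
        )
        for name in names
    ]
```

Swapping `split="trainset"` for the test split name gives the 1,000 evaluation pairs.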
The dataset can be downloaded here: [Baidu Pan (fetch code: rp8b) | Google Drive].
Dataset Statistics and Comparisons
We analyze the proposed ReDWeb-S dataset from several statistical aspects and compare it against other existing RGB-D SOD datasets.
Fig.1. Top 60% scene and object category distributions of our proposed ReDWeb-S dataset.
Fig.2. Comparison of nine RGB-D SOD datasets in terms of the distributions of global contrast and interior contrast.
Fig.3. Comparison of the average annotation maps for nine RGB-D SOD benchmark datasets.
Fig.4. Comparison of the distribution of object size for nine RGB-D SOD benchmark datasets.
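Two of the statistics behind these figures are straightforward to compute from an annotation map. The sketch below shows the object-size ratio (Fig. 4) and one plausible global-contrast measure (Fig. 2), here taken as the chi-square distance between foreground and background gray-level histograms; the paper's exact contrast formulation may differ:

```python
import numpy as np


def object_size_ratio(mask):
    """Fraction of image area covered by the salient object (cf. Fig. 4)."""
    mask = np.asarray(mask) > 0.5
    return float(mask.mean())


def global_contrast(image, mask, bins=16):
    """Chi-square distance between the gray-level histograms of the salient
    object and the background (one plausible reading of Fig. 2's statistic).

    `image`: 2-D grayscale array with values in [0, 1].
    `mask`:  binary ground-truth annotation map of the same shape.
    """
    mask = np.asarray(mask) > 0.5
    fg, _ = np.histogram(image[mask], bins=bins, range=(0, 1), density=True)
    bg, _ = np.histogram(image[~mask], bins=bins, range=(0, 1), density=True)
    denom = fg + bg
    denom[denom == 0] = 1.0  # avoid 0/0 for empty bins
    return float(0.5 * np.sum((fg - bg) ** 2 / denom))
```

A higher global-contrast value means the object stands out more from the background, so datasets whose distribution concentrates at low values (as ReDWeb-S does in Fig. 2) are harder.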
SOTA Results on our proposed dataset
We provide the results and evaluation scores of other SOTA RGB-D methods on our proposed dataset. You can directly download all results [here (fetch code: lfa6)].
No. | Pub. | Name | Title | Download |
---|---|---|---|---|
01 | CVPR2020 | S2MA | Learning Selective Self-Mutual Attention for RGB-D Saliency Detection | results, g0pgx |
02 | CVPR2020 | JL-DCF | JL-DCF: Joint Learning and Densely-Cooperative Fusion Framework for RGB-D Salient Object Detection | results, xh9p |
03 | CVPR2020 | UCNet | UC-Net: Uncertainty Inspired RGB-D Saliency Detection via Conditional Variational Autoencoders | results, 6o93 |
04 | CVPR2020 | A2dele | A2dele: Adaptive and Attentive Depth Distiller for Efficient RGB-D Salient Object Detection | results, swv5 |
05 | CVPR2020 | SSF-RGBD | Select, Supplement and Focus for RGB-D Saliency Detection | results, oshl |
06 | TIP2020 | DisenFusion | RGBD Salient Object Detection via Disentangled Cross-Modal Fusion | results, h3hc |
07 | TNNLS2020 | D3Net | Rethinking RGB-D Salient Object Detection: Models, Datasets, and Large-Scale Benchmarks | results, tetn |
08 | ICCV2019 | DMRA | Depth-induced multi-scale recurrent attention network for saliency detection | results, kqq4 |
09 | CVPR2019 | CPFP | Contrast Prior and Fluid Pyramid Integration for RGBD Salient Object Detection | results, 0v2c |
10 | TIP2019 | TANet | Three-stream attention-aware network for RGB-D salient object detection | results, hsy9 |
11 | CVPR2018 | PCF | Progressively Complementarity-Aware Fusion Network for RGB-D Salient Object Detection | results, qzhm |
12 | PR2019 | MMCI | Multi-modal fusion network with multiscale multi-path and cross-modal interactions for RGB-D salient object detection | results, c90m |
13 | TCyb2017 | CTMF | CNNs-based RGB-D saliency detection via cross-view transfer and multiview fusion | results, i0zb |
14 | Access2019 | AFNet | Adaptive fusion for RGB-D salient object detection | results, 54zc |
15 | TIP2017 | DF | RGBD salient object detection via deep fusion | results, d7sc |
16 | ICME2016 | SE | Salient object detection for RGB-D image via saliency evolution | results, h10s |
17 | SPL2016 | DCMC | Saliency detection for stereoscopic images based on depth confidence analysis and multiple cues fusion | results, 18po |
18 | CVPR2016 | LBE | Local background enclosure for RGB-D salient object detection | results, iiz5 |
Methods | S-measure | maxF | E-measure | MAE |
---|---|---|---|---|
S2MA | 0.711 | 0.696 | 0.781 | 0.139 |
JL-DCF | 0.734 | 0.727 | 0.805 | 0.128 |
UCNet | 0.713 | 0.710 | 0.794 | 0.130 |
A2dele | 0.641 | 0.603 | 0.672 | 0.160 |
SSF-RGBD | 0.595 | 0.558 | 0.710 | 0.189 |
DisenFusion | 0.675 | 0.658 | 0.760 | 0.160 |
D3Net | 0.689 | 0.673 | 0.768 | 0.149 |
DMRA | 0.592 | 0.579 | 0.721 | 0.188 |
CPFP | 0.685 | 0.645 | 0.744 | 0.142 |
TANet | 0.656 | 0.623 | 0.741 | 0.165 |
PCF | 0.655 | 0.627 | 0.743 | 0.166 |
MMCI | 0.660 | 0.641 | 0.754 | 0.176 |
CTMF | 0.641 | 0.607 | 0.739 | 0.204 |
AFNet | 0.546 | 0.549 | 0.693 | 0.213 |
DF | 0.595 | 0.579 | 0.683 | 0.233 |
SE | 0.435 | 0.393 | 0.587 | 0.283 |
DCMC | 0.427 | 0.348 | 0.549 | 0.313 |
LBE | 0.637 | 0.629 | 0.730 | 0.253 |
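Two of the metrics in the table above are simple to reproduce from a predicted saliency map and its ground-truth mask. The sketch below computes MAE and the max F-measure over uniform binarization thresholds, using the conventional beta^2 = 0.3 weighting from the SOD literature (S-measure and E-measure need their own, more involved implementations and are omitted here):

```python
import numpy as np


def mae(pred, gt):
    """Mean absolute error between a saliency map and the binary GT mask,
    both expected to hold values in [0, 1]."""
    return float(np.abs(np.asarray(pred) - np.asarray(gt)).mean())


def max_f_measure(pred, gt, beta2=0.3, steps=255):
    """Maximum F-measure over `steps` uniform binarization thresholds.

    beta^2 = 0.3 is the weight conventionally used in SOD evaluation to
    emphasize precision over recall.
    """
    pred = np.asarray(pred)
    gt = np.asarray(gt) > 0.5
    best = 0.0
    for t in np.linspace(0.0, 1.0, steps, endpoint=False):
        binary = pred > t
        tp = np.logical_and(binary, gt).sum()
        if tp == 0:
            continue  # precision/recall undefined or zero at this threshold
        precision = tp / binary.sum()
        recall = tp / gt.sum()
        f = (1 + beta2) * precision * recall / (beta2 * precision + recall)
        best = max(best, f)
    return float(best)
```

Scores for a method would then be averaged over the 1,000 test pairs; note that published numbers may use slightly different threshold grids or adaptive thresholds, so small deviations from the table are expected.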
Acknowledgement
We thank all annotators for helping us construct the proposed dataset. ReDWeb-S is built on the ReDWeb dataset, a state-of-the-art dataset for monocular image depth estimation, and we also thank its authors for making ReDWeb available.