Exploiting a Joint Embedding Space for Generalized Zero-Shot Semantic Segmentation

This is an official implementation of the paper "Exploiting a Joint Embedding Space for Generalized Zero-Shot Semantic Segmentation", accepted to ICCV 2021.

For more information, please check out the project site [website] and the paper [arXiv].

Pre-requisites

This repository uses the following libraries:

  • Python (3.6)
  • Pytorch (1.8.1)
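
You can quickly confirm that your environment matches these versions with a short check (a minimal sketch, not part of the repository):

import sys
import torch

# Print the interpreter and PyTorch versions; compare against
# Python 3.6 and PyTorch 1.8.1 listed above.
print("Python ", sys.version.split()[0])
print("PyTorch", torch.__version__)
print("CUDA available:", torch.cuda.is_available())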

Getting Started

Datasets

VOC

The dataset directory should be organized as follows:

/dataset/PASCALVOC/VOCdevkit/VOC2012/                         % Pascal VOC dataset root
/dataset/PASCALVOC/VOCdevkit/VOC2012/JPEGImages/              % Pascal VOC images
/dataset/PASCALVOC/VOCdevkit/VOC2012/SegmentationClass/       % Pascal VOC segmentation maps
/dataset/PASCALVOC/VOCdevkit/VOC2012/ImageSets/Segmentation/  % Pascal VOC splits
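
Before training, you can verify this layout with a short script (a minimal sketch; the root path is the one shown above):

import os

VOC_ROOT = "/dataset/PASCALVOC/VOCdevkit/VOC2012"

# Check that every directory the data loaders expect is present.
for sub in ("JPEGImages", "SegmentationClass", "ImageSets/Segmentation"):
    path = os.path.join(VOC_ROOT, sub)
    print(path, "ok" if os.path.isdir(path) else "MISSING")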

CONTEXT

The dataset directory should be organized as follows:

/dataset/context/                                 % Pascal CONTEXT dataset root
/dataset/context/59_labels.pth                    % Pascal CONTEXT segmentation maps
/dataset/context/pascal_context_train.txt         % Pascal CONTEXT splits
/dataset/context/pascal_context_val.txt           % Pascal CONTEXT splits
/dataset/PASCALVOC/VOCdevkit/VOC2012/JPEGImages/  % Pascal VOC images
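
A similar check for CONTEXT (a minimal sketch; note that CONTEXT reuses the Pascal VOC images, so both roots must exist):

import os

CONTEXT_ROOT = "/dataset/context"
VOC_IMAGES = "/dataset/PASCALVOC/VOCdevkit/VOC2012/JPEGImages"

# Check the label file, the split files, and the shared VOC images.
for path in (
    os.path.join(CONTEXT_ROOT, "59_labels.pth"),
    os.path.join(CONTEXT_ROOT, "pascal_context_train.txt"),
    os.path.join(CONTEXT_ROOT, "pascal_context_val.txt"),
    VOC_IMAGES,
):
    print(path, "ok" if os.path.exists(path) else "MISSING")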

Training

We use DeepLabV3+ with ResNet-101 as our visual encoder. Following ZS3Net, ResNet-101 is initialized with weights pre-trained for ImageNet classification using training samples of seen classes only (weights here). A sketch of this initialization step is shown below.
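
For illustration, a minimal sketch of initializing a ResNet-101 backbone with ImageNet weights via torchvision; the repository uses its own loader together with the seen-class-only checkpoint linked above, so this is not the actual training code:

import torch
import torchvision

# Standard ImageNet-pretrained ResNet-101 from torchvision.
# The paper's setting instead uses weights pre-trained on ImageNet
# with samples of unseen classes removed (see the link above).
backbone = torchvision.models.resnet101(pretrained=True)

# Loading the seen-class-only checkpoint instead would look like this
# (the file name is hypothetical):
# state_dict = torch.load("resnet101_seen_only.pth", map_location="cpu")
# backbone.load_state_dict(state_dict)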

VOC

python train_pascal_zs3setting.py -c configs/config_pascal_zs3setting.json -d 0,1,2,3

CONTEXT

python train_context_zs3setting.py -c configs/config_context_zs3setting.json -d 0,1,2,3

Testing

VOC

python train_pascal_zs3setting.py -c configs/config_pascal_zs3setting.json -d 0,1,2,3 -r <visual encoder>.pth --test

CONTEXT

python train_context_zs3setting.py -c configs/config_context_zs3setting.json -d 0,1,2,3 -r <visual encoder>.pth --test
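
In the generalized zero-shot setting, results are typically reported as the mIoU over seen classes, the mIoU over unseen classes, and their harmonic mean. A minimal sketch of the harmonic mean (the scores below are hypothetical, not results from this repository):

def harmonic_mean(miou_seen, miou_unseen):
    # Harmonic mean of seen/unseen mIoU, the usual summary metric
    # for generalized zero-shot semantic segmentation.
    if miou_seen + miou_unseen == 0:
        return 0.0
    return 2 * miou_seen * miou_unseen / (miou_seen + miou_unseen)

print(harmonic_mean(0.70, 0.35))  # -> 0.4666...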

Acknowledgements

GitHub

https://github.com/cvlab-yonsei/JoEm