Style Transformer for Image Inversion and Editing

Mar 18, 2022 1 min read

Style Transformer for Image Inversion and Editing (CVPR2022)

Existing GAN inversion methods fail to provide latent codes for reliable reconstruction and flexible editing simultaneously. This paper presents a transformer-based image inversion and editing model for pretrained StyleGAN which is not only with less distortions, but also of high quality and flexibility for editing. The proposed model employs a CNN encoder to provide multi-scale image features as keys and values. Meanwhile it regards the style code to be determined for different layers of the generator as queries. It first initializes query tokens as learnable parameters and maps them into $W^+$ space. Then the multi-stage alternate self- and cross-attention are utilized, updating queries with the purpose of inverting the input by the generator. Moreover, based on the inverted code, we investigate the reference- and label-based attribute editing through a pretrained latent classifier, and achieve flexible image-to-image translation with high quality results. Extensive experiments are carried out, showing better performances on both inversion and editing tasks within StyleGAN.

Our style transformer proposes novel multi-stage style transformer in w+ space to invert image accurately, and we characterize the image editing in StyleGAN into label-based and reference-based, and use a non-linear classifier to generate the editing vector.

Getting Started

Prerequisites

Ubuntu 16.04
NVIDIA GPU + CUDA CuDNN
Python 3

Pretrained Models

Coming soon!

Training

Coming soon!

Inference

Coming soon!

Citation

If you use this code for your research, please cite

GitHub

View Github

Transformer Images

John was the first writer to have joined pythonawesome.com. He has since then inculcated very effective writing and reviewing culture at pythonawesome which rivals have found impossible to imitate.

Style Transformer for Image Inversion and Editing

Style Transformer for Image Inversion and Editing (CVPR2022)

Getting Started

Prerequisites

Pretrained Models

Training

Inference

Citation

GitHub

John

A Telegram Bin Checker Bot made with python for check Bin valid or Invalid

CFGAN: A Generic Collaborative Filtering Framework based on Generative Adversarial Networks

Style Transformer for Image Inversion and Editing (CVPR2022)

Getting Started

Prerequisites

Pretrained Models

Training

Inference

Citation

GitHub

A Telegram Bin Checker Bot made with python for check Bin valid or Invalid

CFGAN: A Generic Collaborative Filtering Framework based on Generative Adversarial Networks

You might also like...