IMAGE INPAINTING BY MSCSWIN TRANSFORMER ADVERSARIAL AUTOENCODER

Bo-Wei Chen, Tsung-Jung Liu, Kuan-Hsien Liu

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Lecture 11 Oct 2023

Image inpainting has been researched for years. From deeper and larger models to models that focus on global information, all of them aim to obtain results closer to reality. In this paper, we combine the stripe window and line-by-line feature shift to modify the Vision Transformer (ViT) to reduce the computation cost and obtain global information from the oblique attention. In addition, we design a new loss function to enhance the texture and colors for inpainting. At last, to validate the efficacy of our proposed model, we conduct extensive experiments on commonly seen datasets (Places2 and CelebA) compared with other state-of-the-art methods. The source code and pretrained models are available at https://github.com/bobo0303/MSCS-Net.

Tags:

HSV color space

image inpainting

multi-shift window

vision transformer

gated convolution