Generating Annotated High-Fidelity Images Containing Multiple Coherent Objects
Bryan Cardenas Guevara, Devanshu Arya, Deepak K. Gupta
Recent developments in generative models have enabled the generation of diverse and high-fidelity images. In particular, layout-to-image generation models have gained significant attention due to their capability to generate realistic and complex images containing distinct objects. These models are generally conditioned on either semantic layouts or textual descriptions. However, unlike for natural images, providing such auxiliary information can be extremely hard in domains such as biomedical imaging and remote sensing. In this work, we propose a multi-object generation framework that can synthesize images with multiple objects without explicitly requiring their contextual information during the generation process. Based on a vector-quantized variational autoencoder (VQ-VAE) backbone, our model learns to preserve spatial coherency within an image as well as semantic coherency through the use of powerful autoregressive priors. An advantage of our approach is that the generated samples are accompanied by object-level annotations. The efficacy of our approach is demonstrated through application to medical imaging datasets, where we show that augmenting the training set with samples generated by our approach improves the performance of existing models.
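The abstract only names the building blocks, so the following is a minimal, illustrative sketch of the generic pattern it describes: a VQ-VAE that compresses an image into a grid of discrete codes, over which a separate autoregressive prior would later be trained and sampled. The layer sizes, codebook size, and class names here are assumptions for illustration, not the authors' exact architecture.

```python
# Minimal VQ-VAE sketch (assumed architecture, not the authors' exact model).
import torch
import torch.nn as nn
import torch.nn.functional as F

class VectorQuantizer(nn.Module):
    """Nearest-neighbour codebook lookup with straight-through gradients."""
    def __init__(self, num_codes=512, code_dim=64, beta=0.25):
        super().__init__()
        self.codebook = nn.Embedding(num_codes, code_dim)
        self.codebook.weight.data.uniform_(-1.0 / num_codes, 1.0 / num_codes)
        self.beta = beta

    def forward(self, z_e):                                     # z_e: (B, C, H, W)
        B, C, H, W = z_e.shape
        flat = z_e.permute(0, 2, 3, 1).reshape(-1, C)           # (B*H*W, C)
        dist = torch.cdist(flat, self.codebook.weight)          # (B*H*W, K)
        idx = dist.argmin(dim=1)                                # discrete code indices
        z_q = self.codebook(idx).view(B, H, W, C).permute(0, 3, 1, 2)
        # Codebook + commitment losses; straight-through estimator for decoder gradients.
        vq_loss = F.mse_loss(z_q, z_e.detach()) + self.beta * F.mse_loss(z_e, z_q.detach())
        z_q = z_e + (z_q - z_e).detach()
        return z_q, idx.view(B, H, W), vq_loss

class VQVAE(nn.Module):
    def __init__(self, in_ch=3, code_dim=64):
        super().__init__()
        self.enc = nn.Sequential(
            nn.Conv2d(in_ch, 128, 4, 2, 1), nn.ReLU(),
            nn.Conv2d(128, code_dim, 4, 2, 1),
        )
        self.quant = VectorQuantizer(code_dim=code_dim)
        self.dec = nn.Sequential(
            nn.ConvTranspose2d(code_dim, 128, 4, 2, 1), nn.ReLU(),
            nn.ConvTranspose2d(128, in_ch, 4, 2, 1),
        )

    def forward(self, x):
        z_q, codes, vq_loss = self.quant(self.enc(x))
        recon = self.dec(z_q)
        return recon, codes, F.mse_loss(recon, x) + vq_loss

# After the VQ-VAE is trained, an autoregressive prior (e.g. a PixelCNN- or
# transformer-style model) is fit over the (B, H, W) grids of `codes`;
# sampling that prior and decoding the sampled codes yields new images.
```

How the prior is conditioned so that the generated samples come with object-level annotations is specific to the paper and is not reproduced here.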