Self-Supervision By Prediction For Object Discovery In Videos

Beril Besbinar, Pascal Frossard

SPS

Length: 00:12:26

21 Sep 2021

Despite their remarkable success, deep learning algorithms still rely heavily on annotated data, and unsupervised settings pose many challenges, such as finding the right inductive bias in diverse scenarios. In this paper, we propose an object-centric model for image sequence representation that uses the prediction task for self-supervision. By disentangling object representation and motion dynamics, our novel compositional structure explicitly handles occlusion and inpaints inferred objects and background for the composition of the predicted frame. Using auxiliary losses to promote spatially and temporally consistent object representations, we train our self-supervised framework without the help of any annotation or pretrained network. Initial experiments confirm that our new pipeline is a promising step towards object-centric video prediction.

Tags:

Signal Processing Society

IEEE ICIP 2021

September 19-22

Virtual conference

More Like This

Low-Shot Early Gastric Cancer Diagnostic Model Driven By Unsupervised Features

Conditional Diffusion Models For Inverse MR Image Recovery

Laden: Lesion-Aware Adversarial Deep Network For The Grading Of Macular Diseases Using Color Fundus Images
