Fast Learning From Label Proportions With Small Bags

Denis Baru?i?, Jan Kybic

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 00:11:45

19 Oct 2022

Image captioning is a challenging task that connects two major artificial intelligence fields: computer vision and natural language processing. Image captioning models use traditional images to generate a natural language description of the scene. However, the scene could contain private information that we want to hide but still generate the captions. inspired by the trend of jointly designing optics and algorithms, this paper addresses the problem of privacy-preserving scene captioning. Our approach promotes privacy preservation, by hiding the faces in the images, during the acquisition process with a designed refractive camera lens while extracting useful features to perform image captioning. The refractive lens and an image captioning deep network architecture are optimized end-to-end to generate descriptions directly from the blurred images. Simulations show that our privacy-preserving approach degrades private visual attributes (e.g., face detection fails with our distorted images) while achieving comparable captioning performance with traditional non-private methods on the COCO dataset.

Tags:

International Conference on Image Processing

IEEE ICIP 2022

icip

Fast Learning From Label Proportions With Small Bags

Denis Baru?i?, Jan Kybic

Value-Added Bundle(s) Including this Product

ICIP 2022, October 16-19, 2022, Bordeaux, France - Presentation Videos Product Bundle

More Like This

Fusion Temporal Color Constancy

RPFNET: Complementary Feature Fusion For Hand Gesture Recognition

Class-Wise Fm-Nms For Knowledge Distillation of Object Detection

Join an IEEE Society