Double-Linear Thompson Sampling For Context-Attentive Bandits

Djallel Bouneffouf, Raphael Feraud, Sohini Upadhyay, Yasaman Khazaeni, Irina Rish

Length: 00:03:53

10 Jun 2021

In this paper, we analyze and extend an online learning framework known as the Context-Attentive Bandit. The framework is motivated by practical applications, from medical diagnosis to dialog systems, in which observation costs allow only a small subset of a potentially large number of context variables to be observed at each iteration; the agent, however, is free to choose which variables to observe. We derive a novel algorithm, called Context-Attentive Thompson Sampling (CATS), which builds on the Linear Thompson Sampling approach and adapts it to the Context-Attentive Bandit setting. We provide a theoretical regret analysis and an extensive empirical evaluation demonstrating the advantages of the proposed approach over several baseline methods on a variety of real-life datasets.
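As a rough illustration of the building block named in the abstract, the sketch below shows standard Linear Thompson Sampling applied to a partially observed context. It is not the CATS algorithm: the step in which the agent chooses which variables to observe is omitted, and the observed indices are simply passed in. The class and function names, the per-arm model layout, the zero-filling of unobserved variables, and the scale parameter v are all assumptions made for this example and do not come from the paper.

```python
import numpy as np


class LinearThompsonSampling:
    """Standard Linear Thompson Sampling, one linear reward model per arm.

    Hypothetical sketch: names and defaults are illustrative assumptions.
    """

    def __init__(self, n_features, n_arms, v=0.5, seed=0):
        self.v = v  # posterior scale (exploration strength), assumed default
        self.rng = np.random.default_rng(seed)
        # Per-arm precision matrix B_k (starts at identity) and vector f_k.
        self.B = np.stack([np.eye(n_features) for _ in range(n_arms)])
        self.f = np.zeros((n_arms, n_features))

    def select_arm(self, x):
        """Sample a parameter vector per arm and pick the best sampled score."""
        scores = []
        for B_k, f_k in zip(self.B, self.f):
            cov = np.linalg.inv(B_k)
            mean = cov @ f_k  # ridge-regression estimate for this arm
            theta = self.rng.multivariate_normal(mean, self.v ** 2 * cov)
            scores.append(float(x @ theta))
        return int(np.argmax(scores))

    def update(self, arm, x, reward):
        """Rank-one update of the chosen arm's statistics."""
        self.B[arm] += np.outer(x, x)
        self.f[arm] += reward * x


def mask_context(full_context, observed_idx):
    """Zero out unobserved variables (an assumed way to encode partial observation)."""
    masked = np.zeros_like(full_context)
    masked[observed_idx] = full_context[observed_idx]
    return masked
```

In a simulation loop, one would call mask_context on each round's full context with the indices the agent is allowed to observe, pass the result to select_arm, and then feed the observed reward back through update.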

Chairs:

Chang Yoo

Tags: signal processing society, IEEE ICASSP 2021, virtual conference, June 6-11 2021
