IEEE Members: $11.00
Non-members: $15.00
Length: 58:52
Q-learning, which seeks to learn the optimal Q-function of a Markov decision process (MDP) in a model-free fashion, lies at the heart of reinforcement learning practice. However, despite significant recent efforts, the theoretical understanding of its non-asymptotic sample complexity remains unsatisfactory. In this talk, we first present a tight sample complexity bound for Q-learning in the single-agent setting, together with a matching lower bound that establishes its minimax sub-optimality. We then show how federated versions of Q-learning enable collaborative learning from data collected by multiple agents without central sharing, where an importance averaging scheme is introduced to unveil the blessing of heterogeneity.
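To make the two ingredients of the abstract concrete, here is a minimal NumPy sketch (not taken from the talk or the underlying papers): a tabular Q-learning update, and a visitation-weighted averaging step in the spirit of the importance averaging mentioned above, where entries that an agent visits more often receive more weight in the aggregate. All function and parameter names are hypothetical placeholders, and the exact weighting rule used in the talk may differ.

```python
import numpy as np


def q_learning_step(Q, s, a, r, s_next, alpha, gamma):
    """One tabular Q-learning update on a Q-table of shape (S, A)."""
    td_target = r + gamma * Q[s_next].max()      # bootstrapped target
    Q[s, a] += alpha * (td_target - Q[s, a])     # move Q(s, a) toward the target
    return Q


def federated_average(local_Qs, local_visit_counts):
    """Visitation-weighted averaging across K agents (hypothetical sketch).

    Entries an agent visits more often get more weight, so heterogeneous
    local behavior policies can jointly cover the whole state-action space,
    illustrating the 'blessing of heterogeneity' referred to in the abstract.
    """
    counts = np.stack(local_visit_counts).astype(float)       # shape (K, S, A)
    weights = counts / np.clip(counts.sum(axis=0), 1e-12, None)
    return (weights * np.stack(local_Qs)).sum(axis=0)         # entrywise weighted average
```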