Bandit problems in networks: Asymptotically efficient distributed allocation rules

Soummya Kar; H. Vincent Poor; Shuguang Cui; IEEE

doi:10.1109/CDC.2011.6160719

Back

Conference proceeding

Bandit problems in networks: Asymptotically efficient distributed allocation rules

Soummya Kar, H. Vincent Poor, Shuguang Cui and IEEE

2011 50th IEEE Conference on Decision and Control and European Control Conference, pp.1771-1778

12/2011

DOI: https://doi.org/10.1109/CDC.2011.6160719

Abstract

Asymptotically Efficient

Collaboration

Decision making

Density measurement

Distributed Allocation Rules

Networked Bandit Problems

Partially Observable Rewards

Random variables

Resource management

Symmetric matrices

Vectors

This paper studies the multi-agent bandit problem in a distributed networked setting. The setting considered assumes only one bandit (the major bandit) has accessible reward information from its samples, whereas the rest (the minor bandits) have unobservable rewards. Under the assumption that the minor bandits are aware of the sampling pattern of the major bandit (but with no direct access to its rewards), a lower bound on the expected average network regret is obtained. The lower bound resembles the logarithmic optimal regret attained in single (classical) bandit problems, but in addition is shown to scale down with the number of agents. A collaborative and adaptive distributed allocation rule DA is proposed and is shown to achieve the lower bound on the expected average regret for a connected inter-bandit communication network. In particular, it is shown that under the DA allocation rule, the minor bandits attain sub-logarithmic expected regrets as opposed to logarithmic in the single agent setting.

Metrics

1 Record Views

Details

Title: Bandit problems in networks: Asymptotically efficient distributed allocation rules
Creators - without role: Soummya Kar - Princeton University
H. Vincent Poor - Princeton University
Shuguang Cui - Texas A&M University
IEEE
Publication Details: 2011 50th IEEE Conference on Decision and Control and European Control Conference, pp.1771-1778
Publisher: IEEE
Identifiers: 9935031708331
Academic Unit: King Abdulaziz University
Language: English
Resource Type: Conference proceeding