• Aucun résultat trouvé

Multi-Armed bandit Learning in Iot Networks (MALIN)

N/A
N/A
Protected

Academic year: 2021

Partager "Multi-Armed bandit Learning in Iot Networks (MALIN)"

Copied!
2
0
0

Texte intégral

(1)

HAL Id: hal-02013866

https://hal.inria.fr/hal-02013866

Submitted on 11 Feb 2019

HAL is a multi-disciplinary open access

archive for the deposit and dissemination of sci-entific research documents, whether they are pub-lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

Multi-Armed bandit Learning in Iot Networks (MALIN)

Remi Bonnefoi, Lilian Besson, Christophe Moy

To cite this version:

Remi Bonnefoi, Lilian Besson, Christophe Moy. Multi-Armed bandit Learning in Iot Networks (MA-LIN). ICT 2018 - 25th International Conference on Telecommunications, Jun 2018, Saint-Malo, France. �hal-02013866�

(2)

INSTITUT D’ÉLECTRONIQUE ET DE TÉLÉCOMMUNICATIONS DE RENNES

1

Multi-Armed bandit Learning in Iot Networks

(MALIN)

Rémi Bonnefoi1, Lilian Besson1 and Christophe Moy2

1 CentraleSupélec/IETR, F-35576, Cesson-Sévigné Cedex, France. Remi.Bonnefoi@CentraleSupelec.fr, Lilian.Besson@CentraleSupelec.fr

2 Univ Rennes, CNRS, IETR – UMR 6164, F-35000, Rennes, France. Christophe.Moy@Univ-Rennes1.fr

Goal

 With the advent of the Internet of Things (IoT), unlicensed band are going to be shared by a large number of devices with dissimilar caracteristics. In such context, solutions are required to allow the coexistence of devices and to avoid performance drop due to interference.

 In this demonstration, we show that reinforcement learning algorithms and in particular Multi-Armed Bandit algorithms can be used as a means of improving the performance of IoT communications.

Backgroung-traffic generator

Generates an interfering traffic that prevents the base station to correctly decode all the packets.

Gateway

Receives and decodes the packets sent by our intelligent devices in different channels.

Once a packet is successfully received, the gateway sends an acknowledgement in the channel used for the uplink packet

Intelligent devices

They want to send packets to the gateway. For that purpose, they can use the different channels available. The selection of the channel is done using a MAB learning algorithm in order to avoid collisions with other objects transmissions.

Each objet makes its own decision. They do not share any information. The decision is decentralized and uncoordinated.

MAB learning

 A user faces N choices (e.g. N channels)

 The channels provide him a reward with a given probability

How to identify the best channel?

 MAB learning algorithms are proved to be optimal to solve this problem.

 Any MAB algorithm can be used for channel selection (UCB, Thompson Sampling or others)

 With the UCB algorithm, the channel with the highest index is chosen for each transmission

P1 P2 P3 P4 P5 P1 P2 P3 P4 P5

[1] R. Bonnefoi, L. Besson, C. Moy, E. Kaufmann, J. Palicot, Multi-Armed Bandit Learning in IoT Networks: Learning Helps Even in Non-stationary Settings. In CrownCom 2017.

[2] S. Bubeck and N. Cesa-Bianchi, “Regret analysis of stochastic and nonstochastic multiarmed bandit problems,” Foundations and Trends® in Machine Learning, vol. 5, no. 1, pp. 1–122, 2012.

Acknowledgement

Traffic oberved by the gateway

L

J

J

#1 #2 #3 #4 Packet sent by an intelligent device Ack sent by the gateway Interfering traffic Channels

This work is supported by the European Union through the European Regional Development Fund (ERDF), and by Ministry of Higher Education and Research, Brittany and Rennes Métropole, through the CPER Project SOPHIE / STIC & Ondes. Part of this work is also funded by the ANR projects SOGREEN and BADASS, by Région Bretagne, France, by the French Ministry of Higher Education and ENS Paris Saclay.

Results

Références

Documents relatifs

Long slender aromatic white rice (basmati group) Cargo rice (round-grain cv Ballila) White rice (medium-grain cv Ariete) Cargo rice (indica cv Thaïbonnet) Paddy rice

The reporting duties of regulatory agencies allow the Member States and the political and administrative institutions of the Union to exert pressure on the agency.

After a first timid reference to democracy in the preamble of the Single Eu- ropean Act, the Treaty on European Union, as modified by the Amsterdam Treaty, confirms the attachment

ةحوللا IX ةباجتسلاا تءاج ةيلسانتلا لبق ام ةيموملاا ةيزمرلا تاذ ف ماش اهي ةل ةبكرمو GKob ريشت ةيسجرن ةيوثنأ ةيصوصخ وذ ءاملا ىلإ ريشي ىوتحمبو ثا

• Continuity: the data products from satellite, which have to be integrated into a global picture, must have assured a long-term continuity. • Gaps in observational

I n ranking the toxicity of materials by bioassay experiments the dependence of HCN generation on the conditions of combustion should also be evident..

It could, for example, ground debates on a democratic deficit in a sociological account of European public opinion and Europeanized behaviour in a regional integrat- ing economy; or

Puis, on propose des algorithmes pour la minimisation du regret dans les problèmes de bandit stochastique paramétrique dont les bras appartiennent à une certaine famille