S. Hämäläinen, H. Sanneck, and C. Sartori, LTE selforganising networks (SON): network management automation for operational efficiency, 2012.

R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, IEEE Transactions on Neural Networks, vol.9, issue.5, 1998.
DOI : 10.1109/TNN.1998.712192

P. Auer, N. Cesa-bianchi, and P. Fischer, Finitetime analysis of the multiarmed bandit problem, Machine Learning, vol.47, issue.2/3, pp.235-256, 2002.
DOI : 10.1023/A:1013689704352

S. Lohmüller, L. C. Schmelz, and S. Hahn, Adaptive SON management using KPI measurements, NOMS 2016, 2016 IEEE/IFIP Network Operations and Management Symposium, 2016.
DOI : 10.1109/NOMS.2016.7502868

S. Bubeck and N. Cesa-bianchi, Regret analysis of stochastic and non-stochastic multi-armed bandit problems, Machine Learning, pp.1-122, 2012.
DOI : 10.1561/2200000024

URL : http://arxiv.org/abs/1204.5721

A. Feki and V. Capdevielle, Autonomous resource allocation for dense LTE networks: A Multi Armed Bandit formulation, 2011 IEEE 22nd International Symposium on Personal, Indoor and Mobile Radio Communications, 2011.
DOI : 10.1109/PIMRC.2011.6140047

P. Coucheney, K. Khawam, and J. Cohen, Multiarmed bandit for distributed inter-cell interference coordination, IEEE International Conference on Communications (ICC), 2015.
DOI : 10.1109/icc.2015.7248837

URL : https://hal.archives-ouvertes.fr/hal-01218806

M. Lelarge, A. Proutiere, and M. S. Talebi, Spectrum bandit optimization, 2013 IEEE Information Theory Workshop (ITW), p.2013
DOI : 10.1109/ITW.2013.6691221

URL : https://hal.archives-ouvertes.fr/hal-00920063

S. Hahn, Classification of Cells Based on Mobile Network Context Information for the Management of SON Systems, 2015 IEEE 81st Vehicular Technology Conference (VTC Spring), p.2015
DOI : 10.1109/VTCSpring.2015.7145744