Pyramid Representations of the set of actions in reinforcement learning

TítuloPyramid Representations of the set of actions in reinforcement learning
AutoresR. Iglesias, V. Álvarez-Santos, M.A. Rodríguez, D. Santos Saavedra, C.V. Regueiro, X.M Pardo
TipoComunicación para congreso
Fonte International Work-Conference on the Interplay Between Natural and Artificial Computation, Elche (Spain), Springer, pp. 203-212 , 2015.
ISBN978-3-319-18832-4
ISSN0302-9743
DOI10.1007/978-3-319-18833-1_22
AbstractFuture robot systems will perform increasingly complex tasks in decreasingly well-structured and known environments. Robots will need to adapt their hardware and software, first only to foreseen, but ultimately to more complex changes of the environment. In this paper we describe a learning strategy based on reinforcement which allows fast robot learning from scratch using only its interaction with the environment, even when the reward is provided by a human observer and therefore is highly non-deterministic and noisy. To get this our proposal uses a novel representation of the action space together with an ensemble of learners able to forecast the time interval before a robot failure.
Palabras chaveReinforcement learning, Robotics, Ensembles, Learning and adaptation

Programas científicos