Catálogo de publicaciones - tesis

Compartir en
redes sociales


Título de Acceso Abierto

Contribución al estudio y el diseño de funciones de refuerzo

Juan Miguel Santos Hugo Daniel Scolnik Norbert Giambiasi

publishedVersion.

Resumen/Descripción – provisto por el repositorio digital
We propose a Reinforcement Function Design Process in two steps. The first one translates a natural language description into an instance of the Reinforcement Function General Expression. The second tunes parameters of constraints in this expression, so as to obtain the optimal definition of the function (relative to exploration). We separate the constraints according to the type of state variable estimator on which they act, in particular: position and velocity. Using a particular, but representative Reinforcement Function (RF) expression, we study the relation between the Sum of each reinforcement type and the RF parameters during the exploration phase of the learning. For linear relations, we propose an analytic method to obtain the RF parameters values (no experimentation requires). For non-linear, but monotonous relations, we propose the Update Parameter Algorithm (UPA) and show that UPA can efficiently adjust the proportion of negative and positive reinforcements received during the exploratory phase of the learning. Additionally, we study the feasibility and consequences of adapting the RF during the learning process so as to improve the learning convergence of the system. Dynamic-UPA allows the whole learning process to maintain a desired ratio of positive and negative rewards. Thus, we introduce an approach to solve the exploration-exploitation dilemma - a necessary step for efficient Reinforcement Learning. We illustrate, with several experiments involving robots (mobile and arm), the performance of the proposed design methods. Finally, we emphasize the main conclusions and present some future directions of research.
Palabras clave – provistas por el repositorio digital

No disponibles.

Disponibilidad
Institución detectada Año de publicación Navegá Descargá Solicitá
No requiere 1999 Biblioteca Digital (FCEN-UBA) (SNRD) acceso abierto

Información

Tipo de recurso:

tesis

Idiomas de la publicación

  • español castellano

País de edición

Argentina

Fecha de publicación

Información sobre licencias CC

https://creativecommons.org/licenses/by/2.5/ar/