Fenosoa Randrianjatovo

Social Foundations of Computation Intern Alumni

Reinforcement learning (RL) algorithms have so far been developed mainly for stationary environments and have difficulty adapting when the system dynamics or the reward function changes. Posterior sampling and Thompson sampling were identified early on as efficient approaches in RL, in part due to their inherent randomisation. While some algorithms, such as UCRL, have recently been adapted to non-stationary environments, no randomised counterpart has been proposed yet. In our project, we aim to propose a randomised, and more practical, algorithm that builds on posterior sampling and is capable of achieving sublinear regret. We will start by studying a (possibly context-dependent) bandit problem and then extend our findings to more complex RL models.
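To illustrate the general idea of posterior sampling in a changing environment (not the project's algorithm, just a common baseline sketch), the snippet below implements discounted Thompson sampling for a non-stationary Bernoulli bandit: Beta pseudo-counts are geometrically discounted so stale observations fade out and the posterior can track drifting arm means. The function name `discounted_thompson`, the discount factor `gamma`, and the toy `reward_fn` are all illustrative assumptions.

```python
import numpy as np


def discounted_thompson(n_arms, horizon, reward_fn, gamma=0.95, seed=0):
    """Discounted Thompson sampling for a non-stationary Bernoulli bandit."""
    rng = np.random.default_rng(seed)
    alpha = np.zeros(n_arms)  # discounted success counts
    beta = np.zeros(n_arms)   # discounted failure counts
    rewards = []
    for t in range(horizon):
        # Sample one plausible mean per arm from its Beta posterior;
        # the added Beta(1, 1) prior keeps rarely played arms exploratory.
        theta = rng.beta(alpha + 1.0, beta + 1.0)
        arm = int(np.argmax(theta))
        r = reward_fn(t, arm)  # Bernoulli reward in {0, 1}
        rewards.append(r)
        # Forget a little of the past, then credit the played arm.
        alpha *= gamma
        beta *= gamma
        alpha[arm] += r
        beta[arm] += 1 - r
    return np.array(rewards)


if __name__ == "__main__":
    # Toy abruptly-changing bandit: the best arm switches halfway through.
    rng_env = np.random.default_rng(1)

    def reward_fn(t, arm):
        means = (0.8, 0.3) if t < 2500 else (0.3, 0.8)
        return float(rng_env.random() < means[arm])

    total = discounted_thompson(n_arms=2, horizon=5000, reward_fn=reward_fn).sum()
    print(f"cumulative reward: {total:.0f} / 5000")
```

Because exploration comes from sampling the posterior rather than from explicit confidence bonuses (as in UCRL-style methods), the algorithm stays simple to run while still adapting after the change point.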