Reinforcement-learning for sampling design in Markov random fields

Archive ouverte

Bonneau, Mathieu | Dubois Peyrard, Nathalie | Sabbadin, Régis

Edité par CCSD -

Optimal sampling in spatial random fields is a complex problem, which mobilizes several research fields in spatial statistics and artificial intelligence. We consider the case where observations are discrete-valued and modelled by a Markov Random Field. Then we encode the sampling problem into the Markov Decision Process (MDP) framework. After exploring existing heuristic solutions as well as classical algorithms from the field of Reinforcement Learning (RL), we design an original algorithm, LSDP (Least Square Dynamic Programming), which uses simulated trajectories to solve approximately any finite-horizon MDP problem. Based on an empirical study of the behaviour of these different approaches on binary models, we derive the following conclusions: i) a naive heuristic, consisting in sampling sites where marginals are the most uncertain, is already an efficient sampling approach. ii) LSDP outperforms all the classical RL approaches we have tested. iii) LSDP outperforms the heuristic in cases when reconstruction errors have a high cost, or sampling actions are constrained. In addition, LSDP readily handles action costs in the optimisation problem, as well as cases when some sites of the MRF can not be observed.

Suggestions

Du même auteur

Solving adaptive sampling problems in graphical models using Markov decision process

Archive ouverte | Bonneau, Mathieu | CCSD

In environmental management problems, decision should ideally rely on knowledge of the whole system. However, due to limited budget, in practice only a small part of the system is sampled and the complete system state is reconstru...

A graph-based Markov decision process framework applied to the optimization of strategies for integrated management of diseases

Archive ouverte | Dubois Peyrard, Nathalie | CCSD

absent

Modèles de réseaux d'interactions écologiques : une revue brève et incomplète

Archive ouverte | Dubois Peyrard, Nathalie | CCSD

Modèles de réseaux d'interactions écologiques : une revue brève et incomplète. Les 5e Journées du GRET

Chargement des enrichissements...