Pierre-Henri Wuillemin - Academia.edu (original) (raw)

Papers by Pierre-Henri Wuillemin

Research paper thumbnail of Numéro Spécial Réseaux Bayésiens et Modèles Probabilistes

Le Centre pour la Communication Scientifique Directe - HAL - Diderot, Feb 28, 2018

Research paper thumbnail of Learning Continuous High-Dimensional Models using Mutual Information and Copula Bayesian Networks

Proceedings of the AAAI Conference on Artificial Intelligence

We propose a new framework to learn non-parametric graphical models from continuous observational... more We propose a new framework to learn non-parametric graphical models from continuous observational data. Our method is based on concepts from information theory in order to discover independences and causality between variables: the conditional and multivariate mutual information (such as \cite{verny2017learning} for discrete models). To estimate these quantities, we propose non-parametric estimators relying on the Bernstein copula and that are constructed by exploiting the relation between the mutual information and the copula entropy \cite{ma2011mutual, belalia2017testing}. To our knowledge, this relation is only documented for the bivariate case and, for the need of our algorithms, is here extended to the conditional and multivariate mutual information. This framework leads to a new algorithm to learn continuous non-parametric Bayesian network. Moreover, we use this estimator to speed up the BIC algorithm proposed in \cite{elidan2010copula} by taking advantage of the decomposition...

Research paper thumbnail of Les réseaux Bayésiens dynamiques pour intégrer les connaissances relatives aux procédés alimentaires (application au processus d'affinage)

Le Centre pour la Communication Scientifique Directe - HAL - Diderot, Oct 1, 2009

National audienc

Research paper thumbnail of A Process Reverse Engineering Approach Using Process and Observation Ontology and Probabilistic Relational Models: Application to Processing of Bio-composites for Food Packaging

Communications in computer and information science, 2022

Research paper thumbnail of Réseaux probabilistes et Arbres de Décisions Binaires en Diagnostic

Le Centre pour la Communication Scientifique Directe - HAL - Université de Nantes, 2000

Research paper thumbnail of Towards Interactive Causal Relation Discovery Driven by an Ontology

In complex domains, users need to be able to find causal relations between the different attribut... more In complex domains, users need to be able to find causal relations between the different attributes that compose them. Our work offers a way to help users to uncover such relations by combining the representativity of ontologies and the flexibility of probabilistic relational models, and provides them with an interactive and iterative process in order to validate or modify the obtained results.

Research paper thumbnail of Étude comparée des inférences dans les Modèles Relationnels Probabilistes

La representation de connaissances incertaines est un probleme important dans le domaine de l&#39... more La representation de connaissances incertaines est un probleme important dans le domaine de l'intelligence artificielle. Les reseaux bayesiens proposent une solution interessante pour de nombreuses raisons tant theoriques que pratiques, mais ont le desavantage d'etre difficiles a creer autant qu'a maintenir. En reponse a ces problematiques, de nouveaux formalismes ont emerges, tels les Probablistic Relational Models. Ces nouveaux modeles sont tous aptes a utiliser les algorithmes d'inference developpes pour les reseaux bayesiens. Cependant ils permettent un passage a l'echelle rendant leurs utilisations tres couteuses et sous-optimales en regard de l'exploitation des informations structurelles de ces modeles. Dans cet article, nous proposons de tirer profit de notre cadre logiciel dans lequel coexistent plusieurs algorithmes d'inference dedies ou non aux PRMs afin de mener une comparaison experimentale du comportement de ces differents algorithmes.

Research paper thumbnail of A Bayesian MultiCriteria Approach for an Optimal Process Control

Research paper thumbnail of aGrUM/pyAgrum : a toolbox to build models and algorithms for Probabilistic Graphical Models in Python

This paper presents the aGrUM framework, a LGPL C++ library providing state-of-the-art implementa... more This paper presents the aGrUM framework, a LGPL C++ library providing state-of-the-art implementations of graphical models for decision making, including Bayesian Networks, Markov Networks (Markov random fields), Influence Diagrams, Credal Networks, Probabilistic Relational Models. The framework also contains a wrapper, pyAgrum for exploiting aGrUM in Python. This framework is the result of an ongoing effort to build an efficient and well maintained open source cross-platform software, running on Linux, MacOS X and Windows, for dealing with graphical models and for providing essential components to build new algorithms for graphical models.

Research paper thumbnail of Un algorithme d'arbre de jonction incrémental

Cet article s'interesse a l'inference probabiliste dans les systemes dynamiques multi-cib... more Cet article s'interesse a l'inference probabiliste dans les systemes dynamiques multi-cibles. Dans un contexte ou les cibles et/ou les observations peuvent evoluer, la plupart des algorithmes ne se servent pas completement de l'incrementalite du systeme pour optimiser les calculs. Cela induit des calculs inutiles et peut augmenter considerablement les temps d'in-ference, particulierement pour les systemes de grande taille. Pour pallier ce probleme, nous proposons un nouvel algorithme d'arbre de jonction utilisant une technique par envoi de messages. Etant donne une nouvelle requete, cet algorithme permet de minimiser les calculs en recalculant uniquement les messages differents des calculs precedents. Nous montrons l'effica-cite de notre approche par des resultats experimentaux. ABSTRACT. This article addresses the question of probabilistic inference in multi-target dynamic systems. In a context where targets and/or evidence may evolve, most of the algorithms...

Research paper thumbnail of An early guidance system for a general knowledge-based aiding framework using probabilistic interventions

A common decision problem repeats a lot of time with the same kind of alternatives and the same s... more A common decision problem repeats a lot of time with the same kind of alternatives and the same set of criteria, but with a different decision case in each occurrence. The objective of early guidance in this kind of problem is to facilitate the selection of a subset of satisfactory alternatives for each new decision case, without asking the user any knowledge of the problem. This article proposes an early guidance system based on a model of knowledge of the common decision problem. It first presents the construction of a Bayesian network for a common decision problem to embed the knowledge in the aiding framework. Second, the concept of intervention proposed by Pearl is extended to prob-abilistic interventions for a single variable and for a set of variables. Finally the early guidance procedure is presented on the basis of the Bayesian network and using a proba-bilistic intervention to set a decision case even though it is partially observed.

Research paper thumbnail of Reinforcing the Object-Oriented Aspect of Probabilistic Relational Models

Representing uncertainty in knowledge is a common issue in Artificial Intelligence. Bayesian Netw... more Representing uncertainty in knowledge is a common issue in Artificial Intelligence. Bayesian Networks have been one of the main models used in this field of research. The simplicity of their specification is one of the reason for their success, both in industrial and in theoretical domains. The widespread use of Bayesian Networks brings new challenges in the design and use of large- scale systems, where this very simplicity causes a lack of expressiveness and scalability. To fill this gap, an increasing number of languages emerged as extensions of Bayesian Networks with many approaches: first-order logic, object-oriented, entity-relation, and so on. In this paper we focus on Probabilistic Relational Models, an object-oriented extension. However, Probabilistic Relational Models do not fully exploit the object-oriented paradigm, in particular they lack class inheritance. Using Object-Oriented Bayesian Networks as a basis, we propose to lightly extend PRMs framework resulting in strong...

Research paper thumbnail of Elicitation of conditional probabilities in a Bayesian network using a ranking process and expert's beliefs fusion

Research paper thumbnail of Steps towards model-based, inference-driven SOA Testing

This paper reports the progress of a project whose goal is to develop a model-driven, contract-ba... more This paper reports the progress of a project whose goal is to develop a model-driven, contract-based SOA testing environment in which grey-box test strategies for large services architectures are driven by stochastic inference. In order to achieve this goal, a Bayesian network is built directly by mapping and model transformation from the services architecture models (MDA approach). As services architectures are complex and large systems, the generated BN and its associated probability tables may also be very large. Nevertheless, the structure of the services architecture, of its design and test models suggests that techniques of compilation of the inference tasks will allow using the BN as a sustainable tool for driving the troubleshooting strategy for large scale systems.

Research paper thumbnail of Discovering Prerequisite Structure of Skills through Probabilistic Association Rules Mining

Estimating the prerequisite structure of skills is a crucial issue in domain modeling. Students u... more Estimating the prerequisite structure of skills is a crucial issue in domain modeling. Students usually learn skills in sequence since the preliminary skills need to be learned prior to the complex skills. The prerequisite relations between skills underlie the design of learning sequence and adaptation strategies for tutoring systems. The prerequisite structures of skills are usually studied by human experts, but they are seldom tested empirically. Due to plenty of educational data available, in this paper, we intend to discover the prerequisite structure of skills from student performance data. However, it is a challenging task since skills are latent variables. Uncertainty exists in inferring student knowledge of skills from performance data. Probabilistic Association Rules Mining proposed by Sun et al. (2010) is a novel technique to discover association rules from uncertain data. In this paper, we preprocess student performance data by an evidence model. Then the probabilistic kn...

Research paper thumbnail of Modélisation de réseaux bayésiens de très grandes tailles

La representation de connaissances incertaines est un probleme important dans le domaine de l&#39... more La representation de connaissances incertaines est un probleme important dans le domaine de l'intelligence artificielle. Les reseaux bayesiens proposent une solution interessante pour de nombreuses raisons theoriques et pratiques mais ont le desavantage d'etre difficiles a creer autant qu'a maintenir. Dans cet article, nous comparons trois formalismes qui tentent de repondre a ces problemes en enrichissant les reseaux bayesiens avec des notions issues des paradigmes orientes objets et de la logique du premier ordre. Nous verrons que ces propositions fournissent de nouveaux outils de modelisation des reseaux bayesiens, mais posent de nouveaux problemes propres aux reseaux de grande taille.

Research paper thumbnail of Using ROBDDs for troubleshooting

When using Bayesian networks for modelling the behavior of man-made machinery, it usu­ally happen... more When using Bayesian networks for modelling the behavior of man-made machinery, it usu­ally happens that a large part of the model is deterministic. For such Bayesian networks the deterministic part of the model can be represented as a Boolean function, and a cen­tral part of belief updating reduces to the task of calculating the number of satisfying configurations in a Boolean function. In this paper we explore how advances in the calculation of Boolean functions can be adopted for belief updating, in particular within the context of troubleshooting. We present ex­perimental results indicating a substantial speed-up compared to traditional junction tree propagation.

Research paper thumbnail of The Role of Instrumental Variables in Causal Inference Based on Independence of Cause and Mechanism

Entropy, 2021

Causal inference methods based on conditional independence construct Markov equivalent graphs and... more Causal inference methods based on conditional independence construct Markov equivalent graphs and cannot be applied to bivariate cases. The approaches based on independence of cause and mechanism state, on the contrary, that causal discovery can be inferred for two observations. In our contribution, we pose a challenge to reconcile these two research directions. We study the role of latent variables such as latent instrumental variables and hidden common causes in the causal graphical structures. We show that methods based on the independence of cause and mechanism indirectly contain traces of the existence of the hidden instrumental variables. We derive a novel algorithm to infer causal relationships between two variables, and we validate the proposed method on simulated data and on a benchmark of cause-effect pairs. We illustrate by our experiments that the proposed approach is simple and extremely competitive in terms of empirical accuracy compared to the state-of-the-art methods.

Research paper thumbnail of An Efficient Low-Rank Tensors Representation for Algorithms in Complex Probabilistic Graphical Models

Probabilistic Graphical Models form a class of compact representations of high-dimensional probab... more Probabilistic Graphical Models form a class of compact representations of high-dimensional probability distributions by decomposing these distributions in a set of multivariate factors (potentials). Every exact algorithm (for probabilistic inference, MAP, etc.) operates on a specific representation of these potentials. However complex probabilistic models often lead to very large potentials which dramatically impact both the space and time complexities of these algorithms and which can make inference in complex models intractable. In this paper we propose a new approach based on low-rank tensor representation to approximate and operate with these potentials. The low-rank tensor representation is used for the approximation of potentials with controlled precision and an important reduction in the number of parameters. Every operator used in such algorithms (multiplication, addition, projection, etc.) can be defined within this representation, leading to an approximation framework wher...

Research paper thumbnail of Learning and Selection of Dynamic Bayesian Networks for Non-Stationary Processes in Real Time

Research paper thumbnail of Numéro Spécial Réseaux Bayésiens et Modèles Probabilistes

Le Centre pour la Communication Scientifique Directe - HAL - Diderot, Feb 28, 2018

Research paper thumbnail of Learning Continuous High-Dimensional Models using Mutual Information and Copula Bayesian Networks

Proceedings of the AAAI Conference on Artificial Intelligence

We propose a new framework to learn non-parametric graphical models from continuous observational... more We propose a new framework to learn non-parametric graphical models from continuous observational data. Our method is based on concepts from information theory in order to discover independences and causality between variables: the conditional and multivariate mutual information (such as \cite{verny2017learning} for discrete models). To estimate these quantities, we propose non-parametric estimators relying on the Bernstein copula and that are constructed by exploiting the relation between the mutual information and the copula entropy \cite{ma2011mutual, belalia2017testing}. To our knowledge, this relation is only documented for the bivariate case and, for the need of our algorithms, is here extended to the conditional and multivariate mutual information. This framework leads to a new algorithm to learn continuous non-parametric Bayesian network. Moreover, we use this estimator to speed up the BIC algorithm proposed in \cite{elidan2010copula} by taking advantage of the decomposition...

Research paper thumbnail of Les réseaux Bayésiens dynamiques pour intégrer les connaissances relatives aux procédés alimentaires (application au processus d'affinage)

Le Centre pour la Communication Scientifique Directe - HAL - Diderot, Oct 1, 2009

National audienc

Research paper thumbnail of A Process Reverse Engineering Approach Using Process and Observation Ontology and Probabilistic Relational Models: Application to Processing of Bio-composites for Food Packaging

Communications in computer and information science, 2022

Research paper thumbnail of Réseaux probabilistes et Arbres de Décisions Binaires en Diagnostic

Le Centre pour la Communication Scientifique Directe - HAL - Université de Nantes, 2000

Research paper thumbnail of Towards Interactive Causal Relation Discovery Driven by an Ontology

In complex domains, users need to be able to find causal relations between the different attribut... more In complex domains, users need to be able to find causal relations between the different attributes that compose them. Our work offers a way to help users to uncover such relations by combining the representativity of ontologies and the flexibility of probabilistic relational models, and provides them with an interactive and iterative process in order to validate or modify the obtained results.

Research paper thumbnail of Étude comparée des inférences dans les Modèles Relationnels Probabilistes

La representation de connaissances incertaines est un probleme important dans le domaine de l&#39... more La representation de connaissances incertaines est un probleme important dans le domaine de l'intelligence artificielle. Les reseaux bayesiens proposent une solution interessante pour de nombreuses raisons tant theoriques que pratiques, mais ont le desavantage d'etre difficiles a creer autant qu'a maintenir. En reponse a ces problematiques, de nouveaux formalismes ont emerges, tels les Probablistic Relational Models. Ces nouveaux modeles sont tous aptes a utiliser les algorithmes d'inference developpes pour les reseaux bayesiens. Cependant ils permettent un passage a l'echelle rendant leurs utilisations tres couteuses et sous-optimales en regard de l'exploitation des informations structurelles de ces modeles. Dans cet article, nous proposons de tirer profit de notre cadre logiciel dans lequel coexistent plusieurs algorithmes d'inference dedies ou non aux PRMs afin de mener une comparaison experimentale du comportement de ces differents algorithmes.

Research paper thumbnail of A Bayesian MultiCriteria Approach for an Optimal Process Control

Research paper thumbnail of aGrUM/pyAgrum : a toolbox to build models and algorithms for Probabilistic Graphical Models in Python

This paper presents the aGrUM framework, a LGPL C++ library providing state-of-the-art implementa... more This paper presents the aGrUM framework, a LGPL C++ library providing state-of-the-art implementations of graphical models for decision making, including Bayesian Networks, Markov Networks (Markov random fields), Influence Diagrams, Credal Networks, Probabilistic Relational Models. The framework also contains a wrapper, pyAgrum for exploiting aGrUM in Python. This framework is the result of an ongoing effort to build an efficient and well maintained open source cross-platform software, running on Linux, MacOS X and Windows, for dealing with graphical models and for providing essential components to build new algorithms for graphical models.

Research paper thumbnail of Un algorithme d'arbre de jonction incrémental

Cet article s'interesse a l'inference probabiliste dans les systemes dynamiques multi-cib... more Cet article s'interesse a l'inference probabiliste dans les systemes dynamiques multi-cibles. Dans un contexte ou les cibles et/ou les observations peuvent evoluer, la plupart des algorithmes ne se servent pas completement de l'incrementalite du systeme pour optimiser les calculs. Cela induit des calculs inutiles et peut augmenter considerablement les temps d'in-ference, particulierement pour les systemes de grande taille. Pour pallier ce probleme, nous proposons un nouvel algorithme d'arbre de jonction utilisant une technique par envoi de messages. Etant donne une nouvelle requete, cet algorithme permet de minimiser les calculs en recalculant uniquement les messages differents des calculs precedents. Nous montrons l'effica-cite de notre approche par des resultats experimentaux. ABSTRACT. This article addresses the question of probabilistic inference in multi-target dynamic systems. In a context where targets and/or evidence may evolve, most of the algorithms...

Research paper thumbnail of An early guidance system for a general knowledge-based aiding framework using probabilistic interventions

A common decision problem repeats a lot of time with the same kind of alternatives and the same s... more A common decision problem repeats a lot of time with the same kind of alternatives and the same set of criteria, but with a different decision case in each occurrence. The objective of early guidance in this kind of problem is to facilitate the selection of a subset of satisfactory alternatives for each new decision case, without asking the user any knowledge of the problem. This article proposes an early guidance system based on a model of knowledge of the common decision problem. It first presents the construction of a Bayesian network for a common decision problem to embed the knowledge in the aiding framework. Second, the concept of intervention proposed by Pearl is extended to prob-abilistic interventions for a single variable and for a set of variables. Finally the early guidance procedure is presented on the basis of the Bayesian network and using a proba-bilistic intervention to set a decision case even though it is partially observed.

Research paper thumbnail of Reinforcing the Object-Oriented Aspect of Probabilistic Relational Models

Representing uncertainty in knowledge is a common issue in Artificial Intelligence. Bayesian Netw... more Representing uncertainty in knowledge is a common issue in Artificial Intelligence. Bayesian Networks have been one of the main models used in this field of research. The simplicity of their specification is one of the reason for their success, both in industrial and in theoretical domains. The widespread use of Bayesian Networks brings new challenges in the design and use of large- scale systems, where this very simplicity causes a lack of expressiveness and scalability. To fill this gap, an increasing number of languages emerged as extensions of Bayesian Networks with many approaches: first-order logic, object-oriented, entity-relation, and so on. In this paper we focus on Probabilistic Relational Models, an object-oriented extension. However, Probabilistic Relational Models do not fully exploit the object-oriented paradigm, in particular they lack class inheritance. Using Object-Oriented Bayesian Networks as a basis, we propose to lightly extend PRMs framework resulting in strong...

Research paper thumbnail of Elicitation of conditional probabilities in a Bayesian network using a ranking process and expert's beliefs fusion

Research paper thumbnail of Steps towards model-based, inference-driven SOA Testing

This paper reports the progress of a project whose goal is to develop a model-driven, contract-ba... more This paper reports the progress of a project whose goal is to develop a model-driven, contract-based SOA testing environment in which grey-box test strategies for large services architectures are driven by stochastic inference. In order to achieve this goal, a Bayesian network is built directly by mapping and model transformation from the services architecture models (MDA approach). As services architectures are complex and large systems, the generated BN and its associated probability tables may also be very large. Nevertheless, the structure of the services architecture, of its design and test models suggests that techniques of compilation of the inference tasks will allow using the BN as a sustainable tool for driving the troubleshooting strategy for large scale systems.

Research paper thumbnail of Discovering Prerequisite Structure of Skills through Probabilistic Association Rules Mining

Estimating the prerequisite structure of skills is a crucial issue in domain modeling. Students u... more Estimating the prerequisite structure of skills is a crucial issue in domain modeling. Students usually learn skills in sequence since the preliminary skills need to be learned prior to the complex skills. The prerequisite relations between skills underlie the design of learning sequence and adaptation strategies for tutoring systems. The prerequisite structures of skills are usually studied by human experts, but they are seldom tested empirically. Due to plenty of educational data available, in this paper, we intend to discover the prerequisite structure of skills from student performance data. However, it is a challenging task since skills are latent variables. Uncertainty exists in inferring student knowledge of skills from performance data. Probabilistic Association Rules Mining proposed by Sun et al. (2010) is a novel technique to discover association rules from uncertain data. In this paper, we preprocess student performance data by an evidence model. Then the probabilistic kn...

Research paper thumbnail of Modélisation de réseaux bayésiens de très grandes tailles

La representation de connaissances incertaines est un probleme important dans le domaine de l&#39... more La representation de connaissances incertaines est un probleme important dans le domaine de l'intelligence artificielle. Les reseaux bayesiens proposent une solution interessante pour de nombreuses raisons theoriques et pratiques mais ont le desavantage d'etre difficiles a creer autant qu'a maintenir. Dans cet article, nous comparons trois formalismes qui tentent de repondre a ces problemes en enrichissant les reseaux bayesiens avec des notions issues des paradigmes orientes objets et de la logique du premier ordre. Nous verrons que ces propositions fournissent de nouveaux outils de modelisation des reseaux bayesiens, mais posent de nouveaux problemes propres aux reseaux de grande taille.

Research paper thumbnail of Using ROBDDs for troubleshooting

When using Bayesian networks for modelling the behavior of man-made machinery, it usu­ally happen... more When using Bayesian networks for modelling the behavior of man-made machinery, it usu­ally happens that a large part of the model is deterministic. For such Bayesian networks the deterministic part of the model can be represented as a Boolean function, and a cen­tral part of belief updating reduces to the task of calculating the number of satisfying configurations in a Boolean function. In this paper we explore how advances in the calculation of Boolean functions can be adopted for belief updating, in particular within the context of troubleshooting. We present ex­perimental results indicating a substantial speed-up compared to traditional junction tree propagation.

Research paper thumbnail of The Role of Instrumental Variables in Causal Inference Based on Independence of Cause and Mechanism

Entropy, 2021

Causal inference methods based on conditional independence construct Markov equivalent graphs and... more Causal inference methods based on conditional independence construct Markov equivalent graphs and cannot be applied to bivariate cases. The approaches based on independence of cause and mechanism state, on the contrary, that causal discovery can be inferred for two observations. In our contribution, we pose a challenge to reconcile these two research directions. We study the role of latent variables such as latent instrumental variables and hidden common causes in the causal graphical structures. We show that methods based on the independence of cause and mechanism indirectly contain traces of the existence of the hidden instrumental variables. We derive a novel algorithm to infer causal relationships between two variables, and we validate the proposed method on simulated data and on a benchmark of cause-effect pairs. We illustrate by our experiments that the proposed approach is simple and extremely competitive in terms of empirical accuracy compared to the state-of-the-art methods.

Research paper thumbnail of An Efficient Low-Rank Tensors Representation for Algorithms in Complex Probabilistic Graphical Models

Probabilistic Graphical Models form a class of compact representations of high-dimensional probab... more Probabilistic Graphical Models form a class of compact representations of high-dimensional probability distributions by decomposing these distributions in a set of multivariate factors (potentials). Every exact algorithm (for probabilistic inference, MAP, etc.) operates on a specific representation of these potentials. However complex probabilistic models often lead to very large potentials which dramatically impact both the space and time complexities of these algorithms and which can make inference in complex models intractable. In this paper we propose a new approach based on low-rank tensor representation to approximate and operate with these potentials. The low-rank tensor representation is used for the approximation of potentials with controlled precision and an important reduction in the number of parameters. Every operator used in such algorithms (multiplication, addition, projection, etc.) can be defined within this representation, leading to an approximation framework wher...

Research paper thumbnail of Learning and Selection of Dynamic Bayesian Networks for Non-Stationary Processes in Real Time