An Efficient Search Strategy for Aggregation and Discretization of Attributes of Bayesian Networks Using Minimum Description Length

04/03/2014
by   Jem Corcoran, et al.
0

Bayesian networks are convenient graphical expressions for high dimensional probability distributions representing complex relationships between a large number of random variables. They have been employed extensively in areas such as bioinformatics, artificial intelligence, diagnosis, and risk management. The recovery of the structure of a network from data is of prime importance for the purposes of modeling, analysis, and prediction. Most recovery algorithms in the literature assume either discrete of continuous but Gaussian data. For general continuous data, discretization is usually employed but often destroys the very structure one is out to recover. Friedman and Goldszmidt suggest an approach based on the minimum description length principle that chooses a discretization which preserves the information in the original data set, however it is one which is difficult, if not impossible, to implement for even moderately sized networks. In this paper we provide an extremely efficient search strategy which allows one to use the Friedman and Goldszmidt discretization in practice.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/03/2022

Structure Learning for Hybrid Bayesian Networks

Bayesian networks have been used as a mechanism to represent the joint d...
research
10/13/2017

A simple data discretizer

Data discretization is an important step in the process of machine learn...
research
01/30/2013

A Multivariate Discretization Method for Learning Bayesian Networks from Mixed Data

In this paper we address the problem of discretization in the context of...
research
07/02/2021

Prequential MDL for Causal Structure Learning with Neural Networks

Learning the structure of Bayesian networks and causal relationships fro...
research
01/16/2013

Mix-nets: Factored Mixtures of Gaussians in Bayesian Networks With Mixed Continuous And Discrete Variables

Recently developed techniques have made it possible to quickly learn acc...
research
10/20/2018

Renormalized Normalized Maximum Likelihood and Three-Part Code Criteria For Learning Gaussian Networks

Score based learning (SBL) is a promising approach for learning Bayesian...
research
07/17/2022

Minimum Description Length Control

We propose a novel framework for multitask reinforcement learning based ...

Please sign up or login with your details

Forgot password? Click here to reset