We present an independence-based method for learning Bayesian network (BN) structure without making any assumptions on the probability distribution of the domain. This is mainly useful for continuous domains. Even mixed continuous-categorical domains and structures containing vectorial variables can be handled. We address the problem by developing a non-parametric conditional independence test based on the so-called kernel dependence measure, which can be readily used by any existing independence-based BN structure learning algorithm. We demonstrate the structure learning of graphical models in continuous and mixed domains from real-world data without distributional assumptions. We also experimentally show that our test is a good alternative, in particular in case of small sample sizes, compared to existing tests, which can only be used in purely categorical or continuous domains.
Author(s): | Sun, X. |
Book Title: | ECML PKDD 2008 |
Journal: | Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2008 |
Pages: | 423-439 |
Year: | 2008 |
Month: | September |
Day: | 0 |
Editors: | Daelemans, W. , B. Goethals, K. Morik |
Publisher: | Springer |
Bibtex Type: | Conference Paper (inproceedings) |
Address: | Berlin, Germany |
DOI: | 10.1007/978-3-540-87481-2_28 |
Event Name: | European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases |
Event Place: | Antwerpen, Belgium |
Digital: | 0 |
Electronic Archiving: | grant_archive |
Language: | en |
Organization: | Max-Planck-Gesellschaft |
School: | Biologische Kybernetik |
Links: |
BibTex
@inproceedings{5253, title = {Distribution-free Learning of Bayesian Network Structure}, journal = {Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2008}, booktitle = {ECML PKDD 2008}, abstract = {We present an independence-based method for learning Bayesian network (BN) structure without making any assumptions on the probability distribution of the domain. This is mainly useful for continuous domains. Even mixed continuous-categorical domains and structures containing vectorial variables can be handled. We address the problem by developing a non-parametric conditional independence test based on the so-called kernel dependence measure, which can be readily used by any existing independence-based BN structure learning algorithm. We demonstrate the structure learning of graphical models in continuous and mixed domains from real-world data without distributional assumptions. We also experimentally show that our test is a good alternative, in particular in case of small sample sizes, compared to existing tests, which can only be used in purely categorical or continuous domains.}, pages = {423-439}, editors = {Daelemans, W. , B. Goethals, K. Morik}, publisher = {Springer}, organization = {Max-Planck-Gesellschaft}, school = {Biologische Kybernetik}, address = {Berlin, Germany}, month = sep, year = {2008}, slug = {5253}, author = {Sun, X.}, month_numeric = {9} }