PEnBayes: A Multi-Layered Ensemble Approach for Learning Bayesian Network Structure from Big Data

dc.contributor.authorTang, Yan
dc.contributor.authorWang, Jianwu
dc.contributor.authorNguyen, Mai
dc.contributor.authorAltintas, Ilkay
dc.date.accessioned2024-02-13T15:45:29Z
dc.date.available2024-02-13T15:45:29Z
dc.date.issued2019-10-11
dc.description.abstractDiscovering the Bayesian network (BN) structure from big datasets containing rich causal relationships is becoming increasingly valuable for modeling and reasoning under uncertainties in many areas with big data gathered from sensors due to high volume and fast veracity. Most of the current BN structure learning algorithms have shortcomings facing big data. First, learning a BN structure from the entire big dataset is an expensive task which often ends in failure due to memory constraints. Second, it is quite difficult to select a learner from numerous BN structure learning algorithms to consistently achieve good learning accuracy. Lastly, there is a lack of an intelligent method that merges separately learned BN structures into a well structured BN network. To address these shortcomings, we introduce a novel parallel learning approach called PEnBayes (Parallel Ensemble-based Bayesian network learning). PEnBayes starts with an adaptive data preprocessing phase that calculates the Appropriate Learning Size and intelligently divides a big dataset for fast distributed local structure learning. Then, PEnBayes learns a collection of local BN Structures in parallel using a two-layered weighted adjacent matrix-based structure ensemble method. Lastly, PEnBayes merges the local BN Structures into a global network structure using the structure ensemble method at the global layer. For the experiment, we generate big data sets by simulating sensor data from patient monitoring, transportation, and disease diagnosis domains. The Experimental results show that PEnBayes achieves a significantly improved execution performance with more consistent and stable results compared with three baseline learning algorithms.
dc.description.sponsorshipThe work was supported by Key Technologies Research and Development Program of China (2017YFC0405805-04). The funding body mainly support the method design and the data analysis of the study.
dc.description.urihttps://www.mdpi.com/1424-8220/19/20/4400
dc.format.extent27 pages
dc.genrejournal articles
dc.identifierdoi:10.13016/m2ofze-u8yd
dc.identifier.citationTang, Yan, Jianwu Wang, Mai Nguyen, and Ilkay Altintas. 2019. "PEnBayes: A Multi-Layered Ensemble Approach for Learning Bayesian Network Structure from Big Data" Sensors 19, no. 20: 4400. https://doi.org/10.3390/s19204400
dc.identifier.urihttps://doi.org/10.3390/s19204400
dc.identifier.urihttp://hdl.handle.net/11603/31607
dc.language.isoen_US
dc.publisherMDPI
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Information Systems Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Center for Accelerated Real Time Analysis
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department
dc.relation.ispartofUMBC Data Science
dc.relation.ispartofUMBC Joint Center for Earth Systems Technology (JCET)
dc.relation.ispartofUMBC Center for Real-time Distributed Sensing and Autonomy
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.rightsAttribution 4.0 International (CC BY 4.0 DEED)en
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subjectUMBC Big Data Analytics Lab
dc.titlePEnBayes: A Multi-Layered Ensemble Approach for Learning Bayesian Network Structure from Big Data
dc.typeText
dcterms.creatorhttps://orcid.org/0000-0002-9933-1170

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
sensors-19-04400-v2.pdf
Size:
3.57 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: