A Bayesian inference method for the analysis of transcriptional regulatory networks in metagenomic data

dc.contributor.authorHobbs, Elizabeth T.
dc.contributor.authorPereira, Talmo
dc.contributor.authorO’Neill, Patrick K.
dc.contributor.authorErill, Ivan
dc.date.accessioned2021-03-08T17:14:01Z
dc.date.available2021-03-08T17:14:01Z
dc.date.issued2016-07-08
dc.description.abstractBackground Metagenomics enables the analysis of bacterial population composition and the study of emergent population features, such as shared metabolic pathways. Recently, we have shown that metagenomics datasets can be leveraged to characterize population-wide transcriptional regulatory networks, or meta-regulons, providing insights into how bacterial populations respond collectively to specific triggers. Here we formalize a Bayesian inference framework to analyze the composition of transcriptional regulatory networks in metagenomes by determining the probability of regulation of orthologous gene sequences. We assess the performance of this approach on synthetic datasets and we validate it by analyzing the copper-homeostasis network of Firmicutes species in the human gut microbiome. Results Assessment on synthetic datasets shows that our method provides a robust and interpretable metric for assessing putative regulation by a transcription factor on sets of promoter sequences mapping to an orthologous gene cluster. The inference framework integrates the regulatory contribution of secondary sites and can discern false positives arising from multiple instances of a clonal sequence. Posterior probabilities for orthologous gene clusters decline sharply when less than 20 % of mapped promoters have binding sites, but we introduce a sensitivity adjustment procedure to speed up computation that enhances regulation assessment in heterogeneous ortholog clusters. Analysis of the copper-homeostasis regulon governed by CsoR in the human gut microbiome Firmicutes reveals that CsoR controls itself and copper-translocating P-type ATPases, but not CopZ-type copper chaperones. Our analysis also indicates that CsoR frequently targets promoters with dual CsoR-binding sites, suggesting that it exploits higher-order binding conformations to fine-tune its activity. Conclusions We introduce and validate a method for the analysis of transcriptional regulatory networks from metagenomic data that enables inference of meta-regulons in a systematic and interpretable way. Validation of this method on the CsoR meta-regulon of gut microbiome Firmicutes illustrates the usefulness of the approach, revealing novel properties of the copper-homeostasis network in poorly characterized bacterial species and putting forward evidence of new mechanisms of DNA binding for this transcriptional regulator. Our approach will enable the comparative analysis of regulatory networks across metagenomes, yielding novel insights into the evolution of transcriptional regulatory networks.en_US
dc.description.sponsorshipThe authors wish to thank David Nicholson and Joseph Cornish for their contribution to earlier versions of the metagenomic analysis pipeline. This work was funded by the US National Science Foundation Division of Molecular and Cellular Biosciences award MCB-1158056, by the UMBC Office of Research through a Special Research Assistantship/Initiative Support (SRAIS) award and by the UMBC Office of Undergraduate Research through an Undergraduate Research Award (TP). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.en_US
dc.description.urihttps://almob.biomedcentral.com/articles/10.1186/s13015-016-0082-8en_US
dc.format.extent11 pagesen_US
dc.genrejournal articlesen_US
dc.identifierdoi:10.13016/m2dvhg-ysek
dc.identifier.citationHobbs, E.T., Pereira, T., O’Neill, P.K. et al. A Bayesian inference method for the analysis of transcriptional regulatory networks in metagenomic data. Algorithms Mol Biol 11, 19 (2016). https://doi.org/10.1186/s13015-016-0082-8en_US
dc.identifier.urihttps://doi.org/10.1186/s13015-016-0082-8
dc.identifier.urihttp://hdl.handle.net/11603/21087
dc.language.isoen_USen_US
dc.publisherBMCen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Biological Sciences Department Collection
dc.relation.ispartofUMBC Faculty Collection
dc.relation.ispartofUMBC Student Collection
dc.rightsThis item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author.
dc.rightsAttribution 4.0 International*
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/*
dc.titleA Bayesian inference method for the analysis of transcriptional regulatory networks in metagenomic dataen_US
dc.typeTexten_US

Files

Original bundle

Now showing 1 - 5 of 6
Loading...
Thumbnail Image
Name:
s13015-016-0082-8.pdf
Size:
1.27 MB
Format:
Adobe Portable Document Format
Description:
Loading...
Thumbnail Image
Name:
13015_2016_82_MOESM2_ESM.csv
Size:
1.42 MB
Format:
Unknown data format
Description:
Loading...
Thumbnail Image
Name:
13015_2016_82_MOESM5_ESM.csv
Size:
6.77 KB
Format:
Unknown data format
Description:
Loading...
Thumbnail Image
Name:
13015_2016_82_MOESM7_ESM.csv
Size:
285.21 KB
Format:
Unknown data format
Description:
Loading...
Thumbnail Image
Name:
13015_2016_82_MOESM8_ESM.csv
Size:
964 B
Format:
Unknown data format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.56 KB
Format:
Item-specific license agreed upon to submission
Description: