Please use this identifier to cite or link to this item: https://doi.org/10.1371/journal.pcbi.1000352
DC FieldValue
dc.titleStatistical Methods for Detecting Differentially Abundant Features in Clinical Metagenomic Samples
dc.contributor.authorWhite J.R.
dc.contributor.authorNagarajan N.
dc.contributor.authorPop M.
dc.date.accessioned2019-11-06T09:36:09Z
dc.date.available2019-11-06T09:36:09Z
dc.date.issued2009
dc.identifier.citationWhite J.R., Nagarajan N., Pop M. (2009). Statistical Methods for Detecting Differentially Abundant Features in Clinical Metagenomic Samples. PLoS Computational Biology 5 (4). ScholarBank@NUS Repository. https://doi.org/10.1371/journal.pcbi.1000352
dc.identifier.issn1553734X
dc.identifier.urihttps://scholarbank.nus.edu.sg/handle/10635/161676
dc.description.abstractNumerous studies are currently underway to characterize the microbial communities inhabiting our world. These studies aim to dramatically expand our understanding of the microbial biosphere and, more importantly, hope to reveal the secrets of the complex symbiotic relationship between us and our commensal bacterial microflora. An important prerequisite for such discoveries are computational tools that are able to rapidly and accurately compare large datasets generated from complex bacterial communities to identify features that distinguish them. We present a statistical method for comparing clinical metagenomic samples from two treatment populations on the basis of count data (e.g. as obtained through sequencing) to detect differentially abundant features. Our method, Metastats, employs the false discovery rate to improve specificity in high-complexity environments, and separately handles sparsely-sampled features using Fisher's exact test. Under a variety of simulations, we show that Metastats performs well compared to previously used methods, and significantly outperforms other methods for features with sparse counts. We demonstrate the utility of our method on several datasets including a 16S rRNA survey of obese and lean human gut microbiomes, COG functional profiles of infant and mature gut microbiomes, and bacterial and viral metabolic subsystem data inferred from random sequencing of 85 metagenomes. The application of our method to the obesity dataset reveals differences between obese and lean subjects not reported in the original study. For the COG and subsystem datasets, we provide the first statistically rigorous assessment of the differences between these populations. The methods described in this paper are the first to address clinical metagenomic datasets comprising samples from multiple subjects. Our methods are robust across datasets of varied complexity and sampling level. While designed for metagenomic applications, our software can also be applied to digital gene expression studies (e.g. SAGE). A web server implementation of our methods and freely available source code can be found at http://metastats.cbcb.umd.edu/.
dc.rightsAttribution 4.0 International
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.sourceUnpaywall 20191101
dc.subjectRNA 16S
dc.subjectbacterial DNA
dc.subjectarticle
dc.subjectbacterium
dc.subjectcomputer program
dc.subjectFisher exact test
dc.subjectgene expression
dc.subjectgene sequence
dc.subjecthuman
dc.subjecthuman experiment
dc.subjectinfant
dc.subjectintermethod comparison
dc.subjectintestine
dc.subjectmetabolism
dc.subjectmetagenomics
dc.subjectmicrobiome
dc.subjectnormal human
dc.subjectsimulation
dc.subjectstatistical analysis
dc.subjectvirus
dc.subjectbacterium
dc.subjectchromosome map
dc.subjectclassification
dc.subjectgene expression profiling
dc.subjectgenetics
dc.subjectintestine
dc.subjectisolation and purification
dc.subjectmethodology
dc.subjectmicrobiology
dc.subjectobesity
dc.subjectBacteria (microorganisms)
dc.subjectBacteria
dc.subjectChromosome Mapping
dc.subjectDNA, Bacterial
dc.subjectGene Expression Profiling
dc.subjectHumans
dc.subjectIntestines
dc.subjectObesity
dc.typeArticle
dc.contributor.departmentDEPARTMENT OF COMPUTER SCIENCE
dc.contributor.departmentMEDICINE
dc.description.doi10.1371/journal.pcbi.1000352
dc.description.sourcetitlePLoS Computational Biology
dc.description.volume5
dc.description.issue4
Appears in Collections:Elements
Staff Publications

Show simple item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
10_1371_journal_pcbi_1000352.pdf701.46 kBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check

Altmetric


This item is licensed under a Creative Commons License Creative Commons