Please use this identifier to cite or link to this item: https://doi.org/10.1186/1471-2105-11-S1-S12
Title: Short clones or long clones? A simulation study on the use of paired reads in metagenomics
Authors: Mitra, S
Schubach, M
Huson, D.H 
Keywords: Analysis softwares
Environmental sample
Illumina
Metagenomes
Metagenomics
Process sequencing
Short reads
Simulation studies
Computer simulation
Bioinformatics
Cloning
article
computer program
DNA sequence
genetic database
genome
metagenomics
methodology
nucleotide sequence
phylogeny
sequence alignment
Base Sequence
Databases, Genetic
Genome
Metagenomics
Phylogeny
Sequence Alignment
Sequence Analysis, DNA
Software
Issue Date: 2010
Citation: Mitra, S, Schubach, M, Huson, D.H (2010). Short clones or long clones? A simulation study on the use of paired reads in metagenomics. BMC Bioinformatics 11 (SUPPLL.1) : S12. ScholarBank@NUS Repository. https://doi.org/10.1186/1471-2105-11-S1-S12
Rights: Attribution 4.0 International
Abstract: Background: Metagenomics is the study of environmental samples using sequencing. Rapid advances in sequencing technology are fueling a vast increase in the number and scope of metagenomics projects. Most metagenome sequencing projects so far have been based on Sanger or Roche-454 sequencing, as only these technologies provide long enough reads, while Illumina sequencing has not been considered suitable for metagenomic studies due to a short read length of only 35 bp. However, now that reads of length 75 bp can be sequenced in pairs, Illumina sequencing has become a viable option for metagenome studies.Results: This paper addresses the problem of taxonomical analysis of paired reads. We describe a new feature of our metagenome analysis software MEGAN that allows one to process sequencing reads in pairs and makes assignments of such reads based on the combined bit scores of their matches to reference sequences. Using this new software in a simulation study, we investigate the use of Illumina paired-sequencing in taxonomical analysis and compare the performance of single reads, short clones and long clones. In addition, we also compare against simulated Roche-454 sequencing runs.Conclusion: This work shows that paired reads perform better than single reads, as expected, but also, perhaps slightly less obviously, that long clones allow more specific assignments than short ones. A new version of the program MEGAN that explicitly takes paired reads into account is available from our website. © 2010 Mitra et al; licensee BioMed Central Ltd.
Source Title: BMC Bioinformatics
URI: https://scholarbank.nus.edu.sg/handle/10635/181684
ISSN: 14712105
DOI: 10.1186/1471-2105-11-S1-S12
Rights: Attribution 4.0 International
Appears in Collections:Elements
Staff Publications

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
10_1186_1471-2105-11-S1-S12.pdf2.17 MBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check

Altmetric


This item is licensed under a Creative Commons License Creative Commons