Please use this identifier to cite or link to this item: https://doi.org/10.1186/1471-2105-7-390
DC FieldValue
dc.titlePET-tool: A software suite for comprehensive processing and managing of Paired-End diTag (PET) sequence data
dc.contributor.authorChiu, K.P.
dc.contributor.authorWong, C.-H.
dc.contributor.authorChen, Q.
dc.contributor.authorAriyaratne, P.
dc.contributor.authorOoi, H.S.
dc.contributor.authorWei, C.-L.
dc.contributor.authorSung, W.-K.K.
dc.contributor.authorRuan, Y.
dc.date.accessioned2013-07-04T07:29:08Z
dc.date.available2013-07-04T07:29:08Z
dc.date.issued2006
dc.identifier.citationChiu, K.P., Wong, C.-H., Chen, Q., Ariyaratne, P., Ooi, H.S., Wei, C.-L., Sung, W.-K.K., Ruan, Y. (2006). PET-tool: A software suite for comprehensive processing and managing of Paired-End diTag (PET) sequence data. BMC Bioinformatics 7. ScholarBank@NUS Repository. https://doi.org/10.1186/1471-2105-7-390
dc.identifier.issn14712105
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/38887
dc.description.abstractBackground: We recently developed the Paired End diTag (PET) strategy for efficient characterization of mammalian transcriptomes and genomes. The paired end nature of short PET sequences derived from long DNA fragments raised a new set of bioinformatics challenges, including how to extract PETs from raw sequence reads, and correctly yet efficiently map PETs to reference genome sequences. To accommodate and streamline data analysis of the large volume PET sequences generated from each PET experiment, an automated PET data process pipeline is desirable. Results: We designed an integrated computation program package, PET-Tool, to automatically process PET sequences and map them to the genome sequences. The Tool was implemented as a web-based application composed of four modules: the Extractor module for PET extraction; the Examiner module for analytic evaluation of PET sequence quality; the Mapper module for locating PET sequences in the genome sequences; and the ProjectManager module for data organization. The performance of PET-Tool was evaluated through the analyses of 2.7 million PET sequences. It was demonstrated that PET-Tool is accurate and efficient in extracting PET sequences and removing artifacts from large volume dataset. Using optimized mapping criteria, over 70% of quality PET sequences were mapped specifically to the genome sequences. With a 2.4 GHz LINUX machine, it takes approximately six hours to process one million PETs from extraction to mapping. Conclusion: The speed, accuracy, and comprehensiveness have proved that PET-Tool is an important and useful component in PET experiments, and can be extended to accommodate other related analyses of paired-end sequences. The Tool also provides user-friendly functions for data quality check and system for multi-layer data management. © 2006 Chiu et al; licensee BioMed Central Ltd.
dc.description.urihttp://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1186/1471-2105-7-390
dc.sourceScopus
dc.typeArticle
dc.contributor.departmentCOMPUTER SCIENCE
dc.description.doi10.1186/1471-2105-7-390
dc.description.sourcetitleBMC Bioinformatics
dc.description.volume7
dc.description.codenBBMIC
dc.identifier.isiut000240684500001
Appears in Collections:Staff Publications
Elements

Show simple item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
2006-pet_tool_software_suite_comprehensive-published.pdf4.2 MBAdobe PDF

OPEN

PublishedView/Download

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.