Please use this identifier to cite or link to this item: https://doi.org/10.1186/1471-2105-7-390
Title: PET-tool: A software suite for comprehensive processing and managing of Paired-End diTag (PET) sequence data
Authors: Chiu, K.P.
Wong, C.-H.
Chen, Q. 
Ariyaratne, P.
Ooi, H.S.
Wei, C.-L.
Sung, W.-K.K. 
Ruan, Y.
Issue Date: 2006
Source: Chiu, K.P., Wong, C.-H., Chen, Q., Ariyaratne, P., Ooi, H.S., Wei, C.-L., Sung, W.-K.K., Ruan, Y. (2006). PET-tool: A software suite for comprehensive processing and managing of Paired-End diTag (PET) sequence data. BMC Bioinformatics 7. ScholarBank@NUS Repository. https://doi.org/10.1186/1471-2105-7-390
Abstract: Background: We recently developed the Paired End diTag (PET) strategy for efficient characterization of mammalian transcriptomes and genomes. The paired end nature of short PET sequences derived from long DNA fragments raised a new set of bioinformatics challenges, including how to extract PETs from raw sequence reads, and correctly yet efficiently map PETs to reference genome sequences. To accommodate and streamline data analysis of the large volume PET sequences generated from each PET experiment, an automated PET data process pipeline is desirable. Results: We designed an integrated computation program package, PET-Tool, to automatically process PET sequences and map them to the genome sequences. The Tool was implemented as a web-based application composed of four modules: the Extractor module for PET extraction; the Examiner module for analytic evaluation of PET sequence quality; the Mapper module for locating PET sequences in the genome sequences; and the ProjectManager module for data organization. The performance of PET-Tool was evaluated through the analyses of 2.7 million PET sequences. It was demonstrated that PET-Tool is accurate and efficient in extracting PET sequences and removing artifacts from large volume dataset. Using optimized mapping criteria, over 70% of quality PET sequences were mapped specifically to the genome sequences. With a 2.4 GHz LINUX machine, it takes approximately six hours to process one million PETs from extraction to mapping. Conclusion: The speed, accuracy, and comprehensiveness have proved that PET-Tool is an important and useful component in PET experiments, and can be extended to accommodate other related analyses of paired-end sequences. The Tool also provides user-friendly functions for data quality check and system for multi-layer data management. © 2006 Chiu et al; licensee BioMed Central Ltd.
Source Title: BMC Bioinformatics
URI: http://scholarbank.nus.edu.sg/handle/10635/38887
ISSN: 14712105
DOI: 10.1186/1471-2105-7-390
Appears in Collections:Staff Publications

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
2006-pet_tool_software_suite_comprehensive-published.pdf4.2 MBAdobe PDF

OPEN

PublishedView/Download

SCOPUSTM   
Citations

24
checked on Dec 7, 2017

WEB OF SCIENCETM
Citations

16
checked on Nov 29, 2017

Page view(s)

51
checked on Dec 18, 2017

Download(s)

2
checked on Dec 18, 2017

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.