Please use this identifier to cite or link to this item: http://scholarbank.nus.edu.sg/handle/10635/69947
Title: Digital signal processing for potential promoter prediction
Authors: Zhang, X.
Kassim, A. 
Bajic, V.B.
Issue Date: 2004
Source: Zhang, X.,Kassim, A.,Bajic, V.B. (2004). Digital signal processing for potential promoter prediction. 2004 IEEE International Workshop on Biomedical Circuits and Systems : S2.7.INV-16. ScholarBank@NUS Repository.
Abstract: We evaluate the suitability of three domain transforms, DFT, DCT and DWT for recognition of human promoter sequences. We use genomic segments covering [-512,+512] relative to transcription start sites (TSSs), and also non-promoter sequences of the same length. We used a total of 14,001 promoter sequences with TSS locations determined based on experimental transcript data. Sequences were extracted from the human genome using PromoSer and FIE2 tools. The non-promoter set has the same number of sequences. We used the total count of mono-, di- and tri-nucleotides in the sequences, as well as the coefficients of domain transforms. The promoters and non-promoters were divided into 22 disjoint groups based on their GC-content. Feature selection procedures were separately applied to the data for each group and we opted to use 30 best ranked features. In each group, the data is first divided into training and test sets after random ordering of positive and negative data before it is further divided into two sets. Linear discriminant analysis is used to predict sequences as promoter (positive) and non-promoter (negative) ones. Three general observations can be made based on the experiments performed: i) the ability to recognize promoters degrades with the reduction of GC-content, ii) there are no significant differences in the prediction performance when any transform is used, and iii) the best performance was achieved by combining all three transforms. We show that the use of domain transforms in predicting human promoters is promising and thus should be combined with predictions of biological features for even better performance results. © 2004 IEEE.
Source Title: 2004 IEEE International Workshop on Biomedical Circuits and Systems
URI: http://scholarbank.nus.edu.sg/handle/10635/69947
ISBN: 0780386655
Appears in Collections:Staff Publications

Show full item record
Files in This Item:
There are no files associated with this item.

Page view(s)

26
checked on Dec 9, 2017

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.