Strategies for identifying statistically significant dense regions in microarray data

Please use this identifier to cite or link to this item: https://doi.org/10.1109/TCBB.2007.1022

DC Field	Value
dc.title	Strategies for identifying statistically significant dense regions in microarray data
dc.contributor.author	Yip, A.M.
dc.contributor.author	Ng, M.K.
dc.contributor.author	Wu, E.H.
dc.contributor.author	Chan, T.F.
dc.date.accessioned	2014-10-28T02:46:30Z
dc.date.available	2014-10-28T02:46:30Z
dc.date.issued	2007-07
dc.identifier.citation	Yip, A.M., Ng, M.K., Wu, E.H., Chan, T.F. (2007-07). Strategies for identifying statistically significant dense regions in microarray data. IEEE/ACM Transactions on Computational Biology and Bioinformatics 4 (3) : 415-428. ScholarBank@NUS Repository. https://doi.org/10.1109/TCBB.2007.1022
dc.identifier.issn	15455963
dc.identifier.uri	http://scholarbank.nus.edu.sg/handle/10635/104202
dc.description.abstract	We propose and study the notion of dense regions for the analysis of categorized gene expression data and present some searching algorithms for discovering them. The algorithms can be applied to any categorical data matrices derived from gene expression level matrices. We demonstrate that dense regions are simple but useful and statistically significant patterns that can be used to 1) identify genes and/or samples of interest and 2) eliminate genes and/or samples corresponding to outliers, noise, or abnormalities. Some theoretical studies on the properties of the dense regions are presented which allow us to characterize dense regions into several classes and to derive tailor-made algorithms for different classes of regions. Moreover, an empirical simulation study on the distribution of the size of dense regions is carried out which is then used to assess the significance of dense regions and to derive effective pruning methods to speed up the searching algorithms. Real microarray data sets are employed to test our methods. Comparisons with six other well-known clustering algorithms using synthetic and real data are also conducted which confirm the superiority of our methods in discovering dense regions. The DRIFT code and a tutorial are available as supplemental material, which can be found on the Computer Society Digital Library at http://computer.org/tcbb/archives.htm. © 2007 IEEE.
dc.description.uri	http://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1109/TCBB.2007.1022
dc.source	Scopus
dc.subject	Bicluster
dc.subject	Categorical data
dc.subject	Clustering
dc.subject	Coexpressed genes
dc.subject	Dense region
dc.subject	Gene expression
dc.subject	Microarray
dc.type	Article
dc.contributor.department	MATHEMATICS
dc.description.doi	10.1109/TCBB.2007.1022
dc.description.sourcetitle	IEEE/ACM Transactions on Computational Biology and Bioinformatics
dc.description.volume	4
dc.description.issue	3
dc.description.page	415-428
dc.identifier.isiut	000248414700008
Appears in Collections:	Staff Publications

Files in This Item:

There are no files associated with this item.
