Please use this identifier to cite or link to this item: https://doi.org/10.1007/s10867-006-2120-0
DC FieldValue
dc.titleSuper paramagnetic clustering of DNA sequences
dc.contributor.authorRadjiman, S.
dc.contributor.authorLianyi, H.
dc.contributor.authorJian-Sheng, W.
dc.contributor.authorChen, Y.Z.
dc.date.accessioned2014-10-16T09:43:09Z
dc.date.available2014-10-16T09:43:09Z
dc.date.issued2006-01
dc.identifier.citationRadjiman, S., Lianyi, H., Jian-Sheng, W., Chen, Y.Z. (2006-01). Super paramagnetic clustering of DNA sequences. Journal of Biological Physics 32 (1) : 11-25. ScholarBank@NUS Repository. https://doi.org/10.1007/s10867-006-2120-0
dc.identifier.issn00920606
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/98111
dc.description.abstractAn unsupervised clustering of 4541 DNA sequences containing active promoter regions from vertebrate and arthropod classes (including their viral genes) was performed. All necessary information was solely gathered a priori from the DNA sequences by measuring frequencies of tri-nucleotides and tetra-nucleotides. We employed Super Paramagnetic Clustering, a novel clustering algorithm based on physical properties of an inhomogeneous granular ferromagnet. This method utilizes Swendsen-Wang cluster Monte Carlo simulations to distinguish clusters by measuring pairs of correlation functions from different resolutions. We identified two strongly separated clusters of human viral genes corresponding to the Epstein-Barr virus and the Herpes Simplex virus type 1. In addition, vertebrate and arthropod sequences were successfully separated into two different classes with merely 9.25% of arthropod sequences being misclassified. From a functional perspective, these sequences have high gene function correlations with sequences from the vertebrate cluster. By tuning a clustering parameter, Super Paramagnetic Clustering was able to classify vertebrate class further into two major clusters, from where a large number of housekeeping genes and tissue-specific genes were found respectively. The indications came from observation of gene expression function and consensus transcription factors which were found grouped together in specific positions of the DNA sequences. © Springer 2006.
dc.description.urihttp://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1007/s10867-006-2120-0
dc.sourceScopus
dc.subjectCluster algorithm
dc.subjectData clustering
dc.subjectDNA sequence
dc.subjectPromoters
dc.subjectStatistical physics
dc.subjectTranscription factor binding sites
dc.typeArticle
dc.contributor.departmentPHARMACY
dc.contributor.departmentPHYSICS
dc.description.doi10.1007/s10867-006-2120-0
dc.description.sourcetitleJournal of Biological Physics
dc.description.volume32
dc.description.issue1
dc.description.page11-25
dc.description.codenJBPHB
dc.identifier.isiut000236515200003
Appears in Collections:Staff Publications

Show simple item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.