Please use this identifier to cite or link to this item: https://doi.org/10.1371/journal.pone.0162293
Title: Identifying multi-dimensional co-clusters in tensors based on hyperplane detection in singular vector spaces
Authors: Zhao H.
Wang D.D. 
Chen L.
Liu X.
Yan H.
Keywords: decomposition
embryo cell
experimental model
gene expression
gene ontology
nonhuman
reliability
yeast
algorithm
animal
Caenorhabditis elegans
cell cycle
cluster analysis
data mining
diseases
fungal gene
gene
genetics
growth, development and aging
human
information processing
molecular genetics
Saccharomyces cerevisiae
statistics and numerical data
Algorithms
Animals
Caenorhabditis elegans
Cell Cycle
Cluster Analysis
Data Mining
Datasets as Topic
Disease
Gene Expression
Gene Ontology
Genes, Fungal
Genes, Helminth
Humans
Molecular Sequence Annotation
Saccharomyces cerevisiae
Issue Date: 2016
Citation: Zhao H., Wang D.D., Chen L., Liu X., Yan H. (2016). Identifying multi-dimensional co-clusters in tensors based on hyperplane detection in singular vector spaces. PLoS ONE 11 (9) : e0162293. ScholarBank@NUS Repository. https://doi.org/10.1371/journal.pone.0162293
Rights: Attribution 4.0 International
Abstract: Co-clustering, often called biclustering for two-dimensional data, has found many applications, such as gene expression data analysis and text mining. Nowadays, a variety of multidimensional arrays (tensors) frequently occur in data analysis tasks, and co-clustering techniques play a key role in dealing with such datasets. Co-clusters represent coherent patterns and exhibit important properties along all the modes. Development of robust coclustering techniques is important for the detection and analysis of these patterns. In this paper, a co-clustering method based on hyperplane detection in singular vector spaces (HDSVS) is proposed. Specifically in this method, higher-order singular value decomposition (HOSVD) transforms a tensor into a core part and a singular vector matrix along each mode, whose row vectors can be clustered by a linear grouping algorithm (LGA). Meanwhile, hyperplanar patterns are extracted and successfully supported the identification of multi-dimensional co-clusters. To validate HDSVS, a number of synthetic and biological tensors were adopted. The synthetic tensors attested a favorable performance of this algorithm on noisy or overlapped data. Experiments with gene expression data and lineage data of embryonic cells further verified the reliability of HDSVS to practical problems. Moreover, the detected co-clusters are well consistent with important genetic pathways and gene ontology annotations. Finally, a series of comparisons between HDSVS and state-of-the-art methods on synthetic tensors and a yeast gene expression tensor were implemented, verifying the robust and stable performance of our method. © 2016 Zhao et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Source Title: PLoS ONE
URI: https://scholarbank.nus.edu.sg/handle/10635/161553
ISSN: 19326203
DOI: 10.1371/journal.pone.0162293
Rights: Attribution 4.0 International
Appears in Collections:Staff Publications
Elements

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
10_1371_journal_pone_0162293.pdf5.83 MBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check

Altmetric


This item is licensed under a Creative Commons License Creative Commons