Please use this identifier to cite or link to this item: https://doi.org/10.1186/s12859-016-0939-3
Title: A two-layer integration framework for protein complex detection
Authors: Ou-Yang, L
Wu, M
Zhang, X.-F
Dai, D.-Q
Li, X.-L 
Yan, H
Keywords: Cell signaling
Clustering algorithms
Cobalt compounds
Complexation
Computational methods
Data integration
Matrix algebra
Molecular biology
Proteins
Purification
Stochastic models
Stochastic systems
Co complexes
Doubly stochastic matrix
Integration frameworks
Integration techniques
Matrix decomposition
Protein complexes
Protein interaction
Tandem affinity purification
Complex networks
consensus
decomposition
experimental model
mass spectrometry
model
noise
prediction
screening
stochastic model
human
metabolism
procedures
protein analysis
proteomics
protein
Humans
Protein Interaction Mapping
Proteins
Proteomics
Issue Date: 2016
Citation: Ou-Yang, L, Wu, M, Zhang, X.-F, Dai, D.-Q, Li, X.-L, Yan, H (2016). A two-layer integration framework for protein complex detection. BMC Bioinformatics 17 (1) : 100. ScholarBank@NUS Repository. https://doi.org/10.1186/s12859-016-0939-3
Rights: Attribution 4.0 International
Abstract: Background: Protein complexes carry out nearly all signaling and functional processes within cells. The study of protein complexes is an effective strategy to analyze cellular functions and biological processes. With the increasing availability of proteomics data, various computational methods have recently been developed to predict protein complexes. However, different computational methods are based on their own assumptions and designed to work on different data sources, and various biological screening methods have their unique experiment conditions, and are often different in scale and noise level. Therefore, a single computational method on a specific data source is generally not able to generate comprehensive and reliable prediction results. Results: In this paper, we develop a novel Two-layer INtegrative Complex Detection (TINCD) model to detect protein complexes, leveraging the information from both clustering results and raw data sources. In particular, we first integrate various clustering results to construct consensus matrices for proteins to measure their overall co-complex propensity. Second, we combine these consensus matrices with the co-complex score matrix derived from Tandem Affinity Purification/Mass Spectrometry (TAP) data and obtain an integrated co-complex similarity network via an unsupervised metric fusion method. Finally, a novel graph regularized doubly stochastic matrix decomposition model is proposed to detect overlapping protein complexes from the integrated similarity network. Conclusions: Extensive experimental results demonstrate that TINCD performs much better than 21 state-of-the-art complex detection techniques, including ensemble clustering and data integration techniques. © 2016 Ou-Yang et al.
Source Title: BMC Bioinformatics
URI: https://scholarbank.nus.edu.sg/handle/10635/181388
ISSN: 14712105
DOI: 10.1186/s12859-016-0939-3
Rights: Attribution 4.0 International
Appears in Collections:Elements
Staff Publications

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
10_1186_s12859-016-0939-3.pdf1.81 MBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check

Altmetric


This item is licensed under a Creative Commons License Creative Commons