Please use this identifier to cite or link to this item:
https://doi.org/10.1371/journal.pone.0077940
Title: | SIFT Indel: Predictions for the Functional Effects of Amino Acid Insertions/Deletions in Proteins | Authors: | Hu J. Ng P.C. |
Keywords: | amino acid protein amino acid substitution area under the curve article classifier correlation coefficient decision tree frameshift mutation gene construct genetic algorithm genetic variability human indel mutation measurement accuracy prediction protein function protein structure sensitivity and specificity SIFT Indel validation study algorithm amino acid sequence biological model genetics indel mutation Algorithms Amino Acid Sequence Area Under Curve Humans INDEL Mutation Models, Genetic Proteins Sensitivity and Specificity |
Issue Date: | 2013 | Citation: | Hu J., Ng P.C. (2013). SIFT Indel: Predictions for the Functional Effects of Amino Acid Insertions/Deletions in Proteins. PLoS ONE 8 (10) : e77940. ScholarBank@NUS Repository. https://doi.org/10.1371/journal.pone.0077940 | Rights: | Attribution 4.0 International | Abstract: | Indels in the coding regions of a gene can either cause frameshifts or amino acid insertions/deletions. Frameshifting indels are indels that have a length that is not divisible by 3 and subsequently cause frameshifts. Indels that have a length divisible by 3 cause amino acid insertions/deletions or block substitutions; we call these 3n indels. The new amino acid changes resulting from 3n indels could potentially affect protein function. Therefore, we construct a SIFT Indel prediction algorithm for 3n indels which achieves 82% accuracy, 81% sensitivity, 82% specificity, 82% precision, 0.63 MCC, and 0.87 AUC by 10-fold cross-validation. We have previously published a prediction algorithm for frameshifting indels. The rules for the prediction of 3n indels are different from the rules for the prediction of frameshifting indels and reflect the biological differences of these two different types of variations. SIFT Indel was applied to human 3n indels from the 1000 Genomes Project and the Exome Sequencing Project. We found that common variants are less likely to be deleterious than rare variants. The SIFT indel prediction algorithm for 3n indels is available at http://sift-dna.org/. © 2013 Hu, Ng. | Source Title: | PLoS ONE | URI: | https://scholarbank.nus.edu.sg/handle/10635/161458 | ISSN: | 1932-6203 | DOI: | 10.1371/journal.pone.0077940 | Rights: | Attribution 4.0 International |
Appears in Collections: | Staff Publications Elements |
Show full item record
Files in This Item:
File | Description | Size | Format | Access Settings | Version | |
---|---|---|---|---|---|---|
10_1371_journal_pone_0077940.pdf | 842.97 kB | Adobe PDF | OPEN | Published | View/Download |
This item is licensed under a Creative Commons License