Please use this identifier to cite or link to this item: https://doi.org/10.1186/s12859-017-1470-x
DC FieldValue
dc.titleBATVI: Fast, sensitive and accurate detection of virus integrations
dc.contributor.authorTennakoon, C
dc.contributor.authorSung, W.K
dc.date.accessioned2020-10-27T10:29:41Z
dc.date.available2020-10-27T10:29:41Z
dc.date.issued2017
dc.identifier.citationTennakoon, C, Sung, W.K (2017). BATVI: Fast, sensitive and accurate detection of virus integrations. BMC Bioinformatics 18 : 71. ScholarBank@NUS Repository. https://doi.org/10.1186/s12859-017-1470-x
dc.identifier.issn14712105
dc.identifier.urihttps://scholarbank.nus.edu.sg/handle/10635/181296
dc.description.abstractBackground: The study of virus integrations in human genome is important since virus integrations were shown to be associated with diseases. In the literature, few methods have been proposed that predict virus integrations using next generation sequencing datasets. Although they work, they are slow and are not very sensitive. Results and discussion: This paper introduces a new method BatVI to predict viral integrations. Our method uses a fast screening method to filter out chimeric reads containing possible viral integrations. Next, sensitive alignments of these candidate chimeric reads are called by BLAST. Chimeric reads that are co-localized in the human genome are clustered. Finally, by assembling the chimeric reads in each cluster, high confident virus integration sites are extracted. Conclusion: We compared the performance of BatVI with existing methods VirusFinder and VirusSeq using both simulated and real-life datasets of liver cancer patients. BatVI ran an order of magnitude faster and was able to predict almost twice the number of true positives compared to other methods while maintaining a false positive rate less than 1%. For the liver cancer datasets, BatVI uncovered novel integrations to two important genes TERT and MLL4, which were missed by previous studies. Through gene expression data, we verified the correctness of these additional integrations. BatVI can be downloaded from http://biogpu.ddns.comp.nus.edu.sg/~ksung/batvi/index.html. © 2017 The Author(s).
dc.rightsAttribution 4.0 International
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.sourceUnpaywall 20201031
dc.subjectAlignment
dc.subjectDiagnosis
dc.subjectDiseases
dc.subjectForecasting
dc.subjectGene expression
dc.subjectGenes
dc.subjectDetection of virus
dc.subjectFalse positive rates
dc.subjectFast screening
dc.subjectGene Expression Data
dc.subjectLiver cancers
dc.subjectNext-generation sequencing
dc.subjectReal life datasets
dc.subjectTrue positive
dc.subjectViruses
dc.subjectDNA binding protein
dc.subjectMLL4 protein, human
dc.subjecttelomerase
dc.subjectTERT protein, human
dc.subjectvirus DNA
dc.subjectalgorithm
dc.subjectcluster analysis
dc.subjectDNA sequence
dc.subjectgenetics
dc.subjecthigh throughput sequencing
dc.subjecthost pathogen interaction
dc.subjecthuman
dc.subjecthuman genome
dc.subjectLiver Neoplasms
dc.subjectmetabolism
dc.subjectsoftware
dc.subjecttheoretical model
dc.subjectvirology
dc.subjectvirus DNA cell DNA interaction
dc.subjectAlgorithms
dc.subjectCluster Analysis
dc.subjectDNA, Viral
dc.subjectDNA-Binding Proteins
dc.subjectGenome, Human
dc.subjectHigh-Throughput Nucleotide Sequencing
dc.subjectHost-Pathogen Interactions
dc.subjectHumans
dc.subjectLiver Neoplasms
dc.subjectModels, Theoretical
dc.subjectSequence Analysis, DNA
dc.subjectSoftware
dc.subjectTelomerase
dc.subjectVirus Integration
dc.typeArticle
dc.contributor.departmentDEPARTMENT OF COMPUTER SCIENCE
dc.description.doi10.1186/s12859-017-1470-x
dc.description.sourcetitleBMC Bioinformatics
dc.description.volume18
dc.description.page71
Appears in Collections:Elements
Staff Publications

Show simple item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
10_1186_s12859-017-1470-x.pdf1.65 MBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check

Altmetric


This item is licensed under a Creative Commons License Creative Commons