Please use this identifier to cite or link to this item: https://doi.org/10.1093/nar/gkaa1237
Title: SurVirus: a repeat-aware virus integration caller
Authors: Rajaby, Ramesh 
Zhou, Yi
Meng, Yifan
Zeng, Xi
Li, Guoliang 
Wu, Peng
Sung, Wing-Kin 
Keywords: Science & Technology
Life Sciences & Biomedicine
Biochemistry & Molecular Biology
HEPATITIS-B-VIRUS
HUMAN-PAPILLOMAVIRUS
HBV INTEGRATION
PATTERNS
DNA
Issue Date: 14-Jan-2021
Publisher: OXFORD UNIV PRESS
Citation: Rajaby, Ramesh, Zhou, Yi, Meng, Yifan, Zeng, Xi, Li, Guoliang, Wu, Peng, Sung, Wing-Kin (2021-01-14). SurVirus: a repeat-aware virus integration caller. NUCLEIC ACIDS RESEARCH 49 (6). ScholarBank@NUS Repository. https://doi.org/10.1093/nar/gkaa1237
Abstract: A significant portion of human cancers are due to viruses integrating into human genomes. Therefore, accurately predicting virus integrations can help uncover the mechanisms that lead to many devastating diseases. Virus integrations can be called by analysing second generation high-throughput sequencing datasets. Unfortunately, existing methods fail to report a significant portion of integrations, while predicting a large number of false positives. We observe that the inaccuracy is caused by incorrect alignment of reads in repetitive regions. False alignments create false positives, while missing alignments create false negatives. This paper proposes SurVirus, an improved virus integration caller that corrects the alignment of reads which are crucial for the discovery of integrations. We use publicly available datasets to show that existing methods predict hundreds of thousands of false positives; SurVirus, on the other hand, is significantly more precise while it also detects many novel integrations previously missed by other tools, most of which are in repetitive regions. We validate a subset of these novel integrations, and find that the majority are correct. Using SurVirus, we find that HPV and HBV integrations are enriched in LINE and Satellite regions which had been overlooked, as well as discover recurrent HBV and HPV breakpoints in human genome-virus fusion transcripts.
Source Title: NUCLEIC ACIDS RESEARCH
URI: https://scholarbank.nus.edu.sg/handle/10635/226646
ISSN: 0305-1048
1362-4962
DOI: 10.1093/nar/gkaa1237
Appears in Collections:Staff Publications
Elements

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
SurVirus a repeat-aware virus integration caller.pdf1.57 MBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.