Please use this identifier to cite or link to this item: http://scholarbank.nus.edu.sg/handle/10635/118611
Title: Towards Handling Repeats in Genome Assembly
Authors: NARMADA SAMBATURU
Keywords: repeat region, repeat, assembly, scaffolding, nextera, genome assembly
Issue Date: 22-Aug-2014
Source: NARMADA SAMBATURU (2014-08-22). Towards Handling Repeats in Genome Assembly. ScholarBank@NUS Repository.
Abstract: Repeat regions have been shown to play a role in human-pathogen interactions, and their study could open up new treatment avenues. Since only small amounts of pathogen can be extracted from a patient, and waiting for the pathogen to multiply in the lab is impractical, a genomics pipeline which works with small quantities of cells and handles repeats is essential. Genome assemblers, however, tend to collapse all occurrences of a repeat into one contiguous sequence (contig). While ordering contigs, assemblers might interpret distant contigs as adjacent if they flank different occurrences of the same repeat. We develop an algorithm to link regions flanking a repeat given only picogram quantities of DNA. The algorithm exploits a 9bp overlap between adjacent fragments caused by the library preparation technique (Nextera). The algorithm was tested with an E.coli library prepared with 0.25pg of DNA, and was able to assemble the sequences bridging 26 repeats.
URI: http://scholarbank.nus.edu.sg/handle/10635/118611
Appears in Collections:Master's Theses (Open)

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
SambaturuN.pdf1.6 MBAdobe PDF

OPEN

NoneView/Download

Page view(s)

229
checked on Feb 24, 2018

Download(s)

117
checked on Feb 24, 2018

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.