Please use this identifier to cite or link to this item: https://doi.org/10.1371/journal.pgen.1000160
Title: Genetic variation in an individual human exome
Authors: Ng P.C. 
Levy S.
Huang J.
Stockwell T.B.
Walenz B.P.
Li K.
Axelrod N.
Busam D.A.
Strausberg R.L.
Venter J.C.
Keywords: amino acid
protein
article
bioinformatics
exon
gene
gene deletion
gene insertion
genetic variability
genome
genomics
human
protein function
single nucleotide polymorphism
comparative study
gene frequency
genetic disorder
genetics
male
metabolism
mutation
phenotype
single nucleotide polymorphism
Exons
Gene Frequency
Genetic Diseases, Inborn
Genome, Human
Humans
Male
Mutation
Phenotype
Polymorphism, Single Nucleotide
Proteins
Variation (Genetics)
Issue Date: 2008
Citation: Ng P.C., Levy S., Huang J., Stockwell T.B., Walenz B.P., Li K., Axelrod N., Busam D.A., Strausberg R.L., Venter J.C. (2008). Genetic variation in an individual human exome. PLoS Genetics 4 (8) : e1000160. ScholarBank@NUS Repository. https://doi.org/10.1371/journal.pgen.1000160
Rights: Attribution 4.0 International
Abstract: There is much interest in characterizing the variation in a human individual, because this may elucidate what contributes significantly to a person's phenotype, thereby enabling personalized genomics. We focus here on the variants in a person's 'exome,' which is the set of exons in a genome, because the exome is believed to harbor much of the functional variation. We provide an analysis of the ?12,500 variants that affect the protein coding portion of an individual's genome. We identified ?10,400 nonsynonymous single nucleotide polymorphisms (nsSNPs) in this individual, of which ?15-20% are rare in the human population. We predict ?1,500 nsSNPs affect protein function and these tend be heterozygous, rare, or novel. Of the ?700 coding indels, approximately half tend to have lengths that are a multiple of three, which causes insertions/deletions of amino acids in the corresponding protein, rather than introducing frameshifts. Coding indels also occur frequently at the termini of genes, so even if an indel causes a frameshift, an alternative start or stop site in the gene can still be used to make a functional protein. In summary, we reduced the set of ?12,500 nonsilent coding variants by ?8-fold to a set of variants that are most likely to have major effects on their proteins' functions. This is our first glimpse of an individual's exome and a snapshot of the current state of personalized genomics. The majority of coding variants in this individual are common and appear to be functionally neutral. Our results also indicate that some variants can be used to improve the current NCBI human reference genome. As more genomes are sequenced, many rare variants and non-SNP variants will be discovered. We present an approach to analyze the coding variation in humans by proposing multiple bioinformatic methods to hone in on possible functional variation. © 2008 Ng et al.
Source Title: PLoS Genetics
URI: https://scholarbank.nus.edu.sg/handle/10635/161684
ISSN: 15537390
DOI: 10.1371/journal.pgen.1000160
Rights: Attribution 4.0 International
Appears in Collections:Elements
Staff Publications

Show full item record
Files in This Item:
File Description SizeFormatAccess SettingsVersion 
10_1371_journal_pgen_1000160.pdf419.91 kBAdobe PDF

OPEN

NoneView/Download

Google ScholarTM

Check

Altmetric


This item is licensed under a Creative Commons License Creative Commons