Please use this identifier to cite or link to this item:
Title: Gene team tree: A hierarchical representation of gene teams for all gap lengths
Authors: Zhang, M.
Leong, H.W. 
Keywords: Algorithms
Comparative genomics
Gene clusters
Gene order analysis
Issue Date: 2009
Citation: Zhang, M., Leong, H.W. (2009). Gene team tree: A hierarchical representation of gene teams for all gap lengths. Journal of Computational Biology 16 (10) : 1383-1398. ScholarBank@NUS Repository.
Abstract: The identification of spatially co-located gene clusters is an important step towards understanding genome evolution and function. Gene team is a popular model for conserved gene clusters that constrains the maximum distance between adjacent genes in the same cluster. Existing algorithms for finding gene teams require the specification of the maximum allowed distance, δ. However, determining suitable values of δ is non-trivial, due to varying rates of rearrangement and differences in the distribution of genes across multiple genomes. Instead of trying to determine a single best value of δ, we propose constructing the Gene Team Tree, a compact representation of gene teams for all values of δ. The teams computed can then be verified/scored using application specific methods. Our algorithm for computing the GTT extends existing gene team mining algorithms without increasing their time complexity. We compute the GTT for E. coli K-12 and B. subtilis and show that E. coli K-12 operons are modelled by gene teams with different values of δ. We demonstrate the scalability of our method and the trade-off involved when comparing more than two genomes, through a comparative study using five gamma-proteobacteria genomes. Lastly, we describe how to compute the GTT for multi-chromosomal genomes and illustrate by computing the GTT for the human and mouse genomes. An implementation of the algorithms described in this article and the datasets used in the experiments can be downloaded from © 2009 Mary Ann Liebert, Inc.
Source Title: Journal of Computational Biology
ISSN: 10665277
DOI: 10.1089/cmb.2009.0093
Appears in Collections:Staff Publications

Show full item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.