Please use this identifier to cite or link to this item: https://doi.org/10.1111/j.1096-0031.2010.00329.x
DC FieldValue
dc.titleSequenceMatrix: Concatenation software for the fast assembly of multi-gene datasets with character set and codon information
dc.contributor.authorVaidya, G.
dc.contributor.authorLohman, D.J.
dc.contributor.authorMeier, R.
dc.date.accessioned2014-10-27T08:39:29Z
dc.date.available2014-10-27T08:39:29Z
dc.date.issued2011-04
dc.identifier.citationVaidya, G., Lohman, D.J., Meier, R. (2011-04). SequenceMatrix: Concatenation software for the fast assembly of multi-gene datasets with character set and codon information. Cladistics 27 (2) : 171-180. ScholarBank@NUS Repository. https://doi.org/10.1111/j.1096-0031.2010.00329.x
dc.identifier.issn07483007
dc.identifier.urihttp://scholarbank.nus.edu.sg/handle/10635/101649
dc.description.abstractWe present SequenceMatrix, software that is designed to facilitate the assembly and analysis of multi-gene datasets. Genes are concatenated by dragging and dropping FASTA, NEXUS, or TNT files with aligned sequences into the program window. A multi-gene dataset is concatenated and displayed in a spreadsheet; each sequence is represented by a cell that provides information on sequence length, number of indels, the number of ambiguous bases ("Ns"), and the availability of codon information. Alternatively, GenBank numbers for the sequences can be displayed and exported. Matrices with hundreds of genes and taxa can be concatenated within minutes and exported in TNT, NEXUS, or PHYLIP formats, preserving both character set and codon information for TNT and NEXUS files. SequenceMatrix also creates taxon sets listing taxa with a minimum number of characters or gene fragments, which helps assess preliminary datasets. Entire taxa, whole gene fragments, or individual sequences for a particular gene and species can be excluded from export. Data matrices can be re-split into their component genes and the gene fragments can be exported as individual gene files. SequenceMatrix also includes two tools that help to identify sequences that may have been compromised through laboratory contamination or data management error. One tool lists identical or near-identical sequences within genes, while the other compares the pairwise distance pattern of one gene against the pattern for all remaining genes combined. SequenceMatrix is Java-based and compatible with the Microsoft Windows, Apple MacOS X and Linux operating systems. The software is freely available from © The Willi Hennig Society 2010. © The Willi Hennig Society 2010.
dc.description.urihttp://libproxy1.nus.edu.sg/login?url=http://dx.doi.org/10.1111/j.1096-0031.2010.00329.x
dc.sourceScopus
dc.typeArticle
dc.contributor.departmentBIOLOGICAL SCIENCES
dc.description.doi10.1111/j.1096-0031.2010.00329.x
dc.description.sourcetitleCladistics
dc.description.volume27
dc.description.issue2
dc.description.page171-180
dc.description.codenCLADE
dc.identifier.isiut000288124600005
Appears in Collections:Staff Publications

Show simple item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.