Please use this identifier to cite or link to this item: https://doi.org/10.1093/bib/bbp007
Title: Scaling the walls of discovery: Using semantic metadata for integrative problem solving
Authors: Manning, M.
Aggarwal, A.
Gao, K.
Tucker-Kellogg, G. 
Keywords: Architecture
Cancer
Data integration
Genomics
Metadata
Semantic web
Issue Date: 2009
Citation: Manning, M., Aggarwal, A., Gao, K., Tucker-Kellogg, G. (2009). Scaling the walls of discovery: Using semantic metadata for integrative problem solving. Briefings in Bioinformatics 10 (2) : 164-176. ScholarBank@NUS Repository. https://doi.org/10.1093/bib/bbp007
Abstract: Current data integration approaches by bioinformaticians frequently involve extracting data from a wide variety of public and private data repositories, each with a unique vocabulary and schema, via scripts. These separate data sets must then be normalized through the tedious and lengthy process of resolving naming differences and collecting information into a single view. Attempts to consolidate such diverse data using data warehouses or federated queries add significant complexity and have shown limitations in flexibility. The alternative of complete semantic integration of data requires a massive, sustained effort in mapping data types and maintaining ontologies. We focused instead on creating a data architecture that leverages semantic mapping of experimental metadata, to support the rapid prototyping of scientific discovery applications with the twin goals of reducing architectural complexity while still leveraging semantic technologies to provide flexibility, efficiency and more fully characterized data relationships. A metadata ontology was developed to describe our discovery process. A metadata repository was then created by mapping metadata from existing data sources into this ontology, generating RDF triples to describe the entities. Finally an interface to the repository was designed which provided not only search and browse capabilities but complex query templates that aggregate data from both RDF and RDBMS sources. We describe how this approach (i) allows scientists to discover and link relevant data across diverse data sources and (ii) provides a platform for development of integrative informatics applications. © The Author 2009. Published by Oxford University Press.
Source Title: Briefings in Bioinformatics
URI: http://scholarbank.nus.edu.sg/handle/10635/102533
ISSN: 14675463
DOI: 10.1093/bib/bbp007
Appears in Collections:Staff Publications

Show full item record
Files in This Item:
There are no files associated with this item.

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.