An agent- and ontology-based system for integrating public gene, protein, and disease databases.
Oligonucleotide Array Sequence Analysis
Database Management Systems
Genetic Predisposition to Disease
Information Storage and Retrieval
In this paper, we describe OntoFusion, a database integration system. This system has been designed to provide unified access to multiple, heterogeneous biological and medical data sources that are publicly available over Internet. Many of these databases do not offer a direct connection, and inquiries must be made via Web forms, returning results as HTML pages. A special module in the OntoFusion system is needed to integrate these public 'Web-based' databases. Domain ontologies are used to do this and provide database mapping and unification. We have used the system to integrate seven significant and widely used public biomedical databases: OMIM, PubMed, Enzyme, Prosite and Prosite documentation, PDB, SNP, and InterPro. A case study is detailed in depth, showing system performance. We analyze the system's architecture and methods and discuss its use as a tool for biomedical researchers.