Interoperability Workshop
JANUARY 8, 1999
MARRIOTT HOTEL, POOKS HILL
Kensington Room
Bethesda, MD
8:30am – 3:30 pm
Agenda
8:30 Meeting Introduction-All
9:00 Overview of Issue and lessons learned from large database operations
- Dennis Benson
- Jim Ostell
(National Center for Biotechnology Information, National Library of Medicine, NIH)
10:00 Break
10:15 General Discussion-All.
- Define concerns
- Suggest solutions
- Suggest options
12:00 Lunch Break
1:00 Summary Discussion of Interoperability and Future Steps-All
- Basic Neuroscience
- Clinical Neuroscience
- Informatics
2:00 Other Issues-All
3:30 Meeting Adjourn
Participants Neuroinformatics
Interoperability Workshop
Consultants
Ira Black- UMDNJ-Robert Wood Johnson Medical School
Joseph T. Coyle-Harvard Medical School
Daniel Gardner-Cornell Medical College
Edward D. Jones-University of California, Davis
Perry Molinoff-Bristol-Meyers Squibb
Gordon Shepherd-Yale Medical School
NIH Participants
Tom Aigner-National Institute on Drug Abuse
Connie Atwell- National Institute of Neurological Disorders and Stroke
Dennis Benson- National Center for Bioinformatics, National Library of Medicine
Gerald Fischbach-Director, National Institute of Neurological Disorders and Stroke
Michael Hirsch-National Institute of Mental Health.
Steven Hyman-Director, National Institute of Mental Health
Cheryl Kitt- National Institute of Neurological Disorders and Stroke
Stephen Koslow-National Institute of Mental Health. Chair of the Federal Interagency Coordinating Committee on the Human Brain Project(FICC-HBP)
James Ostell-National Center for Bioinformatics, National Library of Medicine
Rochelle Small-National Institute of Deafness and Other Communication Disorders
SUMMARY REPORT
NEUROINFORMATICS
INTEROPERABILITY WORKSHOP
JANUARY 8, 1999
MARRIOTT HOTEL, POOKS HILL
Goal
To discuss and solicit opinions on the challenges and opportunities and next steps needed to advance the formation of a distributed set of interoperable databases in Neuroscience. The overarching thrust of this endeavor is to promote the creation within the Neuroscience community of systems for information management that communicate among databases and datasets across Neuroscience and other major relevant fields (e.g., genome, protein, etc.).
- Jim Ostell, drawing examples from sequence databases, reviewed some of the elements aiding interoperability, including:
- Shared Linkages
- Accession Numbers
- Standard Vocabulary
- Standard Nomenclature
- In data or real time
- Data Models
- formal versus informal
- inter-structural and -functional relationships at varying levels of analyses
- model interactions, and the importance of continued maintenance of relationships, computational components, and linkages for original observed data vs. the insufficiency for this requirement for data interpretation.
The heuristic nature of scientific discovery employs both the initial descriptions of raw data and direct linkages, followed by hypothesis testing and theoretical development to further establish more complex parametric interrelationships. Scientific progress requires that the informational relationships be maintained accessible, at least locally, if not more broadly interoperable across all databases.
- Levels of interoperability
- Compatible schema (Linkages and Data Models)
- Compatible datasets (Defined Formats or interfaces such as CORBA)
- Portability among hardware and operating system platforms
- Concerns Discussed
- Maintenance of Databases
- Definition of a dataset/database
- Criteria for selection (inclusion vs. exclusion)
- Updating - How and when made?
- When to discard
- Provisional sets of domains (i.e., since data models impose boundaries)
- Compatibility and interfacing multi-platform databases and communities of federated databases
- Feasibility studies to establish reliability and cross-validate both datasets per se, and database interoperability
- How to remain contemporary with technological advancement
- Who does all of the above maintenance?
- How should it be supported?
- Scope of Support
- Investigator initiated
- Contract
- Cooperative Agreements
- Focused RFAs
- Neuroscience Data
- Select neuroscience data for inclusion
- Define data models
- Specify domains of investigation
- Prioritize goals (identification of critical national needs for specified areas of investigation and their broader objectives)
- Specify types of Neuroinformatics tools required
- Confirm data reliability and validity
- Establish equivalence canonical models
- Standards
- How to engage the Informatics Community
- What are the current career-tracks?
- Cross-training Issues (across the Informatics and Neuroscience fields)
- Differential salaries (in the private vs. the public academic sectors; resource developers vs. Neuroscience research investigators)
- Summer Workshops
- Keystone Meetings
- Gordon Conferences
- Annual Neuroscience meeting
Summary Issues
At the end of the day, the issues highlighted were:
- To identify and prioritize the database needs of the Neuroscience community and determine how they differ from those of other communities.
- To specify the informatics skills needed to aid Neuroscience and how to interest and recruit individuals with these skills.
- How to achieve interoperability?
Three major recommendations were made:
- Establish an Advisory Committee comprised of Informaticians and Neuroscientists.
- Establish mechanisms to develop common or compatible nomenclature, data models, and organizing themes, and encourage or mandate interoperability components to neuroinformatics projects.
- Focus initially upon Neuroanatomy and establish a nomenclature, data models, and an organizing theme.
Additional Post Meeting Comments:
Encourage or mandate interoperability components to neuroinformatics projects. The focus should not be on neuroanatomy only.
There is a need to be more explicit about defining, from a functional point of view, what neuroscientists need from databases and where they see databases making a significant contribution to the way they do science. In the end, it may turn our to be a fragmented answer, with different specialties having very different needs. But it may be worth laying out the general landscape of imaging, neural circuitry, receptor/gene expression databases, etc. and tying these to how they can be used in conducting research as opposed to being simply compendiums of facts. Once this is laid out, it may be easier to answer questions such as comparing the needs of neuroscience vs. other communities and issues of interoperability.
The critical next step involves the delineation of alternative Data Set strategies to begin encompassing the scope of the project problem and exemplars of solutions. For example, one potential approach might involve nigrostriatal gene expression-dopamine structure and function (etc.)-intracellular cascades-relevant motor programs-striatal memory-Parkinson's disease-L-DOPA-pallidotomy-deep brain stimulation....... etc. An alternative would be the use of a relatively well-defined system, such as the hippocampal system, with analysis from single channel to synaptic transmission to cell system and memory, entering data from LTP to chemical messengers to patient H.M.While we could formulate many different examples, the point, I think is made that different categories and strategies must be articulated for choices to be made for the direction of the overall project.
Undoubtedly an additional meeting would be required to consider the merits of hierarchical, structural, functional, systems, disease-oriented models as starting points.
