Analysis Environments For Scientific Communities From Bases to Spaces - PowerPoint PPT Presentation

1 / 60
About This Presentation
Title:

Analysis Environments For Scientific Communities From Bases to Spaces

Description:

Mice and Men, Worms and Flies, Yeasts and Weeds. ... Honey Bee / Fruit Fly. Song Bird / Soy Bean. Behavior. Social / Territorial. Foraging / Nesting ... – PowerPoint PPT presentation

Number of Views:26
Avg rating:3.0/5.0
Slides: 61
Provided by: CAN128
Category:

less

Transcript and Presenter's Notes

Title: Analysis Environments For Scientific Communities From Bases to Spaces


1
Analysis Environments For Scientific
CommunitiesFrom Bases to Spaces
Bruce R. Schatz Institute for Genomic
Biology University of Illinois at
Urbana-Champaign schatz_at_uiuc.edu,www.beespace.uiuc
.edu
Baker Center for Bioinformatics Iowa State
University October 6, 2006
2
What are Analysis Environments
  • Functional Analysis
  • Find the underlying Mechanisms
  • Of Genes, Behaviors, Diseases
  • Comparative Analysis
  • Top-down data mining (vs Bottom-up)
  • Multiple Sources especially literature

3
Building Analysis Environments
  • Manual by Humans
  • Interaction user navigation
  • Classification collection indexing
  • Automatic by Computers
  • Federation search bridges
  • Integration results links

4
Trends in Analysis Environments
  • Central versus Distributed Viewpoints
  • The 90s Pre-Genome
  • Entrez (NIH NCBI) versus
  • WCS (NSF Arizona)
  • The 00s Post-Genome
  • GO (NIH curators) versus
  • BeeSpace (NSF Illinois)

5
Pre-Genome Environments
  • Focused on Syntax pre-Web
  • WCS (Worm Community System)
  • Search words across sources
  • Follow links across sources
  • Words automatic, Links manual
  • Towards Integrated Searching

6
Post-Genome Environments
  • Focused on Semantics post-Web
  • BeeSpace (Honey Bee Inter Space)
  • Navigate concepts across sources
  • Integrate data across sources
  • Concepts automatic, Links automatic
  • Towards Conceptual Navigation

7
Worm Community System
  • WCS Information
  • Literature BIOSIS, MEDLINE, newsletters,
    meetings
  • Data Genes, Maps, Sequences, strains, cells
  • WCS Functionality
  • Browsing search, navigation
  • Filtering selection, analysis
  • Sharing linking, publishing
  • WCS 250 users at 50 labs across Internet (1991)

8
WCS Molecular
9
WCS Cellular
10
WCS invokes gm
11
WCS vis-à-vis acedb
12
Towards the Interspace
  • from Objects to Concepts
  • from Syntax to Semantics
  • Infrastructure is Interaction with Abstraction

Internet is packet transmission across
computers Interspace is concept navigation
across repositories
13
THE THIRD WAVE OF NET EVOLUTION
CONCEPTS
OBJECTS
PACKETS
14
LEVELS OF INDEXES
15
Post-Genome Informatics I
  • Comparative Analysis within the
  • Dry Lab of Biological Knowledge
  • Classical Organisms have Genetic Descriptions.
  • There will be NO more classical organisms beyond
  • Mice and Men, Worms and Flies, Yeasts and Weeds.
  • Must use comparative genomics on classical
    organisms
  • Via sequence homologies and literature analysis.

16
Post-Genome Informatics II
  • Functional Analysis within the
  • Dry Lab of Biological Knowledge
  • Automatic annotation of genes to standard
    classifications, e.g. Gene Ontology via homology
    on computed protein sequences.
  • Automatic analysis of functions to scientific
    literature, e.g. concept spaces via text
    extractions. Thus must use functions in
    literature descriptions.

17
Informatics From Bases to Spaces
  • data Bases support genome data
  • e.g. FlyBase has sequences and maps
  • Genes annotated by GeneOntology and
  • linked to biological literature
  • information Spaces support biological literature
  • e.g. BeeSpace uses automatically generated
  • conceptual relationships to navigate functions

18
BeeSpace FIBR Project
  • BeeSpace project is NSF FIBR flagship
  • Frontiers Integrative Biological Research,
  • 5M for 5 years at University of Illinois
  • Analyzing Nature and Nurture in Societal Roles
    using honey bee as model
  • (Functional Analysis of Social Behavior)
  • Genomic technologies in wet lab and dry lab
  • Bee Biology gene expressions
  • Space Informatics concept navigations

19
(No Transcript)
20
System Architecture
  • BeeSpace
  • Concepts
  • Concepts
  • SEQ
  • Expressions
  • Expressions
  • Databases
  • Bees
  • Flies
  • Documents
  • Documents
  • SEQ
  • Community
  • Community

21
Concept Navigation in BeeSpace
22
V1 BeeSpace Community Collections
  • Organism
  • Honey Bee / Fruit Fly
  • Song Bird / Soy Bean
  • Behavior
  • Social / Territorial
  • Foraging / Nesting
  • Development
  • Behavioral Maturation
  • Insect Development
  • Insect Communication
  •  Structure
  • Fly Genetics / Fly Biochemistry
  • Fly Physiology / Insect Neurophysiology

23
CONCEPT SWITCHING
  • Concept versus Term
  • set of semantically equivalent terms
  • Concept switching
  • region to region (set to set) match

24
(No Transcript)
25
(No Transcript)
26
(No Transcript)
27
(No Transcript)
28
(No Transcript)
29
BeeSpace Analysis Environment
  • Build Concept Space of Biomedical Literature for
    Functional Analysis of Bee Genes
  • -Partition Literature into Community Collections
  • -Extract and Index Concepts within Collections
  • -Navigate Concepts within Documents
  • -Follow Links from Documents into Databases
  • Locate Candidate Genes in Related Literatures
    then follow links into Genome Databases

30
Well Characterized Gene
  • Ling et. al., PSB 2006

31
Poorly Characterized Gene
  • Ling et. al., PSB 2006

32
Gene Summarization, BeeSpace V2
33
(No Transcript)
34
Collaboration across Users
35
(No Transcript)
36
(No Transcript)
37
(No Transcript)
38
(No Transcript)
39
(No Transcript)
40
(No Transcript)
41
Category Browse (Collection)
42
(No Transcript)
43
(No Transcript)
44
Category Browse (Search)
45
(No Transcript)
46
(No Transcript)
47
PlantSpace Examples
48
(No Transcript)
49
(No Transcript)
50
(No Transcript)
51
(No Transcript)
52
(No Transcript)
53
(No Transcript)
54
(No Transcript)
55
(No Transcript)
56
Interactive Functional Analysis
  • BeeSpace will enable users to navigate a uniform
    space of diverse databases and literature sources
    for hypothesis development and testing, with a
    software system beyond a searchable database,
    using literature analyses to discover functional
    relationships between genes and behavior.
  • Genes to Behaviors
  • Behaviors to Genes
  • Concepts to Concepts
  • Clusters to Clusters
  • Navigation across Sources

57
BeeSpace Information Sources
  • General for All Spaces
  • Scientific Literature
  • -Medline, Biosis, CAB Abstracts
  • Genome Databases
  • -GenBank, ProteinDataBank, ArrayExpress
  • Special for BeeSpace
  • Model Organisms (heredity)
  • -Gene Descriptions (FlyBase, WormBase)
  • Natural Histories (environment)
  • -BeeKeeping Books (Cornell, Harvard)

58
XSpace Information Sources
  • Organize Genome Databases (XBase)
  • Compute Gene Descriptions from Model Organisms
  • Partition Scientific Literature for Organism X
  • Compute XSpace using Semantic Indexing
  • Boost the Functional Analysis from Special
    Sources
  • Collecting Useful Data about Natural Histories
  • e.g. CowSpace Leverage in AIPL Databases

59
Towards SoySpace
  • Organize Genome Databases (SoyBase)
  • Partition Scientific Literature for SoyBean
  • Gene Descriptions from Models (TAIR)
  • Natural Histories from Population Databases
  • Key to Functional Analysis is Special Sources
  • Collecting Appropriate Text about Genes
  • Extracting Adequate Data about Histories
  • Leverage is National Archives of germplasm and
    Historical Records for soybean crops

60
Towards the Interspace
  • The Analysis Environment technology is
    GENERAL! BirdSpace? BeeSpace?
  • PigSpace? CowSpace?
  • BehaviorSpace? BrainSpace?
  • SoySpace? PlantSpace?
  • BioSpace
  • Interspace
Write a Comment
User Comments (0)
About PowerShow.com