Digital Library Development in the Asia Pacific - PowerPoint PPT Presentation

1 / 98
About This Presentation
Title:

Digital Library Development in the Asia Pacific

Description:

Digital Library Development in the Asia Pacific – PowerPoint PPT presentation

Number of Views:154
Avg rating:3.0/5.0
Slides: 99
Provided by: aiBpaA
Category:

less

Transcript and Presenter's Notes

Title: Digital Library Development in the Asia Pacific


1
  • Digital Library Development in the Asia Pacific
  • Digital Library Future Research
  • Case Study in International Disease Surveillance

Hsinchun Chen, Ph.D. ????????, ??? ?? McClelland
Professor, Director, Artificial Intelligence Lab
NSF COPLINK Center Dept. of Management
Information Systems University of Arizona
Acknowledgement NSF DLI1, DLI2, NSDL NIH, NLM,
NCI NSF DG, DOJ, DOD, DHS
2
Outline
  • DL After the First Decade
  • Asian Digital Library Development
  • NSF Cyberinfrastructure Program and NSF Chatham
    Digital Library Workshop
  • Case Study International Disease Surveillance
    BioPortal (from FMD to Avian Influenza)

3
Introduction
  • Digital libraries represent a form of information
    technology in which social impact matters as much
    as technological advancement.
  • Over the past decade the development of digital
    library activities has been steadily increasing.
  • International conferences in digital library have
    proliferated from their roots of ACM and IEEE
    Digital Conferences (and then the Joint
    Conference on Digital Libraries, JCDL) to the
    European version of ECDL (European Conference on
    Digital Libraries) and the Asian version of ICADL
    (International Conference of Asian Digital
    Libraries).

4
DLI Program Implementation History
Digital Libraries Initiative Phase 1
(1994-98) Sponsors NSF, DARPA, NASA 76
Proposals, 6 Awards, 25M total Digital Libraries
Initiative Phase 2 (1999-03) Sponsors NSF,
DARPA, NASA, NIH/NLM, NEH Partners IMLS,
Smithsonian Institution, NARA 300 Proposals, 34
Awards, 48M total International Digital
Libraries Cooperative Research Initiative
(1999-03) Sponsors NSF, JISC, DFG 60
Proposals, 16 Awards (6 with JISC, 4 with DFG),
6M NSF total Intl DL Cooperative Research
Initiative and Applications (2002- ) Sponsors
NSF, JISC, DFG 50 Proposals, Awards in process
(4 with JISC, 2 with DFG)
5
Select Digital Library Development Milestones
6
Digital Library Development in Asia Pacific (An
ICADL Analysis)
7
Overview of ICADL
  • ICADL (International Conference of Asian Digital
    Libraries)
  • Overview of ICADL
  • 80 participants in Hong Kong in 1998 (host CS)
  • 150 participants in Taipei, Taiwan in 1999
    (host LIS)
  • 300 participants in Seoul, Korea in 2000 (host
    CS)
  • 600 participants in Bangalore, India in 2001
    (host LIS)
  • 400 participants in Singapore in 2002 (host
    LIS)
  • 350 participants in Kuala Lumpur, Malaysia in
    2003 (host NLM)
  • 350 participants in Shanghai, China in 2004
    (host LIS, CS)
  • ICADL 2005, Bangkok, Thailand in December 2005

8
Summary on Participation of ICADL Conferences
9
Increase of Papers Accepted in ICADL
10
Increase of Countries Represented in ICADL
11
Topical Analysis in ICADL
  • Digital library research is not restricted to
    only technical aspects it involves social
    aspects as well.
  • From a technological perspective, digital
    libraries are a set of electronic resources that
    are built to help create, search, and use
    information.
  • From a sociological perspective, digital
    libraries are constructed by a community of users
    who use the system to better support their
    informational needs and applications. (Borgman,
    1998)

12
Topical Analysis Social Aspect
  • Multicultural Issues
  • In Asian digital library applications, there are
    countless scenarios that involve creating and
    distributing locally produced information
    collections.
  • e.g.
  • INFLIBNT project aimed at creating a digital
    library of theses and dissertations from India
    (Vijayakumar and Murthy, 2001).
  • The Tsinghua University Architecture Digital
    Library developed a prototype system to provide
    rich, valuable resources for traditional Chinese
    architecture research and education (Xing et al.,
    2002).

13
Topical Analysis Social Aspect
  • Asian Languages and Cross-lingual Issues
  • A crucial feature of Asian digital libraries is
    the ability to work in various local languages.
  • Chinese, Japanese, Korean, Indian, Malaysian, and
    Thai language processing techniques have been
    reported.
  • e.g.
  • Wong and Li (1998) and Yang et al. (1998) both
    studied Chinese information retrieval and
    discussed issues related to Chinese language
    indexing techniques.
  • Theeramunkong et al. (2002) investigated using
    n-gram and HMM approaches for Thai OCR
    application.

14
Topical Analysis Social Aspect
  • Asian Languages and Cross-lingual Issues (contd)
  • Cross-lingual information retrieval between
    English and Asian languages has been more widely
    studied in ICADL conferences than in other
    western digital library conferences.
  • e.g.
  • Qin et al. (2003) presented an English-Chinese
    cross-lingual Web retrieval system in the
    business domain.
  • Sugimoto (2001) presented a multilingual document
    browsing tool and its metadata creation carried
    out at ULIS.

15
Samples of Significant Digital Library Research
in Asia Pacific Capturing Cultural Heritage and
Indigenous Knowledge
16
International Islamic Digital Library Malaysia
  • Focus
  • To provide information on Islam and Muslims
    around the world
  • To act as a referral centre to direct information
    enquiries on Islam to the appropriate sources
  • To promote sharing and exchange of knowledge
    among scholars of Islam and those interested in
    it
  • To enable the world to understand Islam better
  • Partners
  • National Library of Malaysia
  • Multimedia Development Corporation
  • International Islamic
  • University Malaysia Library

17
International Islamic Digital Library Malaysia
  • Contents
  • Books,
  • Manuscripts
  • Special collections,
  • Theses and articles,
  • Journals and conferences papers,
  • Pictures, audios and videos
  • Service
  • Both in Arabic and English
  • Category browse
  • Browse search
  • Keyword search
  • Expert search
  • Broadcast search

http//www.iidl.net
18
Technology Development for Indian Languages
India
  • Focus
  • To develop information processing tools to
    facilitate human machine interaction in Indian
    languages and multi-lingual knowledge resources.
  • To support RD efforts in the area of information
    processing in Indian Languages and to support
    research on knowledge tools representation,
    integration, compression and learning
    methodologies.
  • To consolidate technologies thus developed for
    Indian languages and integrate these to develop
    innovative user products and services.

http//www.tdil.mit.gov.in
19
Technology Development for Indian Languages
India
  • Funding
  • Ministry of Information Technology, India
  • Partners
  • Indian Institute of Technology, Kanpur Hindi,
    Nepali
  • Indian Institute of Technology, Mumbai Marathi,
    Konkani
  • Indian Institute of Technology, Guwahati
    Assamese, Manipuri

http//www.tdil.mit.gov.in
20
Technology Development for Indian Languages
India
  • Contents
  • Multi-lingual dictionaries,
  • Thesauri,
  • Educational software,
  • Encyclopedia,
  • Gyan-nidhi creative writing system,
  • Translation support systems,
  • OCR,
  • Text-to-speech speech recognition system,
  • Pocket translator,
  • Personal digital assistants,
  • Reading machine for blinds deaf,
  • Portals,
  • e-governance / e-commerce / e-skills.

http//www.tdil.mit.gov.in
21
China Digital Library China
  • Focus
  • Strengthen and protect the cultural tradition and
    heritage
  • Enhance the usage and sharing of information
    resource
  • Serve the national projects and related researches

http//www.nlc.gov.cn
22
China Digital Library China
  • Funding
  • 10th Five-year Project
  • Ministry of Culture, China
  • Partners
  • National Library of China
  • Tsinghua University
  • Peking University
  • China Academy of Science
  • China Academy of Social Science
  • etc. (more than 100 different types of libraries
    and partners)

http//www.nlc.gov.cn
23
China Digital Library
http//www.nlc.gov.cn
24
Research Directions for Digital Libraries (The
Next Decade)- JCDL 2004, 2005, 2006- NSF
Cyberinfrastructure Program- NSF Chatham
Digital Library Workshop
25
Overview of JCDL 2004
  • Joint IEEE-CS/ACM Conference on Digital Libraries
    (JCDL), Tucson, AZ
  • Theme Global Reach and Diverse Impact.
  • Co-Chairs
  • Hsinchun Chen, Howard Wactlar, Ching-chih Chen

26
Strong Participation in JCDL04
27
JCDL 2005 and 2006
  • JCDL 2005, Denver, Colorado
  • JCDL 2006, Charlotte, North Carolina

28
Social Networks,Cyberinfrastructure (CI)and
Meta CI
  • Daniel E. AtkinsSchool of Information Dept. of
    Elec. Engr. Comp. Sci.University of
    MichiganAnn Arboratkins_at_umich.eduNovember 2005

29
CI Genealogy Movement
30
NSF Blue Ribbon Advisory Panel on
Cyberinfrastructure
  • a new age has dawned in scientific and
    engineering research, pushed by continuing
    progress in computing, information, and
    communication technology, and pulled by the
    expanding complexity, scope, and scale of todays
    challenges. The capacity of this technology has
    crossed thresholds that now make possible a
    comprehensive cyberinfrastructure on which to
    build new types of scientific and engineering
    knowledge environments and organizations and to
    pursue research in new ways and with increased
    efficacy.

31
(No Transcript)
32
NSF states intent toplay a leadership role
  • NSF will play a leadership role in the
    development and support of a comprehensive
    cyberinfrastructure essential to 21st century
    advances in science and engineering research and
    education.
  • NSF is the only agency within the U.S. government
    that funds research and education across all
    disciplines of science and engineering. ... Thus,
    it is strategically placed to leverage,
    coordinate and transition cyberinfrastructure
    advances in one field to all fields of research.
  • From NSF Cyberinfrastructure Vision for the 21st
    Century Discovery

33
Cyberinfrastructure-enhancedKnowledge
Communities (Networks)
Outcomes New Ideas, New Tools, Education
Career Development, Outreach
Attributes Collaborative, Multidisciplinary,
Geographically Distributed, Inter-institutional
Broader Application to other disciplines and
types of activity.
Specific Cyber Environments collaboratories,
grids, e-science community, virtual teams,
community portal, ...
Cyber-infrastructure Services Equipment,
Software, People, Institutions
Computation, Storage, Communication and Interface
Technologies
From Cummings Kiesler (2003) report on KDI
Initiative ultidisciplinary scientific
collaborations, see http//www.p2design.com/papers
/kdi.pdf
34
Cyberinfrastructureincludes both
  • Technology Infrastructure (creation and
    provisioning) - middleware, portals, HPC, hybrid
    (IP lambda) networks, ....
  • Social Infrastructure (competition cooperation,
    IP policies, incentive structure, cost, etc.)

35
From CI Advisory Panel Report
towards functionally complete CKCs
36
Transforming the Information Landscape Research
Directions for Digital Libraries
Report of the NSF Workshop on Research Directions
for Digital Libraries
  • Ronald L. Larsen
  • School of Information Sciences
  • University of Pittsburgh

Knowledge Lost in Information, NSF Award No.
IIS-0331314
37
Workshop Background
  • DLI over, DLI2 winding down
  • What is next?
  • Did we finish the job? Are we done?
  • What have we learned?
  • What constitutes DL research?
  • Does it influence other disciplines?
  • Should DL change from
  • Initiative to Program?

38
Emerging U.S. Vision for DLs
  • Next generation digital libraries will be
  • A confluence of resources, technology and
    infrastructure
  • An intersection of national priorities and
    scientific goals
  • A common testbed for all computer and information
    science research (sub)disciplines
  • Federated resources serving individuals,
    institutions and governments simultaneously
  • A progression from information to knowledge

39
Users
  • Cognitive Completion
  • MS spell-checker for facts and knowledge
  • Task and user context sensitive
  • Do what I mean
  • Find what I need
  • Be aware of what I know
  • Collaboration
  • Identifying the collegial context
  • Providing contextual guidance
  • Managing personal libraries
  • All that is seen and heard
  • Personal memory assistant

40
National priorities influence IT research agenda
Advances in Science and Engineering
Information Technology Research
Economic Prosperity and Vibrant Civil Society
National and Homeland Security
Digital Libraries form the Enabling Resources,
Technology Infrastructure
41
National priorities influence IT research agenda
  • Advances in Science and Engineering
  • Advance the frontiers of science and engineering
    research and education
  • Examples include those that collect, disseminate,
    and analyze observational or experimental data,
    or data from models or simulations
  • Economic Prosperity and Vibrant Civil Society
  • Human and socio-technical aspects of current and
    future distributed information
  • Topics include business, work, health,
    government, learning, and community, and their
    related policy implications.
  • National and Homeland Security
  • Robust Information Technology to protect critical
    infrastructures and support the understanding of
    threats to national security
  • Examples include collaborative knowledge
    environments, knowledge discovery, medical
    informatics, information extraction and fusion,
    cross-linguality, spoken language and imagery,
    social network analysis

42
Recommendations
  • 20M / year for new U.S. research
  • Search, context, extraction, ubiquity,
    productivity
  • 40M /year for sustaining evolving resources in
    the U.S.
  • Acquisition, access, usage, stewardship,
    management
  • Coordinate with Advanced Cyberinfrastructure
    Program

43
DL Research After the First Decade Global
Reach and Diverse Impact!Can DL help with
international security?Can DL help save human
life and improve patient well-being?
44
Case Study International Disease Surveillance
-- BioPortal
45
BioPortalDisease and Bioagent Information
Sharing, Surveillance, Analysis, and Visualization
  • Research Team
  • University of Arizona
  • University of California, Davis
  • Kansas State University
  • Arizona Department of Public Health
  • University of Utah
  • New York State Department of Health/HRI
  • California Department of Health Services/PHFE
  • U.S. Geological Survey
  • The SIMI Group
  • Acknowledgements NSF, ITIC, DHS, DOD/AFMIC,
    IDIWC, AZDPS

46
Research Partners and Supports
  • University of Arizona
  • University of California, Davis
  • Kansas State University
  • University of Utah
  • Arizona Department of Public Health
  • New York State Department of Health/HRI
  • California Department of Health Services/PHFE
  • U.S. Geological Survey
  • The SIMI Group
  • NSF
  • CIA/ITIC
  • DHS
  • DOD/AFMIC
  • CDC
  • AZDPS

47
Outline
  • Project Background
  • BioPortal V1.0 Achievements
  • System Architecture
  • System Functionalities
  • BioPortal Collaboration Framework
  • New Developments
  • International Foot-and-mouth Disease Monitoring
  • Syndromic Surveillance
  • Livestock Health Surveillance

48
BioPortal Background
Acknowledgment NSF, ITIC, NYSDH, CDHS,
USGS (Drs. Kvach and Ascher)
49
Background (I)
  • In September, 2002, representatives of 18
    different agencies, including DOD, DOE, DOJ, DHS,
    NIH/NLM, CDC, CIA, NSF, and NASA, were convened
    to discuss disease surveillance.
  • An interagency working group called Disease
    Informatics Senior Coordinating Committee (DISCC)
    was established.
  • DISCC established an Infectious Disease
    Informatics Working Committee (IDIWC) to survey
    the field and identify gaps.
  • IDIWC developed requirements for a National
    Infectious Disease Informatics Infrastructure
    (NIDII).

50
Background (II)
  • In June, 2003, IDIWC was charged with the task of
    developing one or more rapid prototype systems to
    demonstrate interoperability and innovation
    across species and jurisdictions.
  • Botulism and West Nile virus were selected as
    diseases.
  • States of New York and California were selected
    as partners.
  • The University of Arizona was chosen as
    integrator and was provided with a supplement to
    an existing NSF grant.

51
BioPortal Project Goals
  • Demonstrate and assess the technical feasibility
    and scalability of an infectious disease
    information sharing (across species and
    jurisdictions), alerting, and analysis framework.
  • Develop and assess advanced data mining and
    visualization techniques for infectious disease
    data analysis and predictive modeling.
  • Identify important technical and policy-related
    challenges in developing a national infectious
    disease information infrastructure.

52
BioPortal V1.0 Accomplishments
  • Prototype system design and development
  • Initial design and implementation of
    interoperable messaging backbones
  • Live prototype systems
  • Preliminary user evaluation
  • Information sharing
  • Data sharing agreements/memoranda of
    understanding (MOUs) developed
  • Many disease datasets integrated into the portal
  • Analysis and visualization
  • Hotspot analysis research
  • Spatial-Temporal Visualizer (STV)

53
Information Sharing Infrastructure Design
Portal Data Store (MS SQL 2000)
Data Ingest Control Module Cleansing /
Normalization
Info-Sharing Infrastructure
Adaptor
Adaptor
Adaptor
SSL/RSA
SSL/RSA
XML/HL7 Network
PHINMS Network
New
NYSDOH
CADHS
54
Data Access Infrastructure Design
55
BioPortal Collaboration Framework
  • A Memorandum of Understanding (MOU) is used to
    document the relationship between parties that
    will be sharing data
  • Who the entities are and how they will act
    independently and cooperatively
  • What the mutual interests, benefits, and purposes
    of sharing data are
  • How each party will maintain control over and
    share their resources, and what each party shall
    provide to the other (e.g., system accounts,
    portal access)
  • Which types of data are to be shared (e.g., dead
    bird surveillance)

56
Datasets Integrated WNV, BOT
57
Communications/Messaging
  • Scalable, flexible, light-weight, and extendible.
    Easy to include
  • New diseases
  • New jurisdictions
  • New techniques!
  • Messaging infrastructure installed and tested
  • NYSDOH-UA PHIN MS
  • CADHS-UA Regional message broker
  • NWHC-UA PHIN MS
  • XML generation/conversion
  • NY_DeadBird, NY_Alerts, NY_BotHuman, NY_WNVHuman,
    NY_CaptiveAnimal, NY_Mosquito
  • CA_BotHuman, CA_WNVHuman, CA_DeadBird,
    CA_Chicken, CA_Mosquito
  • USGS_Epizoo

58
BioPortal Research Framework
  • BioPortal Demo Develop the system for
    demonstration purposes using scrubbed data.
    Refine system functionality and performance based
    on user feedback.
  • BioPortal Operation Develop the system for
    production mode with real data and real users.
  • BioPortal Research Continue to develop
    advanced technologies and practical sharing
    policies. Expand to new diseases and
    jurisdictions.

59
BioPortal Prototype Systems
60
Spatio-Temporal Data Mining Hotspot Analysis
  • A hotspot is a condition indicating some form of
    clustering in a spatial and temporal distribution
    (Rogerson Sun 2001 Theophilides et al. 2003
    Patil Tailie 2004).
  • For WNV, localized clusters of dead birds
    typically identify high-risk disease areas
    (Gotham et al. 2001).
  • Automatic detection of dead bird clusters using
    hotspot analysis can help predict disease
    outbreaks and aid in effective allocation of
    prevention/control resources.

61
Existing Hotspot Analysis Approach SaTScan
  • The spatial scan statistical techniques
    implemented in SaTScan are widely used to detect
    and evaluate disease outbreaks (Kulldorff 2001).
  • NYSDOH has used SaTScan to develop an early
    warning system for WNV (Gotham et al. 2001).
  • An important factor considered by spatial scan
    statistical analysis is the baseline.
  • The significance of the density of dead birds
    depends on the historical distribution of bird
    deaths, human population, and so on.

62
Other Hotspot Analysis Approaches CrimeStat and
RSVC
  • Hotspot analysis techniques applied to crime
    analysis CrimeStat (Levine 2002).
  • CrimeStats Risk-Adjusted Nearest Neighbor
    Hierarchical Clustering (RNNH) Uses a kernel
    density estimation obtained from baseline data to
    adjust the threshold that controls whether data
    points can be grouped together.
  • Risk-Adjusted Support Vector Machine Clustering
    (RSVC) It combines the power and flexibility of
    support vector machine-based clustering and the
    risk adjustment idea of RNNH.

63
Case Study (NY WNV)
  • On May 26, 2002, the first dead bird with WNV was
    found in NY
  • Based on NYs test dataset

140 records
224 records
March 5
May 26
July 2
new cases
baseline
64
(No Transcript)
65
Dead Bird Hotspots Identified
66
Hotspot Analysis Findings
  • RSVC delivers similar recall levels and higher
    precision than SaTScan.
  • RNNH matches RSVC precision, but has very low
    recall.
  • RSVC significantly outperforms other methods in
    the F-measure.
  • Techniques could be complementary for different
    hotspot analysis tasks.

67
Spatial-Temporal Visualization
  • Integrates four visualization techniques
  • GIS View
  • Periodic Pattern View
  • Timeline View
  • Central Time Slider
  • Visualizes the events in multiple dimensions to
    identify hidden patterns
  • Spatial
  • Temporal
  • Hotspot analysis
  • Phylogenetic (planned)

68
(No Transcript)
69
(No Transcript)
70
(No Transcript)
71
BioPortal New Developments
  • NSF Infectious Disease Informatics Grant (2004-9)
  • International Foot-and-Mouth Disease BioPortal
    (2005-6) FMD Lab, UC Davis
  • Human Syndromic Surveillance System Arizona
    State Department of Health (2005-6)
  • Livestock Syndromic Surveillance System Kansas
    State University RSVP-A (2005-6)

72
New Research Directions
  • Analytical Algorithms
  • Prospective hotspot analysis auto baseline
    discovery
  • Spatial-Temporal correlation analysis
  • Dynamic Network Analysis
  • Visualization
  • International FMD news visualization
  • Phylogenetic Spatial-Temporal visualization
  • Syndromic Surveillance
  • Syndromic surveillance system survey
  • Emergency room chief complaint syndromic
    classification
  • Livestock syndromic surveillance

73
Extended BioPortal Research Framework
  • BioPortal Demo
  • BioPortal Operation
  • BioPortal Research
  • FMD BioPortal A dedicated instance of
    BioPortal customized for International
    Foot-and-Mouth disease monitoring. Additional
    functionalities such as gene sequence analysis
    and FMD News are added
  • BioPortal Syndromic Surveillance A specialized
    BioPortal instance that processes chief
    complainants using a hybrid method of ontology
    and knowledge rules
  • BioPortal Livestock A BioPortal instance
    devoted in Livestock syndromic surveillance case
    management and data analysis

74
International FMD BioPortal
Acknowledgment DHS, DOD, UC Davis (Drs. Thurmond
and Lynch)
75
Introduction
  • Foot-and-mouth disease (FMD) is the top disease
    on the Office International des Epizooties (OIE)
    List A, which can infect all cloven-hoofed
    animals.
  • FMD is the most contagious infectious diseases of
    livestock animals
  • Massive shedding of virus and contamination of
    the environment.
  • Transmitted by direct or indirect contact
    (droplets), animate vectors (humans), inanimate
    vectors (vehicles
  • Serologically diverse with seven distinct types
    (A, O, C, SAT1, SAT2, SAT3, Asia1), which makes
    diagnosis and vaccination problematic, and
    genetic diversity likely.
  • Endemic in Africa, Asia, Middle East and South
    America
  • Potential cost for U.S. outbreaks gt10 billion
  • Broader economic impact trade and travel
    restrictions.

76
FMD Information Model
77
International FMD BioPortal Goals
  • Real-time, web-based situational awareness of FMD
    outbreaks worldwide through the establishment of
    an international information sharing and analysis
    system
  • FMDv characterization at the genomic level
    integrated with associated epidemiological
    information and modeling tools to forecast
    national, regional, and/or international spread
    and the prospect of import into the U.S. and the
    rest of North America
  • Web-based crisis management of resourcesfacilitie
    s, personnel, diagnostics, and therapeutics

78
Research Plans
  • Global FMD epidemiological data
  • (Near) real-time data collection
  • Web-based information sharing and analysis
  • International FMD news
  • Indexed collection of global FMD news
  • Search and visualization of the FMD news via the
    web
  • FMD genetic/sequence data
  • Predictive model using phylogenetic, spatial, and
    temporal information to stop FMD at the boarder
  • Visualization for FMD event in time, space, and
    genetic space

79
Preliminary Global FMD Dataset
  • Provider UC Davis FMD Lab
  • Information sources reference labs and OIE
  • Coverage 28 countries globally
  • Time span May, 1905 March, 2005
  • Dataset size 30,000 records of which 6789
    records are complete
  • Host species Cattle, Caprine, Ovine, Bovine,
    Swine, NK, Elephant, Buffalo, Sheep, Camelidae,
    Goat

80
Global FMD Coverage in BioPortal
81
International FMD News
  • Provider UC Davis FMD Lab
  • Information sources Google, Yahoo, and open
    Internet sources
  • Time span Oct 4, 2004 present (real-time
    messaging under development)
  • Data size 460 events (6/21/05)
  • Coverage 51 countries
  • (Africa11, Asia16,
  • Europe12, Americas12)

82
Searching FMD News
  • http//fmd.ucdavis.edu/
  • Searchable by
  • Date range
  • Country
  • Keyword

83
Visualizing FMD News on BioPortal
84
FMD Genetic Information Analysis
  • Genome clustering analysis
  • Phylogenetic clustering
  • Spatial clustering
  • Temporal clustering
  • Hotspot detection among gene sequences
  • Create a tree structure based on semantic
    distance between gene sequences.
  • Automatically detect the dense portion of the
    tree.
  • Identify the connection between the semantic
    cluster and the geographic pattern of gene
    sequences.

85
FMD Genetic Visualization
  • Goal Extend STV to incorporate 3rd dimension,
    phylogenetic distance
  • Include a phylogenetic tree.
  • Identify phylogenetic groups and color-code the
    isolate points on the map.
  • Leverage available NCBI tools such as BLAST.
  • Proof of concept SAT 2 3 analysis
  • Data 54 partial DNA sequence records in South
    Africa received from UC Davis FMD Lab
    (Bastos,A.D. et al. 2000, 2003)
  • Date range 1978-1998
  • Countries covered South Africa, Zimbabwe,
    Zambia, Namibia, Botswana

86
Sample FMD Sequence Records
Color-coded View (MEGA3)
Textual View of Gene Sequence
87
Phylogenetic Trees
88
Phylogenetic Tree of Sample FMD Data
Group6
Group1
Identify 6 groups within 2 major families (MEGA3
based on sequence similarity)
Group2
Group5
Group4
Group3
89
Genetic, Spatial, and Temporal Visualization of
FMD Data
Phylogenetic tree color coded
Isolates locations color coded
Isolates appearances in time
90
FMD Time Sequence Analysis
First family cases appeared throughout the period
2nd family cases exist before 1993 and a comeback
lately
Second family cases existed before 1993 and
reappeared later after 1997
91
FMD Periodic Pattern Analysis
2nd family concentrated in Feb. while 1st family
spread evenly
92
Locations of Family 1 records
Selected only groups 1, 2, and 3 and found a
spatial cluster
93
Locations of Family 2 records
Sparse isolate locations
Selected only groups 4, 5, and 6
94
BioPortal Future Work
  • Complete open source, generic BioPortal
    architecture and system
  • Develop multi-lingual BioPortal
  • Incorporate other diseases, e.g., avian
    influenza, SARS
  • Solicit partners and expand test sites
  • Continue infectious disease informatics research

95
For more informationBioPortal web site
http//www.bioportal.org
96
A Little Promotion
97
Library as the PlaceLibrarian in
ContextRole of a librarian in a digital
world?Challenges and opportunities facing the
library and information profession?
98
For more informationHsinchun
Chenhchen_at_eller.arizona.eduAI Lab web site
http//ai.arizona.edu
Write a Comment
User Comments (0)
About PowerShow.com