UK CropNet databases: a brief guide - PowerPoint PPT Presentation

1 / 31
About This Presentation
Title:

UK CropNet databases: a brief guide

Description:

Intititally developed for worm data. Developed for biologists by biologists ... Data arranged into categories called classes' Data' can be (almost) anything, e. ... – PowerPoint PPT presentation

Number of Views:24
Avg rating:3.0/5.0
Slides: 32
Provided by: ukc5
Category:

less

Transcript and Presenter's Notes

Title: UK CropNet databases: a brief guide


1
UK CropNet databases a brief guide
  • Keith Bradnam
  • Nottingham Arabidopsis
  • Stock Centre

2
UK CropNet Databases
MilletGenes
Arabidopsis Genome Resource (AGR)
BarleyDB
FoggDB
BrassicaDB
3
ACEDB
  • A Caenorhabditis elegans Database
  • Intititally developed for worm data
  • Developed for biologists by biologists
  • Available for all computer platforms

4
Advantages of ACEDB
  • It works!
  • Configurable
  • Under constant development
  • Good feedback from creators
  • Can access ACEDB databases via web
  • Its free!

5
Disadvantages of ACEDB
  • Its free!
  • No manual
  • Not always intuitive to use

6
ACEDB database structure
  • Data arranged into categories called classes
  • Data can be (almost) anything, e.g. Author,
    genetic map, sequence, picture etc.
  • Data in any one class can be linked to data in
    others

7
Example database
Database
8
Searching ACEDB databases
  • Basic text search
  • Browse data by surfing
  • Powerful query language
  • Find Author May Follow Paper Follow Sequence
    Clone
  • Many plugins available, e.g. BLAST search
    against sequences

9
Local vs remote databases
  • People want web access to databases
  • Dont have space on their machine
  • Dont have expertise
  • Dont have time
  • People familiar with the web
  • Web based databases are not so powerful

10
Xace screenshot 1
11
UK Cropnet Screenshot 1
ukcrop.net/
12
UK Cropnet Screenshot 2
13
Acebrowser Screenshot 1
14
EcoSys
15
AGR a complex ACEDB database
  • There is a lot of Arabidopsis information
    available!
  • 130 Mb genome, nearly completed
  • Both major ecotypes have been sequenced
  • Organising so much data is not easy!
  • Why is there so much data?

16
Smallest plant genome?
17
What data is in AGR?
  • Sequences (updated daily)
  • AGI genome sequence (1,500 clones)
  • EST and GSS sequences
  • Organelle sequences
  • Other ecotype sequences
  • Protein sequences (Swissprot TREMBL)
  • Insert sequences
  • 225,000 sequences in total

18
Other AGR data
  • Maps and markers
  • Physical, genetic, and recombinant inbred (RI)
    maps
  • 1,200 RI markers
  • RI scoring information
  • Clone, Locus, and Allele info
  • Germplasm info with links to order stocks

19
Still more AGR data
  • Bibliographic data (from EMBL records)
  • Images (plants and gels)
  • Other species info (protein sequences and
    associated info from all higher plant species)
  • BLAST homologies millions of hits
  • Mostly intra-specific homologies
  • Some inter-specific homologies

20
Acebrowser Screenshot 2
21
Acebrowser screenshot 3
22
AGR page screenshot
ukcrop.net/agr/
23
AGR page screenshot2
24
Insert lines
  • Many Arabidopsis plants now contain random
    transposon insertions
  • Therefore genes of interest may be hit, or
    modified by inserts
  • Genomic location of inserts identified by blast
    analysis
  • Can only identify putative location

25
Insert data in AGR
  • SINS (Jones) inserts
  • Stocks sequences
  • IMA (Sundaresan) inserts
  • Stocks some location info
  • ITS (Pereira) inserts
  • Just sequences
  • Separate web page for inserts
  • ukcrop.net/agr/insert.html

26
Xace screenshot
27
Webace screenshot
28
Searching for inserts
Find sequence of interest in AGR
Look to see if sequence has homology to an insert
BLAST search your sequence against insert
sequences
Search against precomputed BLAST analysis
Look to see if corresponding stocks are available
29
NASC blast server
nasc.nott.ac.uk/blast.html
30
Search precomputed blast data
31
Future developments
  • More data
  • Microarray data
  • More insert sequences
  • Improved access to data
  • New look to Acebrowser interface
  • Bring GFace interface online
  • More links between NASC and AGR
Write a Comment
User Comments (0)
About PowerShow.com