Title: The UCSC (University of California Santa Cruz) Genome Browser: the golden path (genome.ucsc.edu)
1The UCSC (University of California Santa Cruz) Genome Browser: the golden path
genome.ucsc.edu
- Stephen Baird
- Apoptosis Research Centre
- Children's Hospital of Eastern Ontario
- sbaird_at_arc.cheo.ca
2Jim Kent
- Produced the first assembly of the human genome as a graduate student, using his program GigAssembler
- Catalog of software includes
- blat - Fast alignment of similar sequences.
- autoSql - Create SQL and C code for permanently storing a structure in a database and loading it back into memory, based on a specification file.
- ameme - Find motifs in DNA sequence.
- 40 other command-line programs for the Genome Browser
- The Intronerator - Look at C. elegans genes and splicing patterns.
- cis-Site Seeker - Look for regulatory regions in RNA or DNA sequences.
3UCSC Genome Gateway Structure
Custom tracks
Genome browser
Table browser
Your sequence
Gene sorter
BLAT
in silico PCR
Proteome browser
Downloadable data files
Public MySQL server
4The UCSC Home page genome.ucsc.edu
5UCSC Genome Browser Gateway: start page, basic search
6(No Transcript)
7Overview of the whole Genome Browser page
Genome viewer section
Groups of data
Variation and Repeats
8(No Transcript)
9Configure Tracks: Spliced ESTs, Microarray Expression, Repeats, etc.
10Spliced ESTs By UCSC
Simple Repeats
11Known Gene Details page for Clock gene
Gene Description
Links to Tools/DBs
UniProt Description
Links to output Sequence
Microarray data
mRNA secondary structure
Protein domains/structure
Homologs
Gene Ontology (GO)
mRNA descriptions
pathways
12Proteome Browser
13Genome Gateway Help/Users Guide
14BLAT: BLAST-Like Alignment Tool
15In Silico PCR
16Gene Sorter and Table Browser
- Query the database by filtering and cross-referencing all of its data tables to output sequence, genomic positions or text data.
- What is in all the tables?
- genome.ucsc.edu/goldenPath/gbdDescriptions.html
17Gene Sorter
- Display a sorted table of genes that are related to one another
- EXAMPLE 1: Make a list of genes for membrane proteins that are highly expressed in pancreatic islet cells, which could be used to explore the role of autoimmunity in Type 1 Diabetes.
18Gene Sorter - Configure
19Gene Sorter - Filter
20Gene Sorter - Output
Sequence: genomic, protein or mRNA
Text: tab-delimited
21Gene Sorter - To Try Now
- EXAMPLE 2: Find genes expressed predominantly in the mouse adrenal gland that have human homologs. Get the sequence data and examine the expression of the human orthologs.
- Enter any gene to start.
- In the configure menu: (a) expand the tissue selection of GNF Atlas 2 to median of replicas, (b) click on human homologs.
- In the filter menu: (a) set the adrenal gland minimum box to 2.5, (b) look at the results and set the maximum box of other commonly expressed tissues to 0.5.
- Complete solution in notes 7.2 UCSC.
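The filter step in EXAMPLE 2 can be pictured outside the browser with a small Python sketch. This is an illustration only, not how the Gene Sorter is implemented: the gene names, tissue names and expression values are invented, and only the thresholds 2.5 and 0.5 come from the steps above.

```python
# Hypothetical expression values per gene and tissue (invented numbers,
# standing in for the GNF Atlas 2 columns shown in the Gene Sorter).
EXPRESSION = {
    "geneA": {"adrenal gland": 3.1, "liver": 0.2, "brain": 0.4},
    "geneB": {"adrenal gland": 2.8, "liver": 1.9, "brain": 0.1},
    "geneC": {"adrenal gland": 0.7, "liver": 0.3, "brain": 0.2},
}


def predominantly_expressed(expression, target, min_target=2.5, max_other=0.5):
    """Return genes at or above min_target in the target tissue and
    at or below max_other in every other tissue."""
    selected = []
    for gene, by_tissue in expression.items():
        # Skip genes that miss the minimum in the tissue of interest.
        if by_tissue.get(target, float("-inf")) < min_target:
            continue
        others = [value for tissue, value in by_tissue.items() if tissue != target]
        if all(value <= max_other for value in others):
            selected.append(gene)
    return sorted(selected)


adrenal_genes = predominantly_expressed(EXPRESSION, "adrenal gland")
print(adrenal_genes)
```

In this toy data only geneA passes both thresholds; geneB is rejected because of its liver value and geneC because of its low adrenal value.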
22Table Browser
Groups as in Browser
Tracks within Group
Filter fields in Table and connecting Tables
Intersect non-connecting Tables by position
RESET!
23Table Browser table schema
24Table Browser Example
- EXAMPLE 3: Find CpG islands in known genes on the last part of chromosome 22 of the human genome. Obtain the gene sequences as one FASTA record per region.
Change to
25Table Browser CpG Example
Set group for Expression and Regulation and
track for CpG Islands
Click on intersection
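Intersecting two tables by position comes down to an interval-overlap test. The sketch below is a minimal Python illustration under stated assumptions, not the Table Browser's own code: the `Interval` type, the feature names and all coordinates are invented, and intervals are treated as 0-based, half-open ranges.

```python
from typing import List, NamedTuple


class Interval(NamedTuple):
    chrom: str
    start: int  # 0-based, inclusive
    end: int    # exclusive
    name: str


def overlaps(a: Interval, b: Interval) -> bool:
    """True when a and b share at least one base on the same chromosome."""
    return a.chrom == b.chrom and a.start < b.end and b.start < a.end


def intersect(features: List[Interval], regions: List[Interval]) -> List[Interval]:
    """Keep each feature that overlaps at least one region."""
    return [f for f in features if any(overlaps(f, r) for r in regions)]


# Invented coordinates for illustration only.
cpg_islands = [
    Interval("chr22", 1000, 1500, "CpG_1"),
    Interval("chr22", 5000, 5300, "CpG_2"),
    Interval("chr22", 9000, 9200, "CpG_3"),
]
known_genes = [
    Interval("chr22", 1200, 4000, "gene_1"),
    Interval("chr22", 9200, 9900, "gene_2"),
]

hits = intersect(cpg_islands, known_genes)
print([h.name for h in hits])
```

With half-open coordinates, CpG_3 ends exactly where gene_2 starts, so the two touch but do not overlap. The nested loop is quadratic, which is fine for a sketch but would be replaced by sorted or indexed intervals for whole-genome tables.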
26Table Browser CpG Example
27Table Browser CpG Example
Copy and paste sequences or set up an output file in the Table Browser
28Table Browser Example To Try
- EXAMPLE 4: Find trinucleotide repeats of more than 10 copies within mRNA sequence on human chromosome 4. How many are there? How many are linked to known disease genes?
- Hints:
- Period = 3, copies > 10.
- Intersect tables and custom track.
- Tables: knownGene, simpleRepeats, spDisease
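The first hint (period 3, more than 10 copies) is a plain row filter. A minimal Python sketch, assuming the repeat rows have been exported as dictionaries: the field names `period` and `copyNum` are assumptions about the table's columns, and every row below is invented.

```python
# Invented rows; the field names are assumed, not taken from the slides.
repeat_rows = [
    {"chrom": "chr4", "chromStart": 100, "chromEnd": 145, "period": 3, "copyNum": 15.0},
    {"chrom": "chr4", "chromStart": 900, "chromEnd": 930, "period": 3, "copyNum": 10.0},
    {"chrom": "chr4", "chromStart": 2000, "chromEnd": 2050, "period": 2, "copyNum": 25.0},
    {"chrom": "chr5", "chromStart": 300, "chromEnd": 390, "period": 3, "copyNum": 30.0},
]


def trinucleotide_repeats(rows, chrom="chr4", min_copies=10):
    """Keep repeats on chrom with period 3 and strictly more than min_copies copies."""
    return [
        row
        for row in rows
        if row["chrom"] == chrom and row["period"] == 3 and row["copyNum"] > min_copies
    ]


selected = trinucleotide_repeats(repeat_rows)
print(len(selected))
```

The comparison is strict, so a repeat with exactly 10 copies is excluded. Restricting the result to mRNA sequence and to disease genes still requires the table intersections named in the hints.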
29VisiGene: in situ mRNA and protein images in mice and frogs
30Data Downloads: from the download link on the homepage
. . .
31Example simpleRepeats table
32Public MySQL Server
- See the Data and Downloads FAQ
- Direct MySQL access to data
- http://genome.ucsc.edu/FAQ/FAQdownloads#download29
- Command from local MySQL client
- mysql --user=genome --host=genome-mysql.cse.ucsc.edu -A
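For scripted access, the client command above can be assembled as an argument list in Python. This is a sketch under stated assumptions: the helper name `build_mysql_command` is invented, the database name `hg19` and the `SHOW TABLES` query are placeholders not taken from the slides, and the code only builds and prints the command rather than connecting to the server.

```python
import shlex


def build_mysql_command(database=None, query=None):
    """Build the argument list for the mysql client call to the public server."""
    cmd = ["mysql", "--user=genome", "--host=genome-mysql.cse.ucsc.edu", "-A"]
    if query is not None:
        # -e runs a single statement and exits instead of opening a prompt.
        cmd += ["-e", query]
    if database is not None:
        # The database name is passed as the final positional argument.
        cmd.append(database)
    return cmd


interactive = build_mysql_command()
print(shlex.join(interactive))

# "hg19" and the query are placeholders chosen for illustration.
one_shot = build_mysql_command(database="hg19", query="SHOW TABLES")
print(shlex.join(one_shot))
```

The resulting list can be handed to `subprocess.run` on a machine with a MySQL client installed and network access to the server.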