National Resource for Proteomics and Pathways P41 funded by NCRR - PowerPoint PPT Presentation

1 / 29
About This Presentation
Title:

National Resource for Proteomics and Pathways P41 funded by NCRR

Description:

Curt Wilkerson (MSU) David States (UM) NCRR Proteomics, Glycomics, and Mass Spectrometry Centers ... Curt Wilkerson. Brett Phinney. Eric Deutsch. Joshua Adkins ... – PowerPoint PPT presentation

Number of Views:52
Avg rating:3.0/5.0
Slides: 30
Provided by: Andr544
Category:

less

Transcript and Presenter's Notes

Title: National Resource for Proteomics and Pathways P41 funded by NCRR


1
National Resource for Proteomics and PathwaysP41
funded by NCRR
  • Philip Andrews, Director (UM)
  • Russ Finley (WSU)
  • Trey Ideker (UCSD)
  • George Michailidis (UM)
  • Curt Wilkerson (MSU)
  • David States (UM)

2
NCRR Proteomics, Glycomics, and Mass Spectrometry
Centers
  • Proteomics Research Resource for Integrative
    Biology
  • PNNL Richard Smith
  • National Resource for Proteomics and Pathways
  • University of Michigan Philip Andrews
  • National Center for Glycomics and Glycoproteomics
  • University of Indiana- Milos Novotny
  • Integrated Technology Resource for Biomedical
    Glycomics
  • University of Georgia- J. Michael Pierce
  • Research Resource for Integrated Glycotechnology
  • University of Georgia- James Prestegard
  • Mass Spectrometry Resource for Biology and
    Medicine BU- Catherine Costello
  • Bio-organic Biomedical Mass Spectrometry Resource
    UCSF Alma Burlingame
  • National Resource for Mass Spectrometric Analysis
    of Biological Macromolecules Rockefeller
    University Brian Chait
  • Resource for Biomedical and Bio-organic Mass
    Spectrometry Washington University- Michael Gross
  • National Resource for Biomedical Accelerator Mass
    Spectrometry
  • LLNL- Ken Turteltaub

3
The Primary Goals of the NRPP
  • Develop computational and bioinformatics tools
    for proteomics.
  • Provide the datasets required to build predictive
    organismal models.
  • Develop and prove technologies needed to produce
    proteomics data.

4
NRPP Principal Technologies
YTH Protein Interactions
Informatics
NRPP Dissemination System
PRIME
Pull-downs Interactions
Cytoscape
Computational Tools
Data Sets
IM Browser
Organismal Models
Proteome Commons
5
Protein Interaction Maps
Drosophila 6,935 nodes, 20,004 edges Giot et al.
(Curagen) Science, Dec 2003
Drosophila 737 nodes, 2,363 edges Stanyon et
al. Genome Biol., Dec 2004
Campylobacter jejuni 1,325 nodes, 11,484
edges Parrish,et al. unpublished
6
C. jejuni Interaction Map
  • 1,654 ORFs
  • 1,477 cloned
  • 336,000 assays
  • 16,172 interactions
  • 11,616 confirmed
  • 2,829 high confidence

7
Overlap with Reference Sets
Predicted interologs of E. coli reference
interactions
Freq. per 5,000 Random Interactions
Rate of overlap gtgt greater than expected by
chance Campy data enriched for true positives.
8
Chemotaxis Signal Transduction Pathway Mapping
in C. jejuni
Cj0144
Cj0262c
Cj1110c
Cj0448c
Cj0951c
Cj0019c
CheW
Cj1670c
Cj0959c
CheV
CheA
CheY
Cj1012c
Cj1004
FliM
MotB
FliL
FliY
9
Integrating Interaction Data
  • Drosophila Gene and Protein Interactions Database

http//proteome.wayne.edu
Lana Pacifico
10
Conserved Complexes
  • PathBlast - Ideker

C. jejuni
E.coli
C. jejuni
yeast
11
Expression Profiling
  • Protein Expression Profiling
  • PTM profiling
  • Integration of mRNA expression data with
    interaction maps.

12
Anthrax Infection, Human Lung 2001
Armed Forces Institute of Pathology, Dept of
Radiology
13
B. anthracis Spore Germination Occurs Rapidly
germination
outgrowth
Does the proteomic profile change during
germination?
14
The Abundance Ratios of Proteins Vary During
Germination
Protein distribution
Mean 1.00 St Dev .18
Protein distribution
Proteins degraded
Proteins synthesized
Mean 1.04 St Dev .21
2 min/0 min
Protein distribution
Mean 1.12 St Dev .32
7 min/0 min
17 min/0 min
15
Heat Map Based on Standard Deviations
16
Classification of 37 Proteins that Decrease
During Germination
  • 8 Small acid soluble proteins-required to protect
    DNA in the spore
  • 7 Spore coat proteins-5 known and 2 new
  • 1 Spore protease
  • 8 small hypothetical proteins (lt10 kDa)
  • 6 conserved hypothetical proteins (gt10 kDa)
  • 7 misc known proteins

17
Small Acid Soluble Protein BAS0493 Identification
of Cleavage Sites



1 MSKKQQGYNKATSGASIQSTNASYGTEFATETNVQAVKQANAQSEAKK
AQA 51

52 SGASIQSTNASYGTEFATETDVHAVKKQNAQSAAKQSQSSSSNQ
end



Tryptic peptides
GPR protease
Alt trypsin site
18
Time Course ProfilesA549 cell response to TGFbeta
19
Proteome Informatics
  • Open Source code (most efficient approach for
    Proteomics)
  • Standardized on Java
  • IO Framework, MS Expedite, Data Extractor,
    various modules
  • Proteome Commons- Open Source site
  • The PC Dissemination System (DFS)

20
Interactions with other NCRR Centers.
  • PNL- Richard Smith
  • Proteome Commons
  • DFS collaborations in progress
  • Software development- coordination of efforts.
  • North Dakota INBRE- Donald Sens
  • Training, Bioinformatics infrastructure
  • UCSF- Burlingame
  • PRIME Protein Prospector
  • DFS- planned.

21
Proteome Commons
  • Services
  • One stop for most proteomics Open Source tools
  • Data sets to develop and test new algorithms
  • Code development tools (versioning, Subversion)
  • Proteomics news and announcements
  • Distributed File System (The DFS)

22
Data Sharing and Publishing Challenges and
Solutions
  • Bandwidth/file transfer rates.
  • New journal guidelines ask for data access.
  • The vanishing webserver syndrome.
  • There are some file and annotation standards and
    efforts to develop more.

23
DFS
  • Peer-to-Peer (P2P) Distributed File System
  • Open, simple, cross-platform protocols
  • e-Commerce-grade encryption makes it appropriate
    for scientific research (allows peer-review and
    traceability)
  • It can easily grow to accommodate very large
    amounts of data and users
  • Commodity hardware! 0.37 per GB storage

24
(No Transcript)
25
(No Transcript)
26
Current Status
  • 16 TB over 13 servers (46 TB online in June)
  • 1 BioChem, 8 NRPP, 1 ISB, 1 MSU, 1 UC Davis, 1
    UNC
  • 1.3 terabytes of MS/MSMS data (all public data
    available).
  • Docs, tools, code, credits and more
    http//www.proteomecommons.org/dev/dfs
  • Data sets
  • TheGPM, PNNL, Aurum, QqTOF vs QSTAR, sPRG ABRF
    2006, HUPO PPP
  • PeptideAtlas, OPD

27
Acknowledgements
  • Collaborators
  • Curt Wilkerson
  • Brett Phinney
  • Eric Deutsch
  • Joshua Adkins
  • Richard Smith
  • Al Burlingame
  • Janine Maddock
  • Venkat Keshamouni
  • NRPP
  • Xuequn Chen
  • Jayson Falkner
  • Brian Maso
  • Panagiotis Papoulias
  • Gary Rymar
  • Eric Simon
  • John Strahler
  • Peter Ulintz
  • Donna Veine
  • Angela Walker

28
Pathway/Network BLASTRoded SharanSilpa
SuthramRyan KelleyPeter UetzRichard
Karp Genetic interactionsRyan Kelley Functional
orthologsSourav BandyopadhyayRoded
Sharan Transcriptional pathwaysChen-Hsiang
YeangCraig MakScott McCuineChris WorkmanTommi
Jaakkola
CytoscapeBenno SchwikowskiChris SanderAnnette
AdlerLeroy HoodGary BaderEthan CeramiKei
OnoNerius LandysRyan KelleyChris
WorkmanMelissa ClineRichard BonneauRowan
ChristmasIliana Avila-CampilloAllan
KuchinskyAditya Vailaya
Funding NCRR, NIGMS, NSF Packard Fellowship
Unilever Websites www.pathblast.org
www.cytoscape.org
29
DFS Home
  • https//dfs/proteomecommons.org/

Project funded by NCRR
Write a Comment
User Comments (0)
About PowerShow.com