Title: Dr Chris Southan, Protein Atlas Database Manager, Oxford Glycosciences UK Ltd
1Dr Chris Southan, Protein Atlas Database
Manager, Oxford Glycosciences UK Ltd
2Proteomics - fulfilling the promise of genomics
BioMarkers
Therapeutic Antibodies
Target Protein
Small Molecule Drug Candidate
Human Genome
3Genome gt Transcriptome gt Proteome
Secreted protein Bio-markers - quantity? -
origin ?
transcriptional control
translational control
post-translational control
protein complex formation
- Drug target
- Composition
- of complexes
DNA
RNA
Gygi et al. 1999 MCB 19, 1720-1730.
Genomics
Proteomics
Array
4Limitations of the Transcriptome as a Surrogate
for the Proteome
- Lack of correlation, qualitative as well as
quantitative - Limited sub-cellular or translocation information
- Limited insights into protein isoforms
- Ambiguous assignment of an EST to a
protein-coding gene
5How Accurate is the Transcriptome ?
6How Accurate is the Transcriptome?
7 Protein Isoforms
- Protein isoforms are numerous, highly conserved,
and change in a disease-associated way - These isoforms both yield novel insights into
pathogenic mechanisms and provide drug targets,
biomakers - Discovery of protein expression or isoforms by
genomics is strictly limited - Unambiguous annotation of the human genome to
find protein-coding genes can only be achieved
through proteomics
8OGS Proteomics Technology Principles
The whole process is fully integrated by state of
the art proprietary bioinformatics tools
9Proteomics Analytical Platform
- Customer Sample
- Preparation
Biological Samples
Peptide Sequence
Protein Extract
Detection
Separation
Selection
- Protein of Interest
- Customer Request
- Post Separation
- Fluorescence
- High Resolution
- SDS-Page
- 1D 2D
- m-HPLC
- Total Extract
- Serum Depletion
- Fractionation
DNA/EST Database
10First generation proteomics platform 2-D gels ?
MS
11Matrix Technology
Proprietary attachment chemistry
3
4
IEF Zoom
10
3
IEF Std.
facing plate
gel plate
bar-coded
2nd dimension polymer
- reproducibility
- long-term archiving
12Comparison of Staining Methods
Limit of Protein MS
Detection Modification Compatible Coomassie 100
ng No Yes Blue Silver Stain 1.0
ng Yes No Fluorescent .1 ng No Yes OGS Dye
Dynamic range 4 log orders
13Creating a Proteomics Database
Rep No1
Rep No2
Rep No3
Composite Gel
Master Cluster Feature
Database Entry
Master Group
14Electronic Mastergroup
15Serum High Resolution Maps
3-10
gt1200
700
Original serum
Enriched serum
gt2000
6-8
pH Range 3.5-5
5-7
16Bio-Markers from fluids only accessible to
proteomic technologies
- Accessible body fluid eg. blood, urine, CSF
- dynamic markers quantitative variations
- Cross-sectional and longitudinal analysis of
entire biofluid proteome - Discovery of dynamic bio-markers of disease
progression and therapeutic response - Essential for informed clinical trial
decision-making and monitoring - Critical to rank and select new therapeutic
modalities
17Mapping peptides to the human genome
2-D PAGE
ICAT
- Disease/protein/gene Target Class
- Identify disease protein Cell membrane
- Protein sequence Kinases
- Map to gene Proteases
- Complementary approaches Other
18Isotope Coded Affinity Tags (ICAT)
Sample Type 1
Sample Type 2
Fractionate?
Combine
Fragment, e.g., Trypsin
Affinity Purify Labelled Peptides.
19ICAT Affinity Tags
20ICAT Identification of Proteins Following
Quantitative Differential Analysis
Quantitation e.g., LC-MS, MALDI,
ESI-TOF.
Int.
m/z
K
L
Y
G
P
Identification LC-MS/MS
m/z
21How Many Protein-Coding Genes are in the Genome?
- SwissProt / TREMBL (SP/TR) 24211
- Ensembl 3.26.1 34019
- SP / TR / Ensembl 33817
- SP / TR / Ensembl / Ref Seq 54687
- International Protein Index
22Protein Mapping Direct to Genome
peptide sequence
23Mapping peptides to the human genome defining
gene structure
- Map all peptides from protein
- Determine exon structures
- Determine gene structure
- Detect novel exons/splice forms/genes
24Mapping peptides to the human genomeMaking the
Protein Atlas
25Implementation of Proteomics for Cancer
- Critical need for proteomics
- Need to adopt an integrated approach which
- reflects biological processes and clinical
relevance - Proteomics Cancer search gives 18 title hits
in PubMed but none before 1999
26Proteomics for Cancer discovery
- NCE Targets
- Whole cell proteomics
- Highly purified cancer/normal samples
- Sophisticated data analyses
- Direct selection of cancer targets
-
- 2. Antigens for immunotherapy
- Membrane proteomics for candidate antigens
- Cancer targets defined by proteomics other
means -
27Proteomics - Discovery of Protein Targets for
Breast Cancer
Clinical Network
Ludwig Institute of Cancer Research
Disaggregation
Selection
- immunomagnetic
- cell sorting
Cell types
- gt98 purity
- luminal
- myoepithelial
- endothelial
- fibroblasts
28Human Breast Cancer Staging
Normal
HUT Hyperplasia of U Type (Pre invasive)
ADH ( Atypical Ductal Hyperplasia)
DCIS (Ductal Carsinoma In Situ)
LCIS (Lobular Carcinoma In Situ)
IDC (Invasive Ductal Carcinoma) 60 of Cancers
ILC (Lobular Ductal Carcinoma) 30 of Cancers
Grade I
Grade I
Grade II
Grade II
Grade III
Grade III
PE
29Purified luminal vs non-purified cells
30Human Breast Cancer - Protein Database
Pleural Effusions
Normals
Primary Tumours
Breast Cancer Cell Lines
Breast MasterGroup
202 Samples 9809 Protein Features
31Breast Cancer - Stage Specific Proteins
IDC vs. normal
ILC vs. normal
19
123
19
15
2
34
32
p lt 0.05
PE vs. normal
32243 Breast Cancer Targets Markers
- Proteases
- Signal transduction
- Kinases phosphatases
- Metabolism
- Lipid processing
- Chaperones
- Stress proteins
- Structural
- Cell-cell interactions
- Nuclear proteins
33Breast Cancer Antigens - immunotherapy
- Membrane preps from 3 breast cancer cell lines
and 1 large primary, 1D 2D gels -gt MS - Huge enrichment for membrane / secreted proteins
( cell surface ) - Discovery of known antigens, known membrane
proteins, novel antigens - Selection of candidates for immunotherapy
- Ranking via predictions mRNA profiles -
overexpression in cancer, restricted vital organ
expression, and trans membrane domains
34Membrane Antigen Discovery
35Membrane Antigen Discovery 1D gel proteomics
- Better than 2D gels for cell membrane
preparations - Easily adapted to OGS MS identification
technology and data processing - More high MW transmembrane domain containing
proteins identifed
36Membrane Antigen Discovery
Known Antigens Known proteins - new
antigens Novel proteins ErbB2 BCMP453 668H4
Muc-1 NCAM2 BCMP11 ICAM1 FMLP-R BCMP84
CD98 ADAM10 Muc-like GA733.1/2 KIAA0152
BCF317 EGF receptor Ion Channels PECAM Tra
nsporters CD44
37BCMP11 BCMP84 BCMP453 QT PCR across tissue
samples
BCMP11 BCMP84 BCMP453
80 prevalence
38BCMP84 Immuno Cytochemistry ( rabbit
polyclonals )
aBCMP84 MDA468s breast cancer cell line
-ve cell line T47D breast cell line
Membrane staining
39BCMP84 - ICC MDA468s - confocal demonstration
of membrane localisation
40Breast cancer proteomics summary
targets and markers with direct cancer
relevance purified, characterised
samples stringent selection - novel proteins,
disease specific isoforms novel NCE targets and
biomarkers cancer antigens broad
application to many cancers discovery
validation using easier to obtain
samples novel immunotherapy targets
41Acknowledgements
Dr Christian Rohlff and the OGS Operations
team Dr Jon Terret and the OGS Molecular Biology
team