Title: MetaCore data analysis suite and functional analysis
1MetaCore data analysis suite and functional
analysis
2Knowledge-based functional data analysis
Cancer relevant annotations, datatabases, Active
cpds analysis screening
Data parsing, normalization
HTS, HCS
3Knowledge base Three interactions domains
4Pathways are assembled of interaction building
blocks
4
5Knowledge base gt100,000 biological pathways
Pic.1
Ligand Receptor Second messenger Enzyme
TF
Ligand TF TF target Reaction metabolit
e
Ligand Receptor Adaptor Enzyme TF TF
target
5
6Distribution of interactions by mechanism
7Molecular functions in Database
8Compounds distribution by target
9Protein complexes/groups (species-specific)
Protein structure map (group, complex or group of
complexes) is shown on protein page
Chosen protein highlighted
10Disease maps. Delta 508 degradation in cystic
fibrosis
11Pathways Integration
- Interactive, static maps
- 400 maps
- Signaling, regulation, metabolism, diseases
- Backbone of formalized state of art in the field
12Concurrent visualization of different data types,
experiments
Agilent
Affymetrix
Proteomic
SAGE
13Types of biomarkers
Biomarker is a substance used as an indicator of
a biologic state Wikipedia
Biomarkers
Trust, utility
- Gene signature validation
- Large scale studies
- Statistical models
- Statistical clustering
- Need validation studies
- Long term and expensive
- Poor in cross-platform
- Poor robustness
- No mechanism
- Functional descriptors
- Large scale studies
- Statistical models
- Functional clustering
- Need validation studies
- Long term and expensive
- Validated cross-platform
- High robustness
- Mechanism
14Functional analysis must run in parallel to
statistical signatures. Why?
Differentially expressed genes, proteins
(normalization, QC)
Statistical analysis, Gene signatures
- Functional analysis
- Pathways,
- Networks
- Process ontology
- Disease ontology
- Toxicity ontology
15Gene signatures usually dont make functional
sense. No concise networks
70-genes metastases signature tVeer DI network
van 'T Veer L. J., Dai H, van de Vijer, M.J., ,
He Y.D., et al, Gene expression profiling
predicts clinical outcome of breast
cancer. Nature, 2002, 415, 530-36 Wang Y. et al.
Gene-expression profiles to predict distant
metastasis of
lymph-node-negative primary breast cancer.
Lancet, 2005, 365, 671-79
1676-genes metastases signature Wang DI network
Gene signatures usually dont make functional
sense. No concise networks
van 'T Veer L. J., Dai H, van de Vijer, M.J., ,
He Y.D., et al, Gene expression profiling
predicts clinical outcome of breast
cancer. Nature, 2002, 415, 530-36 Wang Y. et al.
Gene-expression profiles to predict distant
metastasis of
lymph-node-negative primary breast cancer.
Lancet, 2005, 365, 671-79
17Knowledge-based functional analysis
- Enrichment by category
- Signaling processes
- Metabolic processes
- Diseases
- Toxicities
- Networks
- Based on binary interactions
- Multiple algorithms
- Filers for specificity
Match IDs
Match IDs
- Parsed data
- -Gene IDs (LLs, GeneBank)
- Protein IDs (Uniprot)
- Compound IDs (CAS, smiles)
Match IDs
- Canonical pathways
- Signaling and metabolic pathways
- Linear, 12-13 steps
- vertical from MR to TF to effect
18Top canonical maps for two signatures for same
condition (metastases in invasive breast
cancers). Not comparable
Wang
Vteer
19Functional analysis tools
Resolution
- Experiment filters
- Species, orthologs, localizations, tissues etc.
- Custom list of targets, IDs
1000 genes Multiple sets
- Enrichment analysis for gene, protein, compound
sets - Hyper G, GSEA, GSA etc.
- Multidimensional analysis multiple ontologies
- GO processes
- GG processes
- Canonical pathways
- Diseases
- Export of sub-sets for network analysis
- Low resolution
- Interactome analysis
- Whole-set analysis
- Over- and underconnected nodes in the dataset
- Interactions neighborhood
- TFs, kinases, receptors, etc.
- Scoring for interactions within set FDR
- Network analysis
- Multiple pre-filters (species, interactions
mechanisms, - organelles etc.)
- Parameters enrichment with genes from set,
- canonical pathways, specific protein classes
- Algorithms SP, DI, AN, TFs, Receptors etc.
- Statistics hubs, preferred pathways etc.
- Highest resolution individual proteins or
isoforms
Most important genes
- - Highly connected
- TFs, receptors, etc.
- Hubs from important networks
- Highest expressed/mutated genes
20Interactome analysis
A
B
P21 206 interactions in 16,000 proteins db
All proteins in the database
Amplicome
Mutome
- Calculates local and global connectivity
- for arbitrary datasets and lists
- Intra-connectivity within set
- Inter-connectivity
- Between sets
- Between set and all proteins in database
Under-connection
Over-connection
Expectation
P21 expect 5 interactions in 320 proteins
dataset
3 interactions for P21
9 interactions for P21
21Data workflow
22Interactome analysis
A
B
P21 206 interactions in 16,000 proteins db
All proteins in the database
Amplicome
Mutome
- Calculates local and global connectivity
- for arbitrary datasets and lists
- Intra-connectivity within set
- Inter-connectivity
- Between sets
- Between set and all proteins in database
Under-connection
Over-connection
Expectation
P21 expect 5 interactions in 320 proteins
dataset
3 interactions for P21
9 interactions for P21
22
23Transcription regulation between datasets
A
B
23
24Interconnectivity between mutome, amplicons
24
25Functional synergy in enrichment analysis
EA in ontology KEGG, GO, GeneGo pathways,
disease biomarkers etc..
Unconnected datasets
Pathway 1 p-value 4e-8 Pathway 2 1.2e-7 .
Synergistic datasets
25
2626
27DM data parsers
- Custom interactions data
- Y2H
- Pull-down
- Co-expression
- annotation
Structures sdf, MOL
HTS, HCS
Custom maps, networks, pathways
Metabolites
Gene lists
Molecular bio data
ISIS DB
HTS, HCS
Pathway Editor
MapEditor
MetaLink
Structure parser
General parser
Metabolic parser
28 Batch upload Browse files in
2. Click Next
1. Browse in file to upload. For batch upload,
choose zip archive files
29Experimental filters and options
30Results management
- Analysis results saved in different formats
- - Gene lists
- - Network object lists
- - Workflows
- - Network lists
- - Networks
- - Netshots
- - New maps
- Hierarchical folders structure
- One-button activation of experiments
- Graphics saved in up to 300dpi format
- (cover publication quality)
31Data export
- Experiments
- Gene lists
- Network object lists
- Excel or Word or text formats
32Choose a disease
33Cancer-related Diseases and Maps Distributions
by GO terms
Ontologies that contain specified targets will be
emphasized with green font
34Top 5 Maps will be saved to report
35Network are built and saved to report and to the
workflows folder in the Data Manager. Network
statistics are generated and reported for
common, similar, and unique genes.
36Included in MetaCore
37MetaSearch overview
- Flexible way to extract data from
MetaCore/MetaDrug Discovery Platform
(MetaDiscovery) - Real-time communication with MetaDiscovery tools
- Results can be visualized and analyzed in
MetaCore and MetaDrug
38(No Transcript)
3924. The query is complete
4025. Click Go!, to launch MetaSearch
4126. As a result we obtain a list of 27 chemicals
(from over 600 000 chemicals from the database),
that satisfy our conditions Interact with
human beta 2 adrenoreceptor, activating it at
less than 100 nM concentrations
42(No Transcript)
43MapEditor Lite/MetaMap
AFTER
BEFORE
Wet lab research, Writing reviews
Minutes Hours
Days? Weeks?
44MapEditor
Additional Localizations can be added
45Your NEW map is now an interactive part of
MetaCore
Users can visualize their experimental data on
the new map
46Direct export of interaction refs to EndNote