Title: Too many matches
1Too many matches
2Too many matches
What are the potential TF sites involved in
regulation of my gene of interest ?
Lets run MatInspector over the promoter region
of my gene
3Too many matches
Where do I get my input promoter DNA sequence
from?
Lets extract from NCBI. 3kb upstream of TSS to
be sure to have the promoter
4Too many matches
Which of those matches are relevant? How do I get
rid of all those false positives ?
5TF binding sites
There is not a single false positive match
MatInspector gives you all physical TF binding
sites
A physical TFBS is found every 10 to 15 bps
throughout the genome
A single isolated TF binding site carries no
function
TFs work through complexes which are represented
on sequence level through sets of TF binding
sites in certain distance relationship and
orientation -gtpromoter frameworks
6TF binding sites
Okay, what is now a physical TF binding site
? What is a functional TF binding site?
7False positives?
8Physical vs functional TFBS
...but binding proteins are present only in 2
cell types! -gt no functional binding site
in the other 3 cell types!
One binding site, five cell types...
...biological function may require additional
binding sites!
Even when binding proteins are present...
9Transcriptional modules
Transcriptional modules integrate signals via the
interacting TFs
10Why uses nature modules?
No common organization?
Common modules!
11Transcriptional modules
Synergistic
Antagonistic
Synergistic
Composite elements
Short range module distance 50 bp
Looping module distance up to 300bp
Short range module distance 50 bp
or
or
Binding Affinity
High / Low Is possible
High / Low Is possible
High / Low Is possible
High / High only
12Transcriptional modules
Induced by 2 pathways !
13Transcriptional modules
14Transcriptional modules
Gene A, transcript n
Protein complex
15Transcriptional modules
16Transcriptional modules
Experimentally verified evidence that TFBSs from
modules, which are crucial for regulation in one
biological context (cell type), are totally
irrelevant in another !
Fessele, S., Maier, H., Zischek, C., Nelson,
P.J., Werner, T. (2002) "Regulatory context is a
crucial part of gene function" Trends in
Genetics 18, 60-63 (MEDLINE 1181130)
17Transcriptional modules
18Promoter sequences
Very interesting but how does all this help me
with my original question ? The question still
is What are the potential TF sites involved in
regulation of my gene of interest ?
19Promoter sequences
Where do I get my input promoter DNA sequence
from?
Lets extract from NCBI. 3kb upstream of TSS to
be sure to have the promoter
20Promoter sequences
3 kb is too large for meaningful analysis
even going 10kb upstream of TSS is no guarantee
to have the relevant promoter sequence
multiple promoters are the rule, not an exeption
the non-coding first exon is always part of the
promoter
Huh? What does this mean ? Where do I get this
damn promoter now?
21Alternative transcripts/promoters
22Alternative transcripts/promoters
23Alternative transcripts/promoters
Data from ElDorado
24Alternative transcripts/promoters
Alternative promoter usage is often tied to
regulation of tissue specific gene expression
Alternative promoter usage is of very high
biological relevance. There are several examples
where aberrant regulation of the identical
primary transcript leads to severe biological
effects
25Alternative transcripts/promoters
The gene product is absolutely identical. The
only difference is in the alternative promoter
usage. On transcript level this can be seen only
in the non-coding first exon.
26Promoter Analysis
1. Identification of the promoter sequence
2. Prediction of physical transcription
factor binding sites
3. Functional context
4. Context dependent functional
transcription factor binding sites
27ElDorado promoter sequence retrieval
Yes! I know all of this! I just wanted to know
from where I can get my promoter sequence(s)
easily!
If you dont have one already, sign up for a free
evaluation account. first... ... then login here!
www.genomatix.de
28ElDorado promoter sequence retrieval
29ElDorado promoter sequence retrieval
30ElDorado promoter sequence retrieval
IMPORTANT! Affymetrix probe-set-ID input Our
annotation is NOT based on the Affymetrix NetAffx
assignment!It is rather based on genomic mapping
of each single probe. A transcript will be
retrieved if at least one probe of the set
(usually 11 probes) matches. For mixed probe
sets (cross-hybridisation), all relevant
transcripts will be retrieved, which might lead
to a result with transcripts from different loci.
Input in this section delivers results based on
gene name or keyword search. Over a million of
names, synonyms and gene IDs help to find what
you want - fast!
HMGCS1 ( for example)
Input in this section delivers results based on
ultra fast sequence mapping. Copy and paste raw
sequence data here (min.15 nucleotides) or enter
an accession number. In contrast to the entry of
an accession number above, here the sequence is
actually retrived from data base and mapped onto
the genome(s). NOTE many EST based accession
numbers have poor sequence homology and deliver
no result.
31ElDorado promoter sequence retrieval
32ElDorado promoter sequence retrieval
This gives you an interactive graphical
representation of the genomic context of your
gene
33ElDorado promoter sequence retrieval
mapping positions of Affymetrix single probes !
switch display of components on and off
scale/slide the retrieved genomic "window"
select regions of the graphics and safe them into
a file
Orange indicates your input. In this case a gene
name. It is very informative when your query is
based on sequence data. Then you see the mapping
positions.
Everything is clickable just play around !
Here you can scale the view
34ElDorado promoter sequence retrieval
Clicking on this trancriptional start region
(TSR)...
...displays this hyperlink to ...
Now we have zoomed into the promoter region
35ElDorado promoter sequence retrieval
...this profile of the different experimentally
verified TSS (CAGE tags) in the different tissue
types.
36ElDorado promoter sequence retrieval
This is a table-like representation of all
annotated elements. It is especially useful for
quick and easy retrieval of the dna sequence(s)
of interest.
37ElDorado promoter sequence retrieval
Tick/un-tick the boxes of what you would like to
see, and then...
38ElDorado promoter sequence retrieval
This for instance...
...tells you that this SNP deletes three
potential TF binding sites and creates a new one.
A potential regulatory active SNP...
39ElDorado promoter sequence retrieval
from here you can directly run a MatInspector
analysis for this promoter...
...again,play around with the interactive
graphics... Click the symbols and jump right
into MatBase, the TF knowledge base..
40ElDorado promoter sequence retrieval
now, finally the first way to extract a promoter
sequence ... ...and/or any other element
displayed in the list below.
Choose your desired length. Unless you have good
reason to change the length of the proximal
promoter, leave the defaults!
41ElDorado promoter sequence retrieval
This shows you all annotated alternative
transcripts plus all Affymetrix probe set single
probe mappings plus
another way to extract your promoter
sequence(s)
42ElDorado promoter sequence retrieval
You know this already... Three different known
transcripts for this locus...
... and four distinct promoters ! How this
comes, Ill tell you in a minute
43ElDorado promoter sequence retrieval
Tick the promoter of your interest...
Or submit the promoter directly to
MatInspector for graphical analysis. It works on
a single sequence, too.
Or submit sequences directly to one of those
tasks. But they make sense only with multiple
sequences. More on that later!
...choose format...
...and extract the sequence.
44ElDorado promoter sequence retrieval
But why do I have four promoters here? And two
even dont have a transcript assigned, as it is
written here! And whats all that CompGen thing
about?
The multiple promoter thing I showed you before.
Remember the GCK example, liver and
pancreas? Now to the CompGen promoters. They are
derived by a proprietary comparative genomics
approach.
45ElDorado promoter sequence retrieval
46ElDorado promoter sequence retrieval
The tick-boxes you know already... We need them
for later promoter retrieval.
For our example we have an homologous locus
assigned in chimp, macaca, human, rat, dog, cow,
opossum, chicken, and zebrafish.
Note the Promoter Set number !
Exhaustive cross-mapping of all transcripts to
all genomes of all organisms in ElDorado
generates our homology groups.
47ElDorado promoter sequence retrieval
Get a feeling for the degree of phylogenetic
conservation of the resp. promoter. See how
much experimental evidence supports this promoter
48ElDorado promoter sequence retrieval
A Promoter Set represents phylogenetically
conserved promoters
You should be familiar with this view,
now. Here the orange indicates a promoter
belonging to a promoter set.
With these tick-boxes you can switch on and off
the display of the different Promoter Sets
49ElDorado promoter sequence retrieval
Dont waste my time here! How do I get my
promoter sequence now? And which one of all
those promoters should I take ?
Well, which one? If you do not have any other
information (experimental or from literature), I
would recommend that you consider all available
alternative promoters for further analysis
50ElDorado promoter sequence retrieval
Dont waste my time here! How do I get my
promoter sequence now? And which one of all
those promoters should I take ?
Two easy ways of promoter sequence retrieval by
two mouse clicks I showed you some minutes
ago. There are more...
oh... you cannot access these options?
51ElDorado promoter sequence retrieval
Otherwise it is slightly more cumbersome...
... and use that for sequence retrieval from your
second to Genomatix favorite system, e.g. NCBI
Use one of the options I showed you before and
get Contig and positional information...
Hint If you are interested in the TF results
rather than the sequence, use the search for
common transcription factor binding sites option
as shown before.
52From physical to functional TF site
Quite interesting But I am not a single step
closer to the answer of my real question What
are the potential TF sites involved in regulation
of my gene of interest ?
Well, I think you are. Essential first step is to
analyze the right sequence in a length that
allows for meaningful results. Now that you have
the real promoter sequence(s), lets see how to
go on from here...
53From physical to functional TF site
Then we have to look for additional evidence that
some of the physical TF sites might be functional
ones. Best would be to go for a ChromatinIP
experiment. However, for such you would need some
hints for which TF to make or buy antibodies.
Further computer analysis is required
anyhow! There are three different roads to go...
The ideal situation for determining potential
functional binding sites would be to have a set
of genes apparently being co-regulated in the
given cellular and experimental context, f.i.
from a microarray experiment. A comparative
promoter analysis with FrameWorker would very
likely give you a pattern of involved TFs, as
shown in numerous publications (see our web site
at About us -gt Publications).
But I have only a single gene. And thats the
one I am interested in!
54From physical to functional TF site
We talked about promoter modules before. Search
your sequence for promoter modules with
ModelInspector. Our Promoter Module Library
contains over 550 promoter modules, each of them
experimentally verified to carry transcriptional
regulatory activity. A module match increases
probability that an involved TF site is
functional.
Okay, how do I do this? Lets go !
Look for phylogenetically conserved patterns of
TF sites in a comparative genomics promoter set
with FrameWorker. TFs being part of such
phylogenetically conserved frameworks carry
higher probability for being functional.
Do extensive literature data mining with
BiblioSpherePE for known TF correlations, pathway
analysis and gene set creation for comparative
promoter analysis. TFs showing biological
activity in another experimental context are
functional (at least in that context).
55ElDorado promoter sequence retrieval
Lets start with an analysis for promoter
modules...
56Search for promoter modules
If you are licensed, you can have a quick look
at the promoter module library. Each module is
experimentally verified to carry regulatory
activity.
57Search for promoter modules
Choose a sequence file from your directory
Or copy paste a raw sequence here. or you know
the rest !
Dont click anything below, unless you want to
scan an entire data base !
58Search for promoter modules
59Search for promoter modules
60Search for promoter modules
61Search for promoter modules
Now we have focused down to 21 very interesting
positions in this promoter with modules that are
composed of a total of 11 different transcription
factor binding sites. Our arbitrary chosen
example HMGCS1 belongs to the cholesterol
biosynthesis pathway. Some of the found promoter
modules do have proven function in sterol
regulation!
Wow! Thats impressive! But that example is a
mock-up, isnt it?
Not really. It is a nice example to show this
approach. Very frequently one finds functionally
related modules. However, there is no
guarantee It adds just another line of evidence.
62Phylogenetically conserved frameworks
Thats right. For this approach you first need a
set of phylogenetically conserved promoters.
Remember several slides before ?
Okay, how does the other thing help? How did you
call it, phylogenetically conserved frameworks?
Not really. It is a nice example to show this
approach. Very frequently one finds functionally
related modules. However, there is no
guarantee It adds just another line of evidence.
63ElDorado promoter sequence retrieval
and tick the promoters of one set.
In this example I choose Promoter Set 3 for
human, rat, dog and cow.
...scroll to the top of the page...
Inspect and choose your Promoter Set...
64Phylogenetically conserved frameworks
...scroll down...
Great ! That is what I really want to know
Which TF sites do they have in common?
From here you can have a look at TF binding sites
which are common to the input promoters
65Phylogenetically conserved frameworks
Great ! That is what I really want to know
Which TF sites do they have in common?
This is not more than a tiny hint! I can show you
many cases where totally unrelated exons do have
more TF sites in common than closely co-regulated
promoters. What you are really looking for is a
conserved pattern of TF sites. And we are going
to do so. But first lets have a look on the
nucleotide sequence level...
Be careful !!
66Phylogenetically conserved frameworks
DiAlign TF gives an overlay of a true multiple
sequence alignment (not pairwise) and common TF
sites. Check DiAlign for other sequences
(including amino acids)! It is extremely fast
and especially powerful for finding short
homologies in largely unrelated sequences.
67Phylogenetically conserved frameworks
The parameters should be self explanatory.
You can always click for help
68Phylogenetically conserved frameworks
Here an output example.
69Phylogenetically conserved frameworks
Why did you do this? What does it tell me?
It is pretty informative to get a feeling for the
degree of homology, which parts are more
conserved than others and which TF binding sites
reside in the homologous parts. Then, it is of
interest to see where the evolutionary pressure
was rather on functional conservation (TFBS) than
on sequence conservation.
70Phylogenetically conserved frameworks
Why did you do this? What does it tell me?
Then, if you do a framework analysis on two
highly homologous sequences we run into a
combinatorial explosion. FrameWorker checks for
it and might give you a warning. However, in this
case everything is fine...
71Phylogenetically conserved frameworks
Why did you do this? What does it tell me?
If you do a framework analysis on two highly
homologous sequences we run into a combinatorial
explosion. FrameWorker checks for it and might
give you a warning. However, in this case
everything is fine...
Now, we finally go to the FrameWorker analysis!
72Phylogenetically conserved frameworks
This filter is a positive filter! Only TFs known
to be associated with a tissue are listed here. A
TF not listed in a certain tissue does NOT mean
that it is not expressed there! It just has not
been reported, yet.
Here you can choose the matrix library
Here you can select for TFs only, known to be
associated with certain tissues.
73Phylogenetically conserved frameworks
More options gives you...
...well, more options !
Dont change those parameters unless you know
exactly what you are doing !
74Phylogenetically conserved frameworks
If you know that a certain TF is involved in the
regulation of your gene, make it a mandatory
element and search only for frameworks containing
such. Mandatory elements are most helpful in
focusing your analysis. If you dont know one a
priory, Ill show you later in BiblioSpherePE how
to get to those. Toggle multiple choices by
holding the "Ctrl" key when clicking!
This decides the number of input sequences which
have to show a common pattern of TF sites
One word on this parameter. It decides the
minimum/maximum number of TF sites being allowed
in one framework. In this case I increased the
default value from 6 up to 10 since we want to
identify the largest conserved pattern in this
phylogenetic promoter set. We might lower this
later.
This sets the distance constraints between two
adjacent TF sites. More important than the
absolute distance is the distance variance.
Always start at default values (unless you know
already better) and relax gradually if nothing
meaningful is found.
And always think about the HELP pages !
This option gives you an idea of the specificity
of the found frameworks. It checks how often a
framework would be found in a background of 5.000
random human promoter sequences.
Use it with care! It slows down FrameWorker
considerably!
75Phylogenetically conserved frameworks
The longest frameworks contain 8 TF sites. There
are 4 different frameworks. If you click the
link, you jump direct to the graphical
representation
All four promoters have 18 TF sites in common.
This number might differ from the search for
common TF job earlier, since now we take strand
specificity into account.
76Phylogenetically conserved frameworks
Here you see the detailed description of the
framework. It is perfectly conserved throughout
the species
You can save this framework in your personal
directory for subsequent sequence or database
scans
Here you have a graphical representation. You
already know how this works...
Scroll down to the bottom of the page...
77Phylogenetically conserved frameworks
At the bottom of the output you find this list.
Now we not only have identified the TFs but also
the exact positions which are worth a closer
look. You can scan with your saved frameworks all
of our promoter databases for promoters with
similar organization.
Why should I do this? Would this give me
additional information ?
78Phylogenetically conserved frameworks
In this example with an 8 element framework and
almost no distance variation between the TF
sites, you will find exactly 1 match in over
56.000 human promoters the input gene. How to
use this approach with less selective frameworks
for identification of similarly organized
promoters? I'll show you later
79Knowledge based analysis
Yes. The third is knowledge driven and bases on a
combination of literature data mining, sequence
analysis and pathway/network analysis. For this
you need first to download and install the Java
client of BiblioSpherePE
Fine! I think I have seen now two strategies.
You mentioned three?
80Knowledge based analysis
81Knowledge based analysis
For more detailed introduction to BiblioSpherePE
please have a look at http//www.genomatix.de/pr
oducts/BiblioSphere/BiblioSpherePE5.html
82Knowledge based analysis
...un-tick this box... We are interested in the
full network around our gene, not only the
connected transcription factors
Choose "single gene" here...
HMGCS1
83Knowledge based analysis
84Knowledge based analysis
This sets the context sensitive filter
stringency. The most stringent including
computer based semantic analysis is an ordered
Gene1 function word Gene2 level (B3). (B4)
shows expert curated gene-gene relationships
only. Expert knowledge is derived by different
sources, like Genomatix experts, Molecular
Connections NetPro data base, STKE, etc...
Here you have a list of all other genes, being
connected to your input gene by at least one
co-citation in entire PubMed on abstract level
Click around, and see what happens !
85Knowledge based analysis
This filters the co-citation frequency
I have intentionally chosen an example with no
expert curation available, since I want to
demonstrate how to generate new knowledge!
86Knowledge based analysis
Here you see the network around HGMCS1, all other
genes connected on GFG level
87Knowledge based analysis
Here connected transcription factors only on GFG
level.
88Knowledge based analysis
Now all connected transcription factors.
89Knowledge based analysis
A connection line between two genes means that
there is a bibliographic connection on abstract
level (BO)...
90Knowledge based analysis
"Mouse over" and clicking gives you more
information...
91Knowledge based analysis
The green indicates that there is a binding site
for SREBF1 (VSREB) in at least one of the
promoters of HMGCS1
92Knowledge based analysis
There is more encoded in the connection
lines...
93Knowledge based analysis
The little symbols give you some information
about the gene and its association with pathways
94Knowledge based analysis
The tagged text tells us that the TF SREBF1 is
involved in regulation of HMGCS1
Some more helpful options from this page...
95Knowledge based analysis
This you know already...
You can get all info about any gene you click up
there...
over here...
96Knowledge based analysis
..as well as this.
97Knowledge based analysis
Hey, hey hey ! Stop it ! I want to know about
the regulation of my gene, not to play around
with your Biblio...thing!
98Knowledge based analysis
Hey, hey hey ! Stop it ! I want to know about
the regulation of my gene, not to play around
with your Biblio...thing!
BiblioSphere PathwayEdition ! We already found
TFs of interest, known to be involved in
regulation of our gene. Now lets see the
biological environment of our gene and find a
group of related genes which might share some
regulatory motifs. Lets go back and display all
genes contained in this network...
99Knowledge based analysis
Lets load the GO-Filter "biological process"...
100Knowledge based analysis
Go to the table view by this tab...
Here you see the tree for the selected filter.
Expand and collapse by clicking on the /-
101Knowledge based analysis
The Z-Score gives you a measure whether certain
categories are significantly over- or
under-represented by the displayed gene set.
Top scoring is sterol and cholesterol
metabolism... Everything above 3 is
statistically significant!
Clicking here opens the tree on the left and
highlights the category as well as the resp.
genes in the pathway view.
102Knowledge based analysis
This finally applies the filter to your gene
set. Superimpose as many filters as youd like !
103Knowledge based analysis
We see two TFs in here, SREBF1 and SREBF2, both
Sterol Regulatory Element Binding Protein
factors.
The "redraw" button
Double-click on SREBF1 in order to see all
connections to that TF
104Knowledge based analysis
Another table view...
105Knowledge based analysis
...the colors encode for...
Highlight those genes with your mouse, and copy
them...
106Knowledge based analysis
Now we have expanded our single input gene with a
set of seven additional genes! And we know
already quite a lot about them!
They all are connected with my original gene in
PubMed
All genes, with very high high statistical
significance, belong to the GO-category
"Cholesterol Metabolic Process"
SREB transcription factors seem to play a role in
the regulation of those genes
Now lets check whether the promoters of
those genes share a complex framework. For such
we first need to export those genes into
GenomatixSuites Gene2Promoter
107Back to sequence level
Oh my god... more... Where do I find this now
?
Relax ! Its easy and not far away...
108Back to sequence level
APOA1, LDLR, SREBF2, VLDLR, FDFT1, FDPS, MVK,
HMGCS1
Paste here the gene symbols which we just copied
in BiblioSpherePE
Dont forget this ! Otherwise you will be asked
for all findings in all organisms.
109Back to sequence level
110Back to sequence level
Hey stop ! Havent I seen this before ?
You are right! It pretty much is the same display
as the comparative genomics page which we have
generated earlier. The difference in this case
is that we now compare promoters of different
genes within one organism
111Back to sequence level
Eight loci with 26 different unique promoters !
9.216 combinations possible for exhaustive
analysis! Combinatorial explosion !
How should I know which ones? How do I do this
?
Since we are concentrating on SREB TF-sites,
lets concentrate on those promoters which
contain an VSREB binding site.
We have to find a way to circumvent this
Very easy! Just scroll down to the bottom of the
page...
112Back to sequence level
Select the desired TF-matrix family here
113Back to sequence level
...and all relevant promoters are checked already
for you
Now we have reduced to 12 different promoters
from 8 different loci, each containing at least
one SREB site.
114Back to sequence level
Scroll to the bottom of the Gene2Promoter result
page...
We have done this before...
115Back to sequence level
You see? Now we have tolerable combinatorics and
can perform an exhaustive promoter analysis.
116Back to sequence level
...but now we choose VSREB as a mandatory
element for our framework. Hint you can select
multiple elements by holding the "Ctrl" key
while clicking.
...and with these parameters you have to play
around a little bit. Start at default. Gradually
relax stringency. Go down in Quorum Constraint
step by step, or allow for higher distance
variance (e.g. 20, 30, 40, 50, usw...) The lower
the distance variance and the more elements per
model, the higher is the resulting model
selectivity.
Remember? We have been here before, too...
117Back to sequence level
Tick the boxes of the models for subsequent
database search for other promoters with similar
organization. With 6 elements I expect to find
the 3 genes from which this models were derived
only SREBF2, HMGCS1, and MVK
There are frameworks with 6 elements! This is
quite significant and expected to be extremely
selective.
For example, at quorum of 30, allowed distance
range of 5 to 200 bp, distance variance of 50 bp
maximum elements allowed 10 we find quite a lot
of frameworks in the different promoter
combinations.
118Back to sequence level
Now lets see how selective this model is...
Scroll all the way down...
This list is quite interesting! Here we have the
differents TF sites in this set of
frameworks. This list represents those TFs which
we should concentrate on, when analyzing the
regulation of the original input gene. It is
pretty comparable to the list from our
phylogenetic approach before. There is now good
evidence that those factors play a role in
regulation in the biological context of
cholesterol metabolism.
119Back to sequence level
It is just one click away...
120Back to sequence level
This should look familiar to you ! But now we
are going for the database section...
Unless you have a good reason to do so, always go
for the database of promoters of annotated genes.
This allows for GO-group Z-scoring of the
database hits later on...
121Back to sequence level
This is a termination parameter. If this number
of hits is reached before the end of the
database, the search is terminated
Careful! Some browsers crash with too many hits
to display in HTML ! (gt10.000)
A database search usually takes several minutes.
In order to avoid a server time-out go for the
e-mail option. Youll receive a mail with a
direct link to your result file ( it will be kept
in your "Results Directory", too)
122Back to sequence level
Eight matches!
In four sequences. Each model matches exactly
once per sequence...
The three genes of our "training set"...
...out of a total of 56.193 different promoters
Wow !
123Back to sequence level
...plus one additional "new" gene! This one was
not in our input list and is identified only by
common promoter organization!
124Back to sequence level
Those four genes now are extremely likely to
share common regulation in the given biological
context! The TFs in the framework now are the top
candidates for further inspection.
125Back to sequence level
Those four genes now are extremely likely to
share common regulation in the given biological
context! The TFs in the framework now are the top
candidates for further inspection.
126New Knowledge
I am terribly sorry for that! However,
eukaryotic transcriptional regulation is pretty
complex. Our group of researchers works in this
field since more than two decades. As you have
seen, our tools - though pretty easy to use -
require some explanations and sometimes a
slightly different mind-setting, going beyond
looking at single, isolated TF binding sites. I
hope I was able to show you some basic strategies
to follow. Nevertheless, lets have a final look
at the additional gene which we have found with
the database search in our example...
127New Knowledge
128New Knowledge
129New Knowledge
MMAB is a transferase involved in vitamin B(12)
activation and linked to a disease methylmalonyl
aciduria
130New Knowledge
Feeding all 4 genes from ModelInspector into
BiblioSpherePE shows that they are all connected
plus...
131In our example, we started with a single gene (
HMGCS1), ElDorado put it into biological context
in and concentrated on an potential regulator (
SREB), BiblioSpherePE identified common promoter
organization (TF-Framework) GEMS Launcher ,
FrameWorker searched for additional genes with
similar promoter organization and GEMS Launcher ,
ModelInspector put the genes back into
biological context. BiblioSpherePE Literature
confirmed that we indeed found a co-regulated
network and identified the molecular basis for
such. This could NEVER be achieved by
statistical analysis of isolated TFBS
132. There is so much more in GenomatixSuite PE I
did neither say a word to matrix generation, nor
to direct experimental planning for
knock-out/knock-in experiments with
SequenceShaper Expand the hit-list by shortening
the framework, etc... etc... Get in touch with
us via support_at_genomatix.de and we will give you
a tour through the entire system at a
web-meeting. Some informative links http//www.
genomatix.de/company/publications1.html http//ww
w.genomatix.de/training/tasks.html http//www.gen
omatix.de/download/download4.html http//www.geno
matix.de/cgi-bin/UMapps/register.pl