Structural%20Bioinformatics%20Seminar - PowerPoint PPT Presentation

About This Presentation
Title:

Structural%20Bioinformatics%20Seminar

Description:

X-ray Crystallography. NMR (Nuclear Magnetic Resonance) EM (Electron microscopy) ... Use colors, pictures, tables and animation, but don't exaggerate. Lecture ... – PowerPoint PPT presentation

Number of Views:131
Avg rating:3.0/5.0
Slides: 62
Provided by: maxs9
Category:

less

Transcript and Presenter's Notes

Title: Structural%20Bioinformatics%20Seminar


1
Structural Bioinformatics Seminar
  • Dina Schneidman
  • Email duhovka_at_post.tau.ac.il

2
Outline
  • Seminar requirements
  • Biological Introduction
  • How to prepare seminar lecture?

3
Seminar Requirements
  • No prior knowledge in Biology is assumed or
    required!
  • Attend ALL lectures
  • Prepare one of the lectures

4
Seminar Goals
  • Learn how to study new subject from articles
  • Learn how to present work in Computer Science

5
Biological Introduction
6
Schedule
  • Introduction to molecular structure.
  • Introduction to pattern matching.
  • Introduction to protein structure alignment
    (comparison).
  • Protein docking.

7
Small Ligands
  • Small organic molecules, composed of tens of
    atoms.
  • Highly flexible can have many torsional degrees
    of freedom.

8
DNA The code of life
  • DNA is a polymer.
  • The monomer units of DNA are nucleotides A, T,
    C, G.
  • DNA is a normally double stranded macromolecule.

9
RNA
  • RNA is a polymer too.
  • The monomer units of RNA are nucleotides A, U
    (instead of T), C, G.
  • DNA serves as the template for the synthesis of
    RNA.

10
Protein
  • Protein is a polymer too.
  • The monomer units of Protein are 20 amino acids.
  • Each amino acid is encoded by 3 RNA nucleotides.

Hemoglobin sequence VHLTPEEKSAVTALWGKVNVDEVGGEALG
RLLVVYPWTQRFFESFGDLSTPDAVMGNPKVKAHGKKVLGA
FSDGLAHLDNLKGTFATLSELHXDKLHVDPENFRLLGNVLVCVLAHHFGK
EFTPPVQAAYQKVVAGVANA LAHKYH
11
The Central Dogma
DNA RNA
Protein
Symptomes
(Phenotype)
Cells express different subset of the genes in
different tissues and under different conditions.
12
The central dogma
  • DNA ---gt mRNA ---gt Protein
  • A,C,G,T A,C,G,U A,D,..Y
  • Guanine-Cytosine T-gtU
  • Thymine-Adenine
  • 4 letter alphabets 20 letter
    alphabet
  • Sequence of nucleic acids Sequence of amino acids

13
Bioinformatics - Computational Genomics
  • DNA mapping.
  • Protein or DNA sequence comparisons.
  • Exploration of huge textual databases.
  • In essence one- dimensional methods and
    intuition.

14
Structural Bioinformatics - Structural Genomics
  • Elucidation of the 3D structures of biomolecules.
  • Analysis and comparison of biomolecular
    structures.
  • Prediction of biomolecular recognition.
  • Handles three-dimensional (3-D) structures.
  • Geometric Computing. (a methodology shared by
    Computational Geometry, Computer Vision, Computer
    Graphics, Pattern Recognition etc.)

15
Protein Structural Comparison
Pseudoazurin - 1pmy
ApoAmicyanin - 1aaj
16
Algorithmic Solution
About 1 sec. Fischer, Nussinov, Wolfson 1990.
17
Introduction to Protein Structure
18
Amino acids and the peptide bond
Cb first side chain carbon (except for glycine).
19
Backbone or Secondary structure display
20
Wire-frame or ribbons display
21
Spacefill model
22
Geometric Representation
3-D Curve vi, i1n
23
(No Transcript)
24
Secondary structure
25
? strands and sheets
  • Hydrogen bonds.

26
(No Transcript)
27
(No Transcript)
28
The Holy Grail - Protein Folding
  • From Sequence to Structure.
  • Relatively primitive computational folding models
    have proved to be NP hard even in the 2-D case.

29
Determination of protein structures
  • X-ray Crystallography
  • NMR (Nuclear Magnetic Resonance)
  • EM (Electron microscopy)

30
An NMR result is an ensemble of models
  • Cystatin (1a67)

31
The Protein Data Bank (PDB)
  • International repository of 3D molecular data.
  • Contains x-y-z coordinates of all atoms of the
    molecule and additional data.
  • http//pdb.tau.ac.il
  • http//www.rcsb.org/pdb/

32
(No Transcript)
33
(No Transcript)
34
Why bother with structureswhen we have sequences
?
  • In evolutionary related proteins structure is
    much better preserved than sequence.
  • Structural motifs may predict similar
    biological function
  • Getting insight into protein folding.
    Recovering the limited (?) number of protein
    folds.

35
Applications
  • Classification of protein databases by structure.
  • Search of partial and disconnected structural
    patterns in large databases.
  • Extracting Structure information is difficult, we
    want to extract new folds.

36
Applications (continued)
  • Speed up of drug discovery.
  • Detection of structural pharmacophores in an
    ensemble of drugs (similar substructures in
    drugs acting on a given receptor
    pharmacophore).
  • Comparison and detection of drug receptor active
    sites (structurally similar receptor cavities
    could bind similar drugs).

37
Object Recognition
38
Model Database
39
Scene
40
Recognition
Lamdan, Schwartz, Wolfson, Geometric
Hashing,1988.
41
Protein Alignment Geometric Pattern
Discovery
42
Protein Alignment
  • The superimposition pattern is not known
    a-priori pattern discovery .
  • The matching recovered can be inexact.
  • We are looking not necessarily for the
  • largest superimposition, since other
  • matchings may have biological meaning.

43
Geometric Task
Given two configurations of points in the
three dimensional space,
find those rotations and translations of one
of the point sets which produce large
superimpositions of corresponding 3-D
points.
44
Geometric Task (continued)
  • Aspects
  • Object representation (points, vectors, segments)
  • Object resemblance (distance function)
  • Transformation (translations, rotations, scaling)

-gt Optimization technique
45
Transformations
  • Translation
  • Translation and Rotation
  • Rigid Motion (Euclidian Trans.)
  • Translation, Rotation Scaling

46
Inexact Alignment. Simple case two closely
related proteins with the same number of amino
acids.
Question how to measure alignment error?
47
Superposition - best least squares(RMSD Root
Mean Square Deviation)
Given two sets of 3-D points Ppi, Qqi ,
i1,,n rmsd(P,Q) v S ipi - qi 2 /n Find a
3-D rigid transformation T such that rmsd(
T(P), Q ) minT v S iTpi - qi 2 /n
A closed form solution exists for this task. It
can be computed in O(n) time.
48
Problem statement with RMSD metric.
Given two configurations of points in the
three dimensional space, and e threshold
find the largest alignment, a set of matched
elements and transformation, with RMSD less than
e. (belong to NP,)
49
Distance Functions
  • Two point sets Aai i1n
  • Bbj j1m
  • Pairwise Correspondence
  • (ak1,bt1) (ak2,bt2) (akN,btN)

(1) Exact Matching aki bti0
(2) RMSD (Root Mean Square Distance)
Sqrt( Saki bti2/N) lt e (3) Bottleneck
max aki bti
  • Hausdorff distance h(A,B)maxa?A minb?B a
    b
  • H(A,B)max(
    h(A,B), h(B,A))

50
Docking Problem
  • Given two molecules find their correct
    association



51
Docking Problem

?
52
Docking Problem

?
53
How to present a paper in Computer Science
54
Lecture Preparation
  • The lecture should cover a given slot of time
    (90 minutes).
  • Use PowerPoint slides for presentation.
  • Each slide usually spans 1-2 minutes.
  • The slides should not be overloaded.
  • Use mouse or pointer.
  • Use colors, pictures, tables and animation, but
    dont exaggerate.

55
What to say and how
  • Communicate the key ideas during your lecture.
  • Dont get lost in technical details.
  • Structure your talk.
  • Use a top-down approach.

56
Lecture Structure
  • Introduction general description of the paper.
  • Body - abstract of the current method.
  • Technical details.
  • Conclusions and discussion.

57
Introduction
  • Most important part of your talk!
  • Title short explanation about the presented
    topic.
  • Lecture outline.
  • Problem definition, input and output. Dont
    forget to define the problem!
  • Problem motivation.
  • Introduce terminology of the field.
  • Short review of existing approaches (dont
    forget to add references!).

58
Body
  • Abstract of the major results presented in the
    paper.
  • Significance of the results.
  • Sketch of the method.

59
Technicalities
  • Extended presentation of the method.
  • Present key algorithmic ideas clearly and
    carefully.
  • Complexity of the method.
  • Experimental results.

60
Conclusions and Discussion
  • Summarize major contributions of the work.
  • You can highlight points based on technical
    details you couldnt discuss in introduction.
  • Present related open problems.
  • Dont forget to thank the audience !!!
  • Questions.

61
Getting to the Audience
  • Use repetitions
  • Tell them what you're going to tell them.
  • Tell them.
  • Then tell them what you told them".
  • Remind, dont assume
  • Maintain eye contact
  • Control your voice and motion

62
Thanks!!!and Good Luck in your lectures!
Write a Comment
User Comments (0)
About PowerShow.com