Title: Bioinformatics: Practical Application of Simulation and Data Mining Markov Modeling II
1Bioinformatics Practical Application of
Simulation and Data MiningMarkov Modeling II
- Prof. Corey OHern
- Department of Mechanical Engineering
- Department of Physics
- Yale University
1
2Markov Modeling of Proteins
Describing protein folding kinetics by Molecular
Dynamics Simulations. 1. Theory W. C. Swope, J.
W. Pitera, and F. Suits, J. Phys. Chem. B 108
(2004) 6571.
Describing protein folding kinetics by Molecular
Dynamics Simulations. 2. Example applications to
Alanine Dipeptide and a ?-hairpin peptide W. C.
Swope, J. W. Pitera, et al., J. Phys. Chem. B
108 (2004) 6582.
2
3I. Alanine Dipeptide
6 backbone atoms 3 dihedral angles (?, ?, ?)
3
4Macrostate Definition
1
T500K
1
2
5
4
3
1
1
4
5Kinetics at T500 K
- 10,000 separate trajectories sampled 200 times at
- 0.5ps intervals (100ps) using AMBERShake
5
66
7MS Lifetime Distributions
MS1
MS5
1/(1-Tii)
7
8Transition Matrix Eigenvalues
?F 550ps
Markovian
Non- Markovian
spurious
?F gtgt tkin 100ps
8
9II. ?-hairpin motif of protein G
G41EWTYDDATKTFTVTE56
6
1
2
4
5
3
Hydrogen bonding
9
10Macrostate Definition
- 287 conformations run at NVE (310 K) for 0.5 ns
- using explicit water and Na counterions
- Order parameters Rg, number and order of
- hydrogen bonds
Hydrogen bonds
Radius of gyration
000000
turn
termini
111111
S,M,L,E
000001
00011X
264?22-35 macrostates
10
1111
12MS Lifetime Distributions
Non-Markovian
Markovian gt 50ps
000000E
00111X
12
13Transition Matrix Eigenvalues
Time reversed
Non- Markovian
13
14Predicted Folding Time
?F 20 ns ltlt 6 ?s
- Short 0.5 ns trajectories (4 orders
- of magnitude difference)
- 2. Long-lived conformations
14
15Long-lived Conformations
Misregistered H-bonds
Misregistered H-bonds
splayed, ion association
misformed turn
tight turn
15
16Using massively parallel simulation and
Markovian Models to study protein folding
Examining the dynamics Of the villin headpiece,
J. Chem. Phys. 124 (2006) 164902.
16
17Villin headpiece-HP-36
MLSDEDFKAVFGMTRSAFANLPLWKQQNLKKEKGLF PDB 1 VII
17
18Simulation Details
50,000 trajectories 10ns/trajectory 500 ?s
- Gromacs with explicit solvent (5000 water
molecules) - and eight counterions Amber bond constraints
18
19Native State Ensemble
19