The kinetics and thermodynamics of complex transitions in biomolecules can be
modeled in terms of a network of transitions between the relevant conformational
substates. Such a transition network, which overcomes the fundamental
limitations of reaction-coordinate-based methods, can be constructed either
based on the features of the energy landscape, or from molecular dynamics
simulations. Energy-landscape-based networks are generated with the aid of
automated path-optimization methods, and, using graph-theoretical adaptive
methods, can now be constructed for large molecules such as proteins. Dynamics-
based networks, also called Markov State Models, can be interpreted and
adaptively improved using statistical concepts, such as the mean first passage
time, reactive flux and sampling error analysis. This makes transition networks
powerful tools for understanding large-scale conformational changes.
The 1D energy of the system
This energy plot is sample with a basic Markov Chain Monte Carlo (MCMC)
1000000 steps of MCMC and we compute the transition matrix
Here the distribution obtained with th MCMC which fit quite well the energy plot
The plot of the transition matrix
The transition matrix is then diagonalized
Just a simple watershed on the energy to identify energy basins based on the
energy plot
A function to substitute values in an array
The projection of the transition matrix onto the second and third eigen vector
The code below identifies the three metastable states and compute the Markov
chain model
And here the full story in one plot:
The construction of a transition network. (a) Sample potential, defined over a
one-dimensional coordinate that is discretized into 100 microstates. It has
three metastable basins (0 (blue), 1 (green), and 2 (red)). (b) Transition
matrix T for a Markov lagtime of 200 steps. The transition probability was
obtained from a Metropolis Monte Carlo, jumping each step only to the current or
adjacent microstates. T exhibits three clusters corresponding to the metastable
states. (c) Left eigenvectors of T indicating the transition modes among
microstates. The first eigenvector gives the stationary distribution. The sign
structure of the second eigenvector partitions the state space into two
metastable states, blue and red. The sign structure of the third eigenvector
further splits green and red, obtaining three metastable states. (d) The
eigenvalue spectrum of T. The clear gaps after 2 and 3 eigenvalues indicate how
many states are metastable. (e) Coordinates of the 100 microstates projected
onto the second and third right eigenvectors of T. Metastable states are
identified by clustering the microstates in this eigenspace. (f) Transition
network between the 3 metastable states A, B and C