Reversible-jump Markov chain Monte Carlo

Simulation method in statistics

In computational statistics, reversible-jump Markov chain Monte Carlo is an extension to standard Markov chain Monte Carlo (MCMC) methodology, introduced by Peter Green, which allows simulation (the creation of samples) of the posterior distribution on spaces of varying dimensions.[1] Thus, the simulation is possible even if the number of parameters in the model is not known. The "jump" refers to the switching from one parameter space to another during the running of the chain. RJMCMC is useful to compare models of different dimension to see which one fits the data best. It is also useful for predictions of new data points, because we do not need to choose and fix a model, RJMCMC can directly predict the new values for all the models at the same time. Models that suit the data best will be chosen more frequently then the poorer ones.

Details on the RJMCMC process

Let n m N m = { 1 , 2 , , I } {\displaystyle n_{m}\in N_{m}=\{1,2,\ldots ,I\}\,} be a model indicator and M = n m = 1 I R d m {\displaystyle M=\bigcup _{n_{m}=1}^{I}\mathbb {R} ^{d_{m}}} the parameter space whose number of dimensions d m {\displaystyle d_{m}} depends on the model n m {\displaystyle n_{m}} . The model indication need not be finite. The stationary distribution is the joint posterior distribution of ( M , N m ) {\displaystyle (M,N_{m})} that takes the values ( m , n m ) {\displaystyle (m,n_{m})} .

The proposal m {\displaystyle m'} can be constructed with a mapping g 1 m m {\displaystyle g_{1mm'}} of m {\displaystyle m} and u {\displaystyle u} , where u {\displaystyle u} is drawn from a random component U {\displaystyle U} with density q {\displaystyle q} on R d m m {\displaystyle \mathbb {R} ^{d_{mm'}}} . The move to state ( m , n m ) {\displaystyle (m',n_{m}')} can thus be formulated as

( m , n m ) = ( g 1 m m ( m , u ) , n m ) {\displaystyle (m',n_{m}')=(g_{1mm'}(m,u),n_{m}')\,}

The function

g m m := ( ( m , u ) ( ( m , u ) = ( g 1 m m ( m , u ) , g 2 m m ( m , u ) ) ) ) {\displaystyle g_{mm'}:={\Bigg (}(m,u)\mapsto {\bigg (}(m',u')={\big (}g_{1mm'}(m,u),g_{2mm'}(m,u){\big )}{\bigg )}{\Bigg )}\,}

must be one to one and differentiable, and have a non-zero support:

s u p p ( g m m ) {\displaystyle \mathrm {supp} (g_{mm'})\neq \varnothing \,}

so that there exists an inverse function

g m m 1 = g m m {\displaystyle g_{mm'}^{-1}=g_{m'm}\,}

that is differentiable. Therefore, the ( m , u ) {\displaystyle (m,u)} and ( m , u ) {\displaystyle (m',u')} must be of equal dimension, which is the case if the dimension criterion

d m + d m m = d m + d m m {\displaystyle d_{m}+d_{mm'}=d_{m'}+d_{m'm}\,}

is met where d m m {\displaystyle d_{mm'}} is the dimension of u {\displaystyle u} . This is known as dimension matching.

If R d m R d m {\displaystyle \mathbb {R} ^{d_{m}}\subset \mathbb {R} ^{d_{m'}}} then the dimensional matching condition can be reduced to

d m + d m m = d m {\displaystyle d_{m}+d_{mm'}=d_{m'}\,}

with

( m , u ) = g m m ( m ) . {\displaystyle (m,u)=g_{m'm}(m).\,}

The acceptance probability will be given by

a ( m , m ) = min ( 1 , p m m p m f m ( m ) p m m q m m ( m , u ) p m f m ( m ) | det ( g m m ( m , u ) ( m , u ) ) | ) , {\displaystyle a(m,m')=\min \left(1,{\frac {p_{m'm}p_{m'}f_{m'}(m')}{p_{mm'}q_{mm'}(m,u)p_{m}f_{m}(m)}}\left|\det \left({\frac {\partial g_{mm'}(m,u)}{\partial (m,u)}}\right)\right|\right),}

where | | {\displaystyle |\cdot |} denotes the absolute value and p m f m {\displaystyle p_{m}f_{m}} is the joint posterior probability

p m f m = c 1 p ( y | m , n m ) p ( m | n m ) p ( n m ) , {\displaystyle p_{m}f_{m}=c^{-1}p(y|m,n_{m})p(m|n_{m})p(n_{m}),\,}

where c {\displaystyle c} is the normalising constant.

Software packages

There is an experimental RJ-MCMC tool available for the open source BUGs package.

The Gen probabilistic programming system automates the acceptance probability computation for user-defined reversible jump MCMC kernels as part of its Involution MCMC feature.

References

  1. ^ Green, P.J. (1995). "Reversible Jump Markov Chain Monte Carlo Computation and Bayesian Model Determination". Biometrika. 82 (4): 711–732. CiteSeerX 10.1.1.407.8942. doi:10.1093/biomet/82.4.711. JSTOR 2337340. MR 1380810.