Bayesian statistical methods have become popular and their use continues to grow in the applied sciences. This increase in popularity is largely due to the availability of Markov chain Monte Carlo (MCMC) algorithms which allow for the estimation of posterior distributions. However, in a large number of applications, MCMC-based estimates are reported without a valid measure of their quality and there is no coherent strategy for deciding when to stop the simulation. To a large extent, this is due to the fact that analyses (asymptotic and non-asymptotic) of the usual MCMC estimators are typically challenging. For example, as opposed to classical Monte Carlo methods, establishing the central limit theorems (CLTs) that allow for an asymptotic analysis of MCMC estimators is not straightforward. This is a serious practical problem that needs attention because the current valid strategies for assessing the quality of estimation and the choice of (MCMC) sample size rest upon theoretical assumptions such as the existence of CLTs. This project addresses this issue and consists of two parts. The first part consists of asymptotic and non-asymptotic analyses of MCMC estimators based on Gibbs samplers for several widely applicable Bayesian versions of the generalized linear mixed model. The investigator considers probit and identity link functions as well as popular choices of proper and improper prior densities for the parameters. In the second part of the project, the investigator attempts to generalize several results for two-variable Gibbs samplers concerning the convergence rate of the algorithm and its connection to the parametrization of the statistical model. The goal is to generalize known results to the general k-variable Gibbs sampler. Since Gibbs samplers are very popular MCMC algorithms, these results will likely have many applications. In particular, they could be used in the analyses performed in the first part of the project.
This project addresses the quality of estimation in MCMC procedures used in Bayesian statistics, which is a very important issue in the applied sciences. The reason why this is so important is that misleading MCMC-based estimates can lead to incorrect conclusions, which could potentially negatively affect public policy. The main goal of this research is to find simple sufficient conditions (that the user can check) under which the considered MCMC procedures are honest; that is, there is at least one valid measure of the quality of estimation and a coherent strategy for deciding when to stop the simulation. The more ambitious goal is to provide the MCMC user with explicit (non-asymptotic) bounds on the number of iterations needed to achieve a predetermined level of accuracy. The statistical models and MCMC algorithms considered in this project have numerous applications in nearly every scientific discipline. Consequently, the results produced in this project will be used by researchers in many different fields.