Random intercept models for binary data are useful tools for addressing between-subject heterogeneity. the literature and can be used to synthesize several seemingly disparate modeling approaches. In addition, this family of models offers considerable computational benefits. A diverse series of examples is explored that illustrates the wide applicability of this approach. and benefits of this class of models. Notably, we explore models that yield conditional and marginal interpretations of parameters. To summarize the results, the proposed model offers the conceptual benefit of containing a wide class of common models for binary data as either special or limiting cases. Furthermore, we highlight some interesting practical advantages of these models. In particular, the added flexibility of placing a mixture distribution on the random effects protects against misspecification of this distribution. Placing Raltegravir a mixture distribution on the PRKM1 conditional link distribution both allows for easy post-hoc approximations of marginal link functions and easier fitting for marginalized multilevel models. Finally we demonstrate that the latent variable representations of the mixture distributions for the conditional link and random effect distributions can result in simple and elegant Gibbs samplers for Bayesian analysis. The manuscript is laid out as follows. In Section 2 we present the notation and the model. In Section 3 we connect the mixture of normals model with several variants of random effect models in the literature. In Section 4 we illustrate with a diverse collection of useful applications of the mixture of normals approximation. Finally, in Section 5, we provide a summary and discussion of future work. 2. Random intercept model for binary outcomes 2.1. Notation Consider the data given in Table 1, which arose from a teratology experiment (Weil, 1970), and was subsequently analyzed in Liang Raltegravir and Hanfelt (1994) and Heagerty and Zeger (1996). The objective is to compare the survival of rat pups in 16 control litters with that of the pups in the 16 treated litters. The treatment was a chemical agent administered to the mothers of each treated litter. We use this data set and experiment to motivate the model. Table 1 Teratology data. Numbers Raltegravir are (number survived, number dead) in each litter by treatment arm. For example, in the first control litter, all thirteen pups survived. Source Weil (1970). Assume that {= 1, , and response = 1, represents survival or not (1 versus 0 respectively) for pup from litter be a vector of covariates associated with = (1, be a link function (see McCullagh and Nelder, 1989) that relates the probability of a success to a function of the covariates. As is typical for binary data, we assume that (the inverse link function) is a distribution function, referred to as the link distribution. We assume that as in (2). The superscript on the slope effects is used to denote that the effects are conditional, having an interpretation on the conditional link functions scale. Defining the as such implies a marginal model. Specifically is the distribution of the sum of independent random variables having distribution functions and is the standard normal distribution, is a normal distribution with 0 mean and variance is defined as in Equation 2. This model then corresponds to a probit-normal GLMM. By the standard properties of the normal distribution, the distribution of the sum of a standard normal ((being a normal distribution with 0 mean and variance estimates the marginal probit-scale change in the probability of death. 2.3. Marginal Models Consider again the Teratology probit-normal example from the previous section – i.e. is a standard normal and is a normal with mean 0 and variance results in parameters with marginal interpretations. In fact, Heagerty and Zeger (2000) showed that this technique can be.