Sampling via Rejection

Sampling via Rejection#

Recall that in the lecture, we discussed rejection sampling for two different cases for a \(\text{Beta}(2,2)\) density

Using a uniform proposal density
Using a Gaussian density.

Uniform (box) example had \(M = 1.5\) whereas Gaussian had \(M = 1.3\). Of course the acceptance rates are respectively \(\hat{a}_{\text{box}} = 1/1.5 \approx 0.67\) and \(\hat{a}_{\text{Gauss}} = 1/1.3 \approx 0.76\). We will test this now numerically (by counting the ratio of accepted samples).

Let us first get our basics. We will define the densities and their log-densities (as we do not want to implement the densities due to possible numerical errors).

import numpy as np
import matplotlib.pyplot as plt

rng = np.random.default_rng(52) # random number generator

# compute beta pdf
def beta_pdf(x, a, b):
    return 6 * x**(a-1) * (1-x)**(b-1)

def gaussian_function(x, mu, sigma):
    return 1/(sigma * np.sqrt(2 * np.pi)) * np.exp( - (x - mu)**2 / (2 * sigma**2))

def log_gaussian_function(x, mu, sigma):
    return - np.log(sigma * np.sqrt(2 * np.pi)) - (x - mu)**2 / (2 * sigma**2)

def log_beta_pdf(x, a, b):
    return np.log(6) + (a-1) * np.log(x) + (b-1) * np.log(1-x)

Let us now plot the \(\text{Beta}(2,2)\) density and the uniform and Gaussian proposal densities.

# plot beta pdf
xx = np.linspace(0, 1, 1000)

# beta parameters
a = 2
b = 2

m = 0.5 # mean for the Gaussian proposal
v = 0.25 # variance for the Gaussian proposal

M_gauss = 1.3 # for the Gaussian as shown in the lecture
M_box = 1.5 # for the box as shown in the lecture

plt.plot(xx, beta_pdf(xx, a, b), color='k', linewidth=2)
plt.plot(xx, M_gauss * gaussian_function(xx, m, v), color=[0.8, 0.6, 0], linewidth=2)
plt.plot(xx, M_box * np.ones(len(xx)), color=[0.8, 0, 0], linewidth=2)
plt.xlim(0, 1)
plt.ylim(bottom=0)
plt.legend(["$p(x)$", "$M_{\\text{gauss}} q(x)$", "$M_{\\text{box}} q(x)$"])
plt.show()

../_images/785bbd5f25060f26adca0529718cc7fac795457e44b54bb387ce3ad09524f039.png

We will now efficiently show a few things

How to implement rejection sampling
Comparison of acceptance rates of two proposal densities as predicted above

Let us write now a sampler function for different proposals.

def beta_rej_sampler_uniform_proposal(N, a, b, M):

    # N will be the number of trials but due to rejections, we might not get N samples
    # a and b are the parameters of the beta distribution
    # M is the maximum of the ratio of the beta pdf and the proposal pdf (hand computed)

    acc_samples = np.array([]) # accepted samples

    for i in range(N): # writing a for loop for the sake of clarity
        x_prop = rng.uniform(0, 1) # sample from the proposal
        u = rng.uniform(0, 1) # sample from the uniform distribution

        # the following step is u < p / M*q (in the log-domain for stability)
        if np.log(u) < log_beta_pdf(x_prop, a, b) - np.log(M):
            acc_samples = np.append(acc_samples, x_prop)
        
    acc_rate = len(acc_samples) / N
    return acc_samples, acc_rate

def beta_rej_sampler_gaussian_proposal(N, a, b, m, v, M):
    
        # N will be the number of trials but due to rejections, we might not get N samples
        # a and b are the parameters of the beta distribution
        # m and v are the parameters of the Gaussian proposal
        # M is the maximum of the ratio of the beta pdf and the proposal pdf (hand computed)
    
        acc_samples = np.array([]) # accepted samples
    
        for i in range(N): # writing a for loop for the sake of clarity
            x_prop = rng.normal(m, v) # sample from the proposal
            u = rng.uniform() # sample from the uniform distribution
    
            # the following step is u < p / M*q (in the log-domain for stability)
            if np.log(u) < log_beta_pdf(x_prop, a, b) - log_gaussian_function(x_prop, m, v) - np.log(M):
                acc_samples = np.append(acc_samples, x_prop)
            
        acc_rate = len(acc_samples) / N
        return acc_samples, acc_rate

With these functions at hand, we will now test the acceptance rates.

N = 10000
m = 0.5
v = 0.25

# beta parameters
a = 2
b = 2

M_gauss = 1.3 # for the Gaussian as shown in the lecture
M_box = 1.5 # for the box as shown in the lecture

# uniform proposal
acc_samples_box, acc_rate_box = beta_rej_sampler_uniform_proposal(N, a, b, M_box)
print("Acceptance rate for the box proposal: ", acc_rate_box)
print("Theoretical acceptance rate for the box proposal: ", 1/M_box)

# Gaussian proposal
acc_samples_gauss, acc_rate_gauss = beta_rej_sampler_gaussian_proposal(N, a, b, m, v, M_gauss)
print("Acceptance rate for the Gaussian proposal: ", acc_rate_gauss)
print("Theoretical acceptance rate for the Gaussian proposal: ", 1/M_gauss)

Acceptance rate for the box proposal:  0.67
Theoretical acceptance rate for the box proposal:  0.6666666666666666
Acceptance rate for the Gaussian proposal:  0.7686
Theoretical acceptance rate for the Gaussian proposal:  0.7692307692307692

/var/folders/d8/86whr2t12h70n511w7s7cm_w0000gn/T/ipykernel_53775/2947415207.py:17: RuntimeWarning: invalid value encountered in log
  return np.log(6) + (a-1) * np.log(x) + (b-1) * np.log(1-x)

As can be seen from the output, theoretical acceptance rates matches empirical acceptance rates.

We can also plot histograms. For the box proposal, we get the following histogram.

plt.hist(acc_samples_box, bins=50, density=True, color=[0.8, 0, 0], alpha=0.5)
plt.plot(xx, beta_pdf(xx, a, b), color='k', linewidth=2)
plt.show()

../_images/d94af0f135b4905991c8c59f403c8fe468e49dc688391a0ad622d67683d51e88.png

For the Gaussian proposal, we get the following histogram.

plt.hist(acc_samples_gauss, bins=50, density=True, color=[0.8, 0.6, 0], alpha=0.5)
plt.plot(xx, beta_pdf(xx, a, b), color='k', linewidth=2)
plt.show()

../_images/63fb00240823ec4cb66dd68800f7c794c46061ad5de040ac7c2b7fe942843e4a.png