Cluster Random Sampling

About cluster Random Sampling
When to Use
Procedure
Advantages
Parameter Estimation
Variance Analysis
Sample Allocation

❯

❯

sampling methods

❯

Cluster Random Sampling

About cluster Random Sampling
When to Use
Procedure
Advantages
Parameter Estimation
Variance Analysis
Sample Allocation

Cluster Random Sampling

Feb 22, 20262 min read

About cluster Random Sampling

Cluster random sampling is a probability sampling technique used when a complete list of individuals (sampling frame) is unavailable, but clusters (groups) can be identified.

When to Use

Suitable when populations lack individual lists (e.g., students across Jakarta universities).
Example: Surveying opinions of UI students vs. all Jakarta students—UI has a list, Jakarta does not.

Procedure

Identify clusters: Define groups (e.g., universities, hospitals) containing individuals.
Select clusters randomly: Choose a subset of clusters (e.g., 4 out of 50 universities).
Sample within clusters: Randomly select individuals from chosen clusters using simple, systematic, or stratified sampling.

Advantages

Reduces effort compared to sampling all individuals across a population.
Feasible when individual data is inaccessible (e.g., only cluster lists exist).

Parameter Estimation

Population size $N$ split into $M$ clusters, sizes $N_{1}, N_{2}, \dots, N_{M}$ .
Sample $m$ clusters, sizes $n_{1}, n_{2}, \dots, n_{m}$ , total sample $n = n_{1} + n_{2} + \dots + n_{m}$ .
Total estimator: $\hat{X} = \frac{M}{m} \sum_{i = 1}^{m} \frac{N _{i}}{n _{i}} \sum_{j = 1}^{n_{i}} x_{ij}$ .
Mean estimator: $\overset{ˉ}{\hat{X}} = \frac{X ^}{N}$ .
Unbiased: $E (\hat{X}) = X$ .

Variance Analysis

Variance: $V (\hat{X}) = M^{2} (\frac{M - m}{M} \frac{S _{b}^{2}}{m}) + \frac{M}{m} \sum_{i = 1}^{M} N_{i}^{2} (\frac{N _{i} - n _{i}}{N _{i}} \frac{S _{i}^{2}}{n _{i}})$ .
$S_{b}^{2}$ : Between-cluster variance.
$S_{i}^{2}$ : Within-cluster variance.
Estimator: $\hat{V} (\hat{X})$ uses sample variances $s_{b}^{2}$ and $s_{i}^{2}$ , unbiased for $V (\hat{X})$ .

Sample Allocation

Optimum allocation: Minimize variance given cost $c = c_{1} m + c_{2} m \overset{n}{ˉ}$ .
Result: $\overset{n}{ˉ} = \frac{c _{1} S _{2 i}^{2}}{c _{2} S _{1 b}^{2}}$ , $m = \frac{c}{c _{1} + c _{2} n ˉ}$ .

Recent Notes

Tugas 1
Feb 27, 2026
Rangkuman
Feb 27, 2026
Rust
Feb 27, 2026
- type/category
When to use Struct Derives
Feb 27, 2026
Welcome to my Notes
Feb 22, 2026
- linker-exclude

Graph View

Related notes

sampling methods

Created with Quartz v4.5.2 © 2026

GitHub
Discord Community