About cluster Random Sampling

Cluster random sampling is a probability sampling technique used when a complete list of individuals (sampling frame) is unavailable, but clusters (groups) can be identified.

When to Use

  • Suitable when populations lack individual lists (e.g., students across Jakarta universities).
  • Example: Surveying opinions of UI students vs. all Jakarta students—UI has a list, Jakarta does not.

Procedure

  1. Identify clusters: Define groups (e.g., universities, hospitals) containing individuals.
  2. Select clusters randomly: Choose a subset of clusters (e.g., 4 out of 50 universities).
  3. Sample within clusters: Randomly select individuals from chosen clusters using simple, systematic, or stratified sampling.

Advantages

  1. Reduces effort compared to sampling all individuals across a population.
  2. Feasible when individual data is inaccessible (e.g., only cluster lists exist).

Parameter Estimation

  • Population size split into clusters, sizes .
  • Sample clusters, sizes , total sample .
  • Total estimator: .
  • Mean estimator: .
  • Unbiased: .

Variance Analysis

  • Variance: .
  • : Between-cluster variance.
  • : Within-cluster variance.
  • Estimator: uses sample variances and , unbiased for .

Sample Allocation

  • Optimum allocation: Minimize variance given cost .
  • Result: , .