eVCGsampler provides a principled framework for sampling of VCG (Virtual Control Group), using energy distance-based covariate balancing. The package includes visualization tools for assessing covariate balance, as well as a permutation test to evaluate the statistical significance of the deviations.
Test for 3 covariates before balancing, comparison of the treated groups (TG) with the data pool (POOL), shows high imbalance.
Distance permutation test TG vs POOL:
By running the function: VCG_sampler(treated ~ cov1 + cov2 + cov3, data=dat, n=10)
Distance permutation test TG vs VCG:
Plot specifically for the variable cov3: plot_var(dat_out, what=‘cov3’)
With BestVCGsize(treat ~ cov1 + cov2 + cov3, data=dat), you can explore the best size for VCG with the best balance of covariates. It may not necessarily be the best size in terms of power or validity of the study.
If multiple VCG samples are required, use: multiSampler(treat~cov1+cov2+cov3, n=10, Nsamples=10, data=dat)
Overview of sample overlapping: