Generate list of subsamples
getSubsamps.RdGenerate list of subsamples
Arguments
- n
Integer or numeric; sample size of the data set.
- B
Integer or numeric; the number of subsamples. Note: For
sampling_type=="MB"the total number of subsamples will beB; forsampling_type="SS"the number of subsamples will be2*B. Default is 100 forsampling_type="MB"and 50 forsampling_type="SS".- sampling_type
A character vector; either "SS" or "MB". For "MB", all B subsamples are drawn randomly (as proposed by Meinshausen and Bühlmann 2010). For "SS", in addition to these B subsamples, the B complementary pair subsamples will be drawn as well (see Faletto and Bien 2022 or Shah and Samworth 2013 for details). Default is "SS", and "MB" is not supported yet.
Value
A list of length B (or 2*B for sampling_type="SS"), where each
element is an integer vector of length floor(n/2) containing the indices
of a subsample of 1:n. For sampling_type=="SS", the last B subsamples
will be complementary pairs of the first B subsamples (see Faletto and
Bien 2022 or Shah and Samworth 2013 for details).
References
Faletto, G., & Bien, J. (2022). Cluster Stability Selection. arXiv preprint arXiv:2201.00494. https://arxiv.org/abs/2201.00494.
Shah, R. D., & Samworth, R. J. (2013). Variable selection with error control: Another look at stability selection. Journal of the Royal Statistical Society. Series B: Statistical Methodology, 75(1), 55–80. https://doi.org/10.1109/RITA.2014.2302071.