Oversampling Sample Clauses

Oversampling. Oversampling was employed for some subgroups of interest to help increase the precision of estimates associated with members of those subgroups. Before going into details, the concept of oversampling will be discussed. In a sample where all persons in a population are selected with the same probability and survey coverage of the population is high, the sample distribution is expected to be proportionate to the population distribution. For example, if Hispanics represent 15 percent of the general population, one would expect roughly 15 percent of the persons sampled to be Hispanic. However, in order to improve the precision of estimates for subgroups of a population, one might decide to select samples from those subgroups at higher rates than the remainder of the population. Thus, one might select Hispanics at twice the rate (i.e., at double the probability) of persons not oversampled. As a result, subgroups that are "oversampled" are represented at disproportionately high rates in the sample. Sample weights help ensure that population estimates account for this disproportionate contribution from oversampled subgroups, as the base sample weights for oversampled groups will be smaller than for the portion of the population not oversampled. For example, if a subgroup is sampled at roughly twice the rate of sample selection for the remainder of the population not oversampled, members of the oversampled subgroup will receive base or initial sample weights (prior to nonresponse or poststratification adjustments) that are roughly half the size of the group "not oversampled". As mentioned above, oversampling a subgroup is done to improve the precision of survey estimates for that particular subgroup. The "cost" of oversampling is that the precision of estimates for the general population and subgroups not oversampled will be reduced to some extent compared to the precision one could have achieve if the same overall sample size were selected without any oversampling. For MEPS, some of the oversampling was achieved through its linkage to the NHIS. For the NHIS, Hispanic households were oversampled at a rate of roughly 2 to 1. That is, the probability of selecting a Hispanic household for participation in the NHIS was roughly twice that for households in the general population that were not oversampled. The NHIS oversampling rate for black households was roughly 1.5 to 1. The oversampling approach for both Panels 10 and 11 was the same. From among the NHIS households...