Subject:Computer SciencePrice: Bought3
Sampling Using R programming
*Create *a subset of the dataset from Part1 with only the top 5 departments based on the number of employees working in that department. The top 5 departments should be computed using R code. Then, use %in% operator to create the required subset.
Use a sample size of 50 for each of the following. Set the start seed for random numbers as the last 4 digits of your BU id
A) Order the data using the Department variable. Draw a stratified sample using proportional sizes based on the Department variable. Show the frequencies for the selected departments. Show the percentages of these with respect to the sample size.
b) Compare the means of Earnings variable for these four samples against the mean for the data.