statistical calculations

Problem Set Week One

All statistical calculations

Problem Set Week One

All statistical calculations will use Employee Salary Data Set.

  1. Using the Excel Analysis ToolPak or the StatPlus:mac LE software function descriptive statistics, generate and show the descriptive

    statistics for each appropriate variable in the sample data set.

    1. For which variables in the data set does this function not work correctly for? Why?
  2. Sort the data by Gen or Gen 1 (into males and females) and find the mean and standard deviation

    for each gender for the following variables:

    1. sal, compa, age, sr and raise. Use either the descriptive stats function or the Fx functions (average and stdev).
  3. What is the probability for a:
    1. Randomly selected person being a male in grade E?
    2. Randomly selected male being in grade E?
    3. Why are the results different?
  4. Find:
    1. The z score for each male salary, based on only the male salaries.
    2. The z score for each female salary, based on only the female salaries.
    3. The z score for each female compa, based on only the female compa values.
    4. The z score for each male compa, based on only the male compa values.
    5. What do the distributions and spread suggest about male and female salaries?
    6. Why might we want to use compa to measure salaries between males and females?
  5. Based on this sample, what conclusions can you make about the issue of male and female pay equality?
  6. Are all of the results consistent with your conclusion? If not, why not?

will use Employee Salary Data Set.

  1. Using the Excel Analysis ToolPak or the StatPlus:mac LE software function descriptive statistics, generate and show the descriptive

    statistics for each appropriate variable in the sample data set.

    1. For which variables in the data set does this function not work correctly for? Why?
  2. Sort the data by Gen or Gen 1 (into males and females) and find the mean and standard deviation

    for each gender for the following variables:

    1. sal, compa, age, sr and raise. Use either the descriptive stats function or the Fx functions (average and stdev).
  3. What is the probability for a:
    1. Randomly selected person being a male in grade E?
    2. Randomly selected male being in grade E?
    3. Why are the results different?
  4. Find:
    1. The z score for each male salary, based on only the male salaries.
    2. The z score for each female salary, based on only the female salaries.
    3. The z score for each female compa, based on only the female compa values.
    4. The z score for each male compa, based on only the male compa values.
    5. What do the distributions and spread suggest about male and female salaries?
    6. Why might we want to use compa to measure salaries between males and females?
  5. Based on this sample, what conclusions can you make about the issue of male and female pay equality?
  6. Are all of the results consistent with your conclusion? If not, why not?