Fill in the highlighted blanks to answer/complete the statements.
Another name for the arithmetic average of a set of numbers is the ________.
What is an outlier?
Which measure of central tendency is most likely to be adversely affected by outliers?
We compute standard deviation by taking the ________ of the variance.
For the following terms, write the equivalent mathematical symbol (letter or letters) for the sample statistic.
5-c) Standard deviation:
5-d) Number of participants in sample:
These are the scores of eleven patients on the Beck Depression Inventory (BDI ® ). Scores can range from 0 to 63.
6-a) What is the mean of the scores? (Compute this using a calculator):
6-b) What is the median of these scores?
6-c ) Compare the mean and median. Does the difference between them suggest that the data are skewed very much?
Question 7a- 7f
• At this website, (http://projects.newyorker.com/story/subway/) you will find an interactive graph concerning New York City’s geography and income.
• Read the introduction and click on different “subway lines” to see how the interactive graph works.
• Note that the abbreviations stand for the four different boroughs:
♣ MAN = Manhattan
♣ BRX = Bronx
♣ BRK = Brooklyn
♣ QNS = Queens
• Also note that hovering your mouse over the dots on the graph displays the subway stop and the median income of households in that area.
7-a) In which of the four boroughs is the median household income highest? (This is made evident as you click on the different lines.)
7-b) Click on the “A” line. Does the line graph for Manhattan show high or low variability? What does this level of variability tell us about the household income in this area of Manhattan?
7-c) Click on Line 6. Though both boroughs depicted show some variability, which borough (not a street) shows the most variability in median household income?
7-d) On Line 3, find the following two subway stops: New Lots Ave. (the first of the Brooklyn stops) and Park Place (the first of the highest stops in Manhattan). What is the difference (calculate) between the median household incomes of the two areas?
7-e) Click on the “F” line. Which subway stop in Brooklyn seems to be an outlier?
7-f) Based on this module/week’s reading, why do you think the author of this graph chose the median household income instead of the mean household income to describe central tendency?
Part II: SPSS Analysis
Green and Salkind Course Text, Lesson 21: Exercises 1, 4, 7, and 8
• Questions 1 and 4
o Green and Salkind Text, Lesson 21
o Lesson 21 Exercise File 1 (located under course’s Assignment Instructions folder)
Exercises 1a-1d and Exercise 4
Green and Salkind, Lesson 21, Exercises 1 and 4 ( do not complete numbers 2 or 3)
Open “Lesson 21 Exercise File 1” in order to complete these exercises.
David collects anxiety scores from 15 college students who visit the university health center during finals week:
1) Compute descriptive statistics on the anxiety scores.
From the output, identify the following:
c. Standard deviation
Answer: SPSS Table- Descriptive Statistics for Anxiety Scores
(paste Table in this cell)
1-c) Standard deviation:
Create a histogram to show the distribution of the anxiety scores.
Edit the graph so that most of the normal curve is visible.
Answer: SPSS Figure- Histogram for Anxiety Scores with Normal Curve
(paste Figure in this cell)
Part III: SPSS Data Entry and Analysis
Data provided below.
The steps will be the same as the ones you have been practicing in Part I of the assignment; the only difference is that you are now responsible for creating the data file, as well. Remember to do the following:
a) Name and define your variables under the Variable View, then return to the Data View to enter the data; and
b) Paste all SPSS output and graphs into your homework file at the appropriate place.
The following data are based on numbers taken from the Bureau of Labor Statistics surveys from the years 2009–2013, with some modifications. They represent the average weekly pay for wage and salary earners measured at 4 different quarters each year and broken down by gender.
• Enter these data into a new file containing one variable for gender and one variable for salary.
• For the gender variable in column 1, code women as 1 and men as 2
o Remember to define these in Value Labels as covered in presentations since Module 1.
o There will be twenty “1”s and twenty “2”s (as many participants as in each group).
• The corresponding earnings will be entered in the salary column (second column).
• If you need an example, look at the set-up of gender in the “Lesson 21 Practice Data File 1” document.
649 654 652 675 660 667 675 668 681 671
685 692 676 689 683 700 692 688 688 694
825 818 813 831 822 851 815 816 833 827
830 828 833 846 837 851 868 831 881 854
In SPSS, compute descriptive statistics for each gender for these data.
The necessary steps are covered in the last 3 slides of this module’s SPSS tutorial.
Answer- SPSS Table- Descriptive Statistics for Weekly Pay by Gender:
(Paste one table; should include weekly pay grouped by men and women)
Create a boxplot describing the average overall weekly salary for each gender.
Answer- SPSS boxplot:
Based on these data, what is the mean weekly income for men?
What is the standard deviation of the weekly income for women?
Part IV: Cumulative
• The BDI (Beck et al., 1961) is an instrument widely used to assess levels of depression in individuals in a variety of settings.
• The scores range from 0–63 (whole numbers only).
• A researcher administers the BDI to a sample of college students. The results appear in the table below.
• They are entered as two columns to save space (i.e., you will not need two columns in the SPSS file, only one).
Using SPSS, run a frequencies analysis on these scores that includes:
• A Frequency Table
• Table of Descriptive Statistics
o Include measures of central tendency, dispersion/spread, and distribution.
• Paste the output into your homework file.
Answer- SPSS Table- Frequency Table for BDI Scores:
Answer- SPSS Table- Descriptive Statistics for BDI Scores:
Create a graph to describe the data set.
Justify why this graph is the best choice for these data.
Answer- SPSS graph
Based on your SPSS output, what is the numerical skewness value of this data set?
What does this value say about how the scores are distributed?
Based on your graph and your answers above, which measure of central tendency is most appropriate for describing this group of scores?