LESSON SIX

SCATTERPLOTS AND BIVARIATE PROCEDURE

 

Example 1. The fifth-grade students in Mr. Brown’s class were given a vocabulary test and a reading comprehension test.

 TASKS 

1] Create a scatterplot.

 Plot the scores on two variables for each subject.

 2] Compute the correlation coefficient 

I. Scatterplot

SPSS for Windows

A. Define the variables: voc and com. Enter data.

Note that you may press the Tab key to move one cell to the right (the second variable).

B. Create a scatterplot with scores on the vocabulary test as the X variable.

        a. From the menus choose: Graphs / Scatter

         Select the Simple picture button. Click on Define.

         Select `com` as the Y Axis variable. Select `voc` as the X Axis variable. Click on OK.

         b. Modify the chart to produce a linear regression line.

         Instructions for SPSS 11.5

      (a) Double-click on the scatterplot to bring up the Chart Editor window.

         (b) From the Chart Editor Window menus choose: Chart / Options

         In the Fit Line area, click on Total. Click on the Fit Options button.

         Fit Method. Select Linear Regression. It is the default.

         Regression Options. Click on Display R-squared in legend as shown above.

         Click Continue and OK. Close the Chart Editor window and return to the viewer window.

          Instructions for SPSS 12.0 ,

         (a) Double-click on the scatterplot to bring up the Chart Editor window.

         (b) Click on any one of the data points to highlight all of them.

        (c) From the Chart Editor Window menus choose: Chart / Add Chart Element/ Fit Line at Total.

        Close the Chart Editor window and return to the viewer window.
 


SPSS Output

A. Inspect the scatterplot.

 
 

 

a. It is a positive linear relationship. There is a tendency for high scores on the vocabulary test to be associated with high scores on the reading comprehension test. There is a tendency for low scores on the vocabulary test to be associated with low scores on the reading comprehension test       

Readings: Scatterplots by Oswego City School District

B. R-squared = .6378

About 64 percent of the variance in the reading comprehension scores is predictable from the vocabulary scores and vice versa.

   

II. Apply the Bivariate procedure and obtain the Pearson r.

 SPSS for Windows

A. From the menus choose: Analyze / Correlate / Bivariate

B. Select the variables to be correlated (voc and com). Click OK to obtain the default Pearson correlation.

SPSS Printout

The Pearson product-moment correlation coefficient is about .80. Note that the degrees of freedom are 7 - 2 = 5.

Test of Pearson Correlation

r = .80

Research Question: Is there a significant relationship between the vocabulary test scores and  the reading comprehension test scores? 
 

Hypotheses

The null hypothesis states that the population correlation, rho, is zero.
The alternative hypothesis states that the population correlation, rho, is different from zero. 

To test if the observed correlation is significantly different from zero, first compute a test statistic, a t ratio. How far does the sample correlation (r) from the assumed population correlation (0) in standard error units?

The t ratio can be computed as

 

The denominator is the standard error of r.

Examine the formula

Note that as the sample size (N) increases, the standard error of r decreases. Even small correlations may be statistically significance.

 Next, locate the obtained t value in the t distribution with 5 degrees of freedom and find the associated probability. Finally, compare the probability to the .05 significance level. Note that  the two-tailed observed significance level was 031. Since the probability is less than .05, the researcher would reject the null hypothesis and declare the result to be statistically significant.  

Report the results

The correlation between vocabulary and reading  comprehension was significant, r(5) = .80, p < .05. Students who earned high scores on the vocabulary test also tended to earn high scores on the reading comprehension test. Students who earned low scores on the vocabulary test also tended to earn low scores on the reading comprehension test. 

 

Example 2. Open an existing SPSS data file -- Cars. Create a scatterplot of miles per gallon and vehicle weight. Compute the correlation between the two variables.

A. Open the Cars data file.

B. Create a scatterplot:  vehicle weight ( X) and miles per gallon (Y)

         a. From the menus choose: Graphs / Scatter

         Select the Simple picture button. Click on Define.

         Select ` Miles per Gallon ` as the Y Axis variable.

         Select ` Vehicle Weight ` as the X Axis variable. Click on OK.

         b. Modify the chart to produce a linear regression line.

         Instructions for SPSS 11.5

        (a) Double-click on the scatterplot to bring up the Chart Editor window.

        (b) From the Chart Editor Window menus choose: Chart / Options

       In the Fit Line area, click on Total. Click on the Fit Options button.

         Fit Method. Select Linear Regression.

         Regression Options. Click on Display R-squared in legend.

         Click Continue. Click OK. Close the Chart Editor window.

         Instructions for SPSS 12.0 ,

         (a) Double-click on the scatterplot to bring up the Chart Editor window.

         (b) Click on any one of the data points to highlight all of them.

         (c) From the Chart Editor Window menus choose: Chart / Add Chart Element/ Fit Line at Total.

         Close the Chart Editor window and return to the viewer window.

 

What do you observe?

There is a negative relationship between miles per gallon and vehicle weight.. About 65 percent of the variance in miles per gallon is predictable from vehicle weight.

C. From the menus choose: Analyze / Correlate / Bivariate

Select the variables to be correlated (Miles per Gallon and Vehicle Weight). Click OK to obtain the default Pearson correlation. r = -.807

 

Example 3. Add a categorical variable (Country of Origin) to the scatterplot. Create a scatterplot of miles per gallon and vehicle weight by country of origin.

         From the menus choose: Graphs / Scatter

          Select the Simple picture button. Click on Define.

         Select `Miles per Gallon` as the Y Axis variable.

         Select `Vehicle Weight` as the X Axis variable.

         Set Markers by: Select Country of Origin. Click on OK.