The development of this E-Book has been
supported by the British Academy.
Multiple Regression practical
In this practical we will look at regressing two different predictor variables individually on a response, followed by a model containing both
of them. We will also look at a second approach to doing this. This work builds on the earlier simple linear regression practical.
Family background is known to be an important predictor of educational achievement, but as a construct it encompasses many different
dimensions of parental resources, some of which may be more important for children’s learning. In this practical, we explore how two
aspects of parental resources – an indicator of a family’s wealth and the degree of emotional support provided by parents for a child’s
learning – are associated with performance on the PISA science test (SCISCORE). The first predictor variable is WEALTH, which is derived
from reports of whether the family owns eight items, such as a car, a computer and a room of the child’s own. The second predictor
variable is EMOSUPS, which is derived from four items with which students rated their strength of agreement, e.g. “My parents support my
educational efforts and achievements” (see PISA datafile description for further details).
Multiple Regression in SPSS worksheet (Practical)
We start by running the first linear regression to look at if there is a significant (linear) effect of WEALTH on SCISCORE. This is done in
SPSS as follows:
Select Linear from the Regression submenu available from the Analyze menu.
Copy the Science test score[SCISCORE] variable into the Dependent box.
Copy the Family wealth score[WEALTH] variable into the Independent(s) box.
Click on the Statistics button.
On the screen appears add the tick for Confidence Interval to those for Estimates and Model fit.
Click on the Continue button to return to the main window.
Click on the OK button to run the command.
SPSS will produce several tabular outputs but here we will focus on only the model summary and coefficients tables that can be seen
below:
Model Summary
Model
R R Square Adjusted R Square Std. Error of the Estimate
1 .090 .008 .008 102.19569
a. Predictors: (Constant), Family wealth score
Here we see some fit statistics for the overall model. The statistic R here takes the value .090 and is equivalent to the Pearson correlation
coefficient for a simple linear regression, that is a regression with only one predictor variable. R squared (.008) is simply the value of R
squared (R multiplied by itself) and represents the proportion of variance in the response variable, SCISCORE explained by WEALTH. The
table also includes an adjusted R square measure which here takes value .008 and is a version of R squared that is adjusted to take account
of the number of predictors (one in the case of this simple linear regression) that are in the model. We next look at the coefficients table
which is shown below:
Coefficients
Model
Unstandardized Coefficients Standardized Coefficients
t Sig.
95.0% Confidence Interval for B
B Std. Error Beta Lower Bound Upper Bound
1 (Constant) 519.868 1.621 320.763 .000 516.690 523.045
Family wealth score 9.290 1.450 .090 6.406 .000 6.447 12.133
This table often gives the most interesting information about the regression model. We begin with the coefficients that form the regression
equation. The regression intercept (labelled Constant in SPSS) takes the value 519.868 and is the predicted value of SCISCORE when
WEALTH takes value 0. The regression slope, or unstandardised coefficient, (B in SPSS) takes value 9.290 and is the amount by which we
predict that SCISCORE changes for an increase of 1 unit in WEALTH.
Both coefficients have associated standard errors that can be used to assess their significance. SPSS also reports a standardised coefficient
(the Beta) that can be interpreted as a "unit-free" measure of effect size, one that can be used to compare the magnitude of effects of
predictors measured in different units. Here Beta takes the value .090 which represents the predicted change in the number of standard
deviations of SCISCORE for an increase of 1 standard deviation in WEALTH.
To test for the significance of the coefficients we need to form test statistics which are reported under the t column and these are simply B
/ Std.Error. For the slope on WEALTH the t statistic is 6.406 and this value can be compared with a t distribution to test the null hypothesis
that the slope is 0. We can see the resulting p value for the test under the Sig. column. The p value (quoted under Sig.) is .000 (reported as
p < .001) which is less than 0.05. We therefore have significant evidence to reject the null hypothesis that the slope coefficient on WEALTH
is zero.
a