LAB4 OUTLIER
1.Open dataset “数据1-RESPONSETIME”. Try to Use bivariate scatter plot to identify the most possible outlier.Show your figure below.
2.Open dataset “数据2-SAT”.Regress “average total score” on other variables and use hii to findout the extreme value in three independent variables (Just show the results ofSPSS).
CASENAME = ( )
Hii= ( )
Regress“average total score” on “average student/teacher ratio”, for this case (thecase name you write above), Hii = ( )
3.Open dataset “数据2-SAT”.Regress “average total score” on “annual salary of teachers” and use studentizeddeleted residual method to check over the “Y29”(South Carolina).
ModelC: Y=( )-( )annual_salary
ModelA: Y = ( )
PRE= ( )
F*= ( )
4.Cook’s D for case 29(SouthCarolina)(consideringthe three independent variables).
D29= ( )
5.Delete the two outliers (change them into missing variables). Redo the multipleregression and write down your regression equation and the PRE value.
Regressionequation: ( )
PRE= ( )