Outliers - PowerPoint PPT Presentation

1 / 9
About This Presentation
Title:

Outliers

Description:

1. Main Ideas. Outliers. Residual plot. Visual keys to selecting the correct model ... of x versus the residuals (a residual is same as DEV which is how far above or ... – PowerPoint PPT presentation

Number of Views:474
Avg rating:3.0/5.0
Slides: 10
Provided by: Dic112
Category:
Tags: keys | outliers

less

Transcript and Presenter's Notes

Title: Outliers


1
Lesson 7b
Judging Whether a Regression Line Adequately
Describes the Data
  • Outliers
  • Residual plot
  • Visual keys to selecting the correct model

Main Ideas
1
2
  • 1. Outliers
  • An outlier is an observation that doesnt appear
    to fit the pattern of the other data.
  • An outlier can have a significant effect on the
    regression line, especially if it occurs at the
    extreme of the data set.

!
2
3
  • 2. Outlier Will Change Equation of Line

With outlier y 3.65 .52x
Without outlier y 1.76 1.11x
3
4
  • 3. Residual Plot
  • This is a plot of x versus the residuals (a
    residual is same as DEV which is how far above or
    below the line a point is).
  • What to look for?
  • Are residuals randomly scattered about 0?
  • If not random, what trend do the residuals
    follow?
  • Do the residuals show greater variability for
    some xs than others?

4
5
  • 4. Some Non-Random Patterns in Residuals.

Quadratic-like Trend
Increasing Variability
0
5
6
  • 5. Residuals Plots Can Help Select the Right
    Equation for the Data
  • Generally, if some pattern in the residual is
    present it will lead us to consider some way to
    describe the data other than with a straight
    line.
  • Curved pattern may mean a quadratic equation.
  • If variability depends on x, then a
    transformation, such as a logarithmic
    transformation, may be appropriate.

6
7
  • 6. Before and After a Transformation.
  • Comparing x vs y and x vs log10(y)

x vs y
x vs log10(y)
7
8
  • 7. Residual Plots for the Previous Graphs

Curved pattern for residuals
Residuals scattered about 0
8
9
Why isnt a plot of x vs. y good enough? Why do
need residual plots?
Residual plots are particularly useful in
so-called multiple regression where we use more
than one predictor at once. What we are doing
now is illustrating an idea.
9
Write a Comment
User Comments (0)
About PowerShow.com