|
1.
Exploratory Data Analysis
1.3. EDA Techniques 1.3.3. Graphical Techniques: Alphabetic
|
|||
|
Purpose: Check Randomness |
Autocorrelation plots
(Box and Jenkins, pp. 28-32)
are a commonly-used tool for checking randomness in a data set.
This randomness is ascertained by computing autocorrelations for
data values at varying time lags. If random, such autocorrelations
should be near zero for any and all time-lag separations. If
non-random, then one or more of the autocorrelations will be
significantly non-zero.
In addition, autocorrelation plots are used in the model identification stage for Box-Jenkins autoregressive, moving average time series models. |
||
|
Sample Plot: Autocorrelations should be near-zero for randomness. Such is not the case in this example and thus the randomness assumption fails |
This sample autocorrelation plot shows that the time series is not random, but rather has a high degree of autocorrelation between adjacent and near-adjacent observations. |
||
|
Definition: r(h) versus h |
Autocorrelation plots are formed by
|
||
| Questions |
The autocorrelation plot can provide answers to the following
questions:
|
||
|
Importance: Ensure validity of engineering conclusions |
Randomness (along with fixed model, fixed variation, and fixed distribution) is one of the four assumptions that typically underlie all measurement processes. The randomness assumption is critically important for the following three reasons:
|
||
| Examples | Examples of the autocorrelation plot for several common situations are given in the following pages. | ||
| Related Techniques |
Partial Autocorrelation
Plot Lag Plot Spectral Plot Seasonal Subseries Plot |
||
| Case Study | The autocorrelation plot is demonstrated in the beam deflection data case study. | ||
| Software | Autocorrelation plots are available in most general purpose statistical software programs including Dataplot. | ||