4.
Process Modeling
4.5.
Use and Interpretation of Process Models
4.5.1.
What types of predictions can I make using the model?
4.5.1.2.
|
How can I estimate the value and uncertainty of a single observable response?
|
|
|
A Different Type of Prediction
|
In addition to estimating the average value of the response variable, as discussed on the
previous page, it is also possible to make predictions of the values of new measurements or
observations from a process. Unlike the true average response, a new measurement is often actually observable
in the future. However, there are a variety of different situations when a prediction of a measurement value may be
more desirable than actually making an observation from the process.
|
|
Example
|
For example, suppose that a concrete supplier needs to supply concrete of a specified measured strength for a particular
contract, but knows that strength varies systematically with the ambient temperature when the concrete is poured. In
order to be sure that the concrete will meet the specification, prior to pouring, samples from the batch of raw materials
can be mixed, poured, and measured in advance, and the relationship between temperature and strength can be modeled.
Then predictions of the strength across the range of possible field temperatures can be used to ensure the product is
likely to meet the specification. Later, after the concrete is poured (and the temperature is recorded), the accuracy of
the prediction can be verified.
|
|
The mechanics of predicting a new measurement value associated with a combination of predictor variable values are similar
to the steps used in the estimation of the average response value. In fact, the actual estimate of the new measured value
is obtained by evaluating the estimated regression function at the relevant predictor variable values, exactly as is done
for the average response. The estimates are the same for these two quantities because, assuming the model fits the data,
the only difference between the average response and a particular measured response is a random error. Because the error
is random, and has mean of zero, there is no additional information in the model that can be used to predict the particular
response other than the information than is available when predicting the average response.
|
|
Uncertainties Do Differ
|
As when estimating the average response, a probabilistic interval is used when predicting a new measurement to provide the
information needed to make engineering or scientific conclusions. However, even though the estimates of the average
response and a particular response values are the same, the uncertainties of the two estimates do differ from one another.
This is because the uncertainty of the measured response must include both the uncertainty of the estimated average
response, and the uncertainty of the new measurement which could conceptually be observed. This uncertainty must be
included if the interval that will be used to summarize the prediction result is to contain the new measurement with the
specified confidence. To help distinguish the two types of predictions, the probabilistic intervals for estimation of a
new measurement value are called prediction intervals rather than confidence intervals.
|
|
Standard Deviation of Prediction
|
The estimate of the standard deviation of the average response, , is obtained
as described earlier. Because the residual standard deviation describes the random variation
in each individual measurement or observation from the process, , the estimate of the
residual standard deviation obtained when fitting the model to the data, is used to account for the extra uncertainty
needed to predict a
measurement value. Since the new observation is independent of the data used to fit the model, the estimates of the
two standard deviations are then combined by "root-sum-of-squares" or "in quadrature", according to standard formulas for
computing variances, to obtain the standard deviation of the prediction of the new measurement,
. The formula for is
.
|
|
Coverage Factor and Prediction Interval Formula
|
Because both and are mathematically
nothing more than different scalings of , and coverage factors from the
t distribution only depend on the amount of data available for estimating ,
the coverage factors are the same for confidence and prediction intervals. Combining the coverage factor and the standard
deviation of the prediction, the formula for constructing prediction intervals is given by
.
As for computation of confidence intervals, some software may provide the total uncertainty for
the prediction interval given the equation above, or may provide the lower and upper prediction bounds.
As suggested before, however, it is a good idea to test the software on an example for which
prediction limits are already available to make sure that the software is computing the expected type of intervals.
|
|
Prediction Intervals for the Example Applications
|
Computing prediction intervals for the measured pressure in the
Pressure/Temperature example, at temperatures of 25, 45 and 65, and for the
measured torque on specimens from the polymer relaxation example at different
times and temperatures gives the results listed in the tables below. Note: the number of significant digits shown in the
tables below is larger than would normally be reported. However, as many significant digits as possible should be carried
throughout all calculations and results should only rounded for final reporting. If reported numbers may be used in
further calculations, then they should not be rounded even when finally reported. A useful rule for rounding final results
that will not be used for further computation is to round all of the reported values to one or two digits in the total
uncertainty, . This is the convention for rounding that has been used in the tables
below.
|
|
Pressure / Temperature Example
|
|
|
|
|
|
|
|
Lower 95% Prediction Bound
|
Upper 95% Prediction Bound
|
|
| 25 |
106.0025 |
4.299099 |
1.197616 |
4.462795 |
2.024394 |
9.034455 |
97.0 |
115.0 |
| 50 |
204.2560 |
4.299099 |
0.735741 |
4.361601 |
2.024394 |
8.829600 |
195.4 |
213.1 |
| 65 |
263.2081 |
4.299099 |
1.244162 |
4.475510 |
2.024394 |
9.060197 |
254.1 |
272.3 |
|
|
Polymer Relaxation Example
|
|
|
|
|
|
|
|
|
Lower 95% Prediction Bound
|
Upper 95% Prediction Bound
|
|
| 20 |
25 |
5.586307 |
0.04341221 |
0.02840153 |
0.05187742 |
2.000298 |
0.10377030 |
5.48 |
5.69 |
| 80 |
25 |
4.998012 |
0.04341221 |
0.01217109 |
0.04508609 |
2.000298 |
0.09018560 |
4.91 |
5.09 |
| 20 |
50 |
6.960607 |
0.04341221 |
0.01371149 |
0.04552609 |
2.000298 |
0.09106573 |
6.87 |
7.05 |
| 80 |
50 |
5.342600 |
0.04341221 |
0.01007761 |
0.04456656 |
2.000298 |
0.08914639 |
5.25 |
5.43 |
| 20 |
75 |
7.521252 |
0.04341221 |
0.01205401 |
0.04505462 |
2.000298 |
0.09012266 |
7.43 |
7.61 |
| 80 |
75 |
6.220895 |
0.04341221 |
0.01330727 |
0.04540598 |
2.000298 |
0.09082549 |
6.13 |
6.31 |
|
|
Interpretation of Prediction Intervals
|
Simulation of many sets of data from a process model provides a good way to get a detailed understanding
of the probabilistic nature of the prediction intervals. The main advantage of using simulation is that it
allows direct comparison of how prediction intervals constructed from a limited amount of data relate to
the measured values that are being estimated.
|
|
|
The plot below shows 95% prediction intervals computed using 50 independently generated data sets that follow the same
model as the data in the Pressure/Temperature example. Random errors from the normal distribution with a mean of zero
and a known standard deviation are added to each set of true temperatures and true pressures that lie on a perfect
straight line to obtain the simulated data. Then the noisy data in each data set are used to compute a prediction interval
for a newly observed pressure at a temperature of 65. The newly observed measurements, observed after making the
prediction, are noted with an "X" for each data set.
|
|
Prediction Intervals Computed from 50 Sets of Simulated Data
|
|
|
Confidence Level Specifies Long-Run Interval Coverage
|
From the plot it is easy to see that not all of the intervals contain the pressure values observed after the prediction was
made. Data set 4 produced intervals that did not capture the newly observed pressure measurement at a temperature of
65. However, for 49 out of 50, or a bit over 95% of the data sets, the prediction intervals did capture the measured
pressure. When the number of data sets was increased to 5000, prediction intervals computed for 4734, or 94.68%, of the
data sets covered the new measured values. Finally, when the number of data sets was increased to 10000, 94.92% of the
confidence intervals computed covered the true average pressure.
Thus, the simulation shows that although any particular prediction interval might not cover its associated new measurement,
in repeated experiments this method produces intervals that contain the new measurements at the rate specified by the user
as the confidence level.
|
|
Comparison with Confidence Intervals
|
It is also interesting to compare these results to the analogous results for confidence
intervals. Clearly the most striking difference between the two plots is in the sizes of the uncertainties. The
uncertainties for the prediction intervals are much larger because they must include the standard deviation of a single
new measurement, as well as the standard deviation of the average response value. The standard deviation of the average
response value is lower because a lot of the random error that is each measurement cancels out when the data is used to
estimate the unknown parameters in the model. In fact, if as the sample size increases, the limit on the length of a
confidence interval approaches zero while the limit on the length of the prediction interval as the sample size increases
approaches . Understanding the different types of intervals and the bounds on interval length
can be important when planning an experiment that requires a result to have no more than a specified level of uncertainty
to have engineering value.
|
|
Interpretation Summary
|
To summarize the interpretation of the probabilistic nature of confidence intervals in words, in independent, repeated
experiments, will cover their true values, given that the assumptions needed for
the construction of the intervals hold.
|