|
4.
Process Modeling
4.1. Introduction to Process Modeling
|
|||
| Model Components |
There are three main parts to every process model. These are
|
||
| Form of Model |
The general form of the model is
All process models discussed in this chapter have this general form. As alluded to earlier, the random errors that are included in the model make the relationship between the response variable and the predictor variables a "statistical" one, rather than a perfect deterministic one. This is because the functional relationship between the response and predictors holds only on average, not for each data point. |
||
| Some of the details about the different parts of the model are discussed below, along with alternate terminology for the different components of the model. | |||
| Response Variable |
The response variable, |
||
| Mathematical Function |
The mathematical function consists of two parts. These parts are
the predictor variables, The parameters are the quantities that will be estimated during the modeling process. Their true values are unknown and unknowable, except in simulation experiments. As for the predictor variables, the collection of all of the parameters is denoted by |
||
|
The parameters and predictor variables are combined in different forms
to give the function used to describe the deterministic variation in the
response variable. For a straight line with an unknown intercept and
slope, for example, there are two parameters and one predictor variable
For a straight line with a known slope of one, but an unknown intercept, there would only be one parameter For a quadratic surface with two predictor variables, there are six parameters for the full model. |
|||
| Random Error |
Like the parameters in the mathematical function, the random errors
are unknown. They are simply the difference between the data and the
mathematical function. They are assumed to follow a particular probability
distribution, however, which is used to describe their aggregate behavior.
The probability distribution that describes the errors has a mean of zero
and an unknown standard deviation, denoted by
|
||
| Alternate Terminology | Unfortunately, there are no completely standardardized names for the parts of the model discussed above. Other publications or software may use different terminology. For example, another common name for the response variable is "dependent variable". The response variable is also simply called "the response" for short. Other names for the predictor variables include "explanatory variables", "independent variables", "predictors" and "regressors". The mathematical function used to describe the deterministic variation in the response variable is sometimes called the "regression function", the "regression equation", the "smoothing function", or the "smooth". | ||
| Scope of "Model" | In its correct usage, the term "model" refers to the equation above and also includes the underlying assumptions made about the probability distribution used to describe the variation of the random errors. Often, however, people will also use the term "model" when referring specifically to the mathematical function describing the deterministic variation in the data. Since the function is part of the model, the more limited usage is not wrong, but it is important to remember that the term "model" might refer to more than just the mathematical function. | ||