The cv.lm() function is defined under the DAAG package used for k-fold validation while the stepAIC() function is defined under the MASS package that performs stepwise model selection under exactAIC.
In R, cv.lm() and stepAIC() are functions commonly used in the context of linear regression modeling and model selection.
cv.lm() Function:
cv.lm() stands for cross-validation for linear models. It is part of the DAAG (Data Analysis and Graphics Data and Functions) package in R. This function is used for model selection and validation by performing k-fold cross-validation on linear regression models.
Usage:
cv.lm(data, form.lm, m=10, dots=FALSE)
data: A data frame containing the variables.
form.lm: A formula specifying the linear model.
m: The number of folds in cross-validation.
dots: A logical value indicating whether dots should be printed as the function works.
Explanation:
The function fits linear models for each fold of the data and computes the cross-validated mean squared error (CV MSE). It helps in assessing the performance of the linear model and selecting the appropriate model complexity.
stepAIC() Function:
stepAIC() is part of the MASS (Modern Applied Statistics with S) package in R. It is used for stepwise model selection based on Akaike Information Criterion (AIC). The AIC is a measure that balances the goodness of fit with the complexity of the model, and the goal is to find a model that minimizes the AIC.
Usage:
stepAIC(object, scope, direction = c(“both”, “backward”, “forward”),
trace = 1, k = 2, scale = 0, …)
object: A fitted model object (usually a linear model).
scope: A list indicating the range of models to be considered.
direction: The direction of the stepwise search.
trace: A logical value indicating whether to print the stepwise progress.
k: A penalty term in the AIC formula.
scale: A scaling factor for the AIC.
Explanation:
The function fits models by adding or removing variables based on the AIC, stopping when no further improvements are possible. It helps automate the process of model selection by iteratively adding or removing predictors to find a parsimonious model.
In summary, cv.lm() is used for cross-validation of linear models, while stepAIC() is used for stepwise model selection based on the AIC criterion. These functions are useful tools in the process of building and refining linear regression models in R.