hidden pixel

Test Set Information

A test set is a set of data used in various areas of information science to assess the strength and utility of a predictive relationship. Test sets are used in artificial intelligence, machine learning, genetic programming, intelligent systems, and statistics. In all these fields, a test set has much the same role.

Contents

Rationale

Many procedures have been developed regression analysis was one of the earliest such approaches to be developed. The data used to construct or discover a predictive relationship are called the training data set. Most approaches that search through training data for empirical relationships tend to overfit the data, meaning that they can identify apparent relationships in the training data that do not hold in general. A test set is a set of data that is independent of the training data, but that follows the same probability distribution as the training data. If a model fit to the training set also fits the test set well, minimal overfitting has taken place. If the model fits the training set much better than it fits the test set, overfitting is likely the cause.

Example

A training set (left) and a test set (right) from the same statistical population are shown as blue points. Two predictive models are fit to the training data. Both fitted models are plotted with both the training and test sets. In the training set, the MSE of the fit shown in orange is 4 whereas the MSE for the fit shown in green is 9. In the test set, the MSE for the fit shown in orange is 15 and the MSE for the fit shown in green is 13. The orange curve severely overfits the training data, since its MSE increases by almost a factor of four when comparing the test set to the training set. The green curve overfits the training data much less, as its MSE increases by less than a factor of 2.

See also

External links

This programming language-related article is a stub. You can help Wikipedia by expanding it.
Statistics
Descriptive statistics
Continuous data
Location
Dispersion
Shape
Count data
Summary tables
Dependence
Statistical graphics
Data collection
Designing studies
Survey methodology
Controlled experiment
Uncontrolled studies
Statistical inference
Statistical theory
Bayesian inference
Frequentist inference
Specific tests
General estimation
Correlation and regression analysis
Correlation
Regression analysis
Linear regression
Non-standard predictors
Generalized linear model
Partition of variance
Categorical, multivariate, time-series, or survival analysis
Categorical data
Multivariate statistics
Time series analysis
General
Time domain
Frequency domain
Survival analysis
Applications
Biostatistics
Engineering statistics
Social statistics
Spatial statistics

Categories:

 

The above information uses material from Wikipedia and is licensed under the GNU Free Documentation License.
Some facts may not have been fully verified for accuracy. [Disclaimers]
This page was last archived by our server on Sat Jun 2 05:14:10 2012.
Displaying this page or its contents does not use any Wikimedia Foundation's resources.
The owners of this site proudly support the Wikimedia Foundation.