Data matrix (multivariate statistics)

In multivariate statistics, a data matrix is a matrix of data of dimension n-by-p, where n is the number of samples observed, and p is the number of variables (features) measured in all samples.^[1]^[2]

In this representation different rows typically represent different repetitions of an experiment, while columns represent different types of data (say, the results from particular probes). For example, suppose an experiment is run where 10 people are pulled off the street and asked four questions. The data matrix M would be a 10×4 matrix (meaning 10 rows and 4 columns). The datum in row i and column j of this matrix would be the answer of the i ^th person to the j ^th question.

References

↑ Johnson, Richard A; Wichern, Dean W (2001). Applied Multivariate Statistical Analysis. Pearson. pp. 111–112. ISBN 0131877151.
↑ "Basic Concepts for Multivariate Statistics p.2" (PDF). SAS Institute.

This article is issued from Wikipedia - version of the 11/26/2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.

Data matrix (multivariate statistics)

See also

See also

References