ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

DETECTING OUTLIER IN THE MULTIVARIATE DISTRIBUTION USING PRINCIPAL COMPONENTS

Journal: INTERNATIONAL JOURNAL OF ENGINEERING SCIENCE TECHNOLOGIES (Vol.7, No. 2)

Publication Date:

Authors : ;

Page : 107-113

Keywords : Outliers; Principal Components; Eigenvalues; Proximity; Multivariate Distribution;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

It is crucial to make inference out of the data at hand. It makes sense to discard spurious observations prior to application of statistical analysis. This study advances a procedure of determining outliers based from the principal components of the original variables. These variables are sorted and given weights based on the magnitude of their inner product with the principal components formulated from the centered and scaled variables. The weights are the corresponding variances explained by the principal components. The measure of proximity among observations is proportionate to the variance (eigenvalues) associated with the principal components. The methodology defines two distinct subintervals where the suspected outliers settle in one of these subintervals based on the proximity measures δo. On the merit of simulated data, the procedure detected 100 percent when the outliers are coming from distinct distribution. On the other hand, the procedure detected 98.7 per cent when the distribution of outliers have equal variance-covariance matrix with the outlier-free distribution and a slight difference in the vector of means.

Last modified: 2023-05-04 14:18:39