Projection pursuit is a multivariate statistical technique aimed at finding interesting low-dimensional data projections by maximizing a measure of interestingness commonly known as projection index. Widespread use of projection pursuit has been hampered by the computational difficulties inherent to the maximization of the projection index. The problem is addressed within the framework of skewness-based projection pursuit, focused on data projections with highest third standardized cumulants. First, it is motivated the use of the right dominant singular vector of the third multivariate, standardized moment to start the maximization procedure. Second, it is proposed an iterative algorithm for skewness maximization which relies on the analytically tractable maximization of a third-order polynomial in two variables. Both visual inspection and formal testing based on simulated data clearly suggest that the asymptotic distribution of the maximal skewness achievable by a linear projection of normal data might be skew-normal. The potential of skewness-based projection pursuit for uncovering data structures is illustrated with Olympic decathlon data.

Skewness-Based Projection Pursuit: a Computational Approach

Nicola Loperfido
Membro del Collaboration Group
2018

Abstract

Projection pursuit is a multivariate statistical technique aimed at finding interesting low-dimensional data projections by maximizing a measure of interestingness commonly known as projection index. Widespread use of projection pursuit has been hampered by the computational difficulties inherent to the maximization of the projection index. The problem is addressed within the framework of skewness-based projection pursuit, focused on data projections with highest third standardized cumulants. First, it is motivated the use of the right dominant singular vector of the third multivariate, standardized moment to start the maximization procedure. Second, it is proposed an iterative algorithm for skewness maximization which relies on the analytically tractable maximization of a third-order polynomial in two variables. Both visual inspection and formal testing based on simulated data clearly suggest that the asymptotic distribution of the maximal skewness achievable by a linear projection of normal data might be skew-normal. The potential of skewness-based projection pursuit for uncovering data structures is illustrated with Olympic decathlon data.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11576/2655403
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 40
  • ???jsp.display-item.citation.isi??? 34
social impact