

Publications in Scientific Journals:

M.C. Mert, P. Filzmoser, K. Hron:
"Sparse principal balances";
Statistical Modelling, 15 (2015), 2; 159 - 174.



English abstract:
Compositional data analysis deals with situations where the relevant information is contained only in the ratios between the measured variables, and not in the reported values. This article focuses on high-dimensional compositional data (in the sense of hundreds or even thousands of variables), as they appear in chemometrics (e.g., mass spectral data), proteomics or genomics. The goal of this contribution is to perform a dimension reduction of such data, where the new directions should allow for interpretability. An approach named principal balances has turned out to be successful for low dimensions. Here, the concept of sparse principal component analysis is proposed for constructing principal directions, the so-called sparse principal balances. They are sparse (contain many zeros), form an orthonormal basis in the sample space of the compositional data, are efficient for dimension reduction and are applicable to high-dimensional data.
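
The notion of a balance mentioned in the abstract can be illustrated with a minimal sketch. It is not the article's algorithm (which constructs the directions via sparse principal component analysis); it only shows the standard definition of a single balance between two groups of parts, and that such a coordinate depends on ratios alone. The function names `balance` and `balance_vector` and the example data are hypothetical and chosen purely for illustration.

```python
import math

def balance(x, num, den):
    """Balance (one isometric logratio coordinate) between two groups of parts.

    x   : list of strictly positive parts of one composition
    num : indices of the parts in the numerator group
    den : indices of the parts in the denominator group
    """
    r, s = len(num), len(den)
    # The log of a geometric mean is the arithmetic mean of the logs.
    mean_log_num = sum(math.log(x[i]) for i in num) / r
    mean_log_den = sum(math.log(x[i]) for i in den) / s
    return math.sqrt(r * s / (r + s)) * (mean_log_num - mean_log_den)

def balance_vector(d, num, den):
    """Loading vector of the same balance in centred-logratio coordinates.

    Parts that belong to neither group get a zero loading, which is the
    sense in which a balance is a sparse direction.
    """
    r, s = len(num), len(den)
    v = [0.0] * d
    for i in num:
        v[i] = math.sqrt(s / (r * (r + s)))
    for i in den:
        v[i] = -math.sqrt(r / (s * (r + s)))
    return v

# Illustrative composition with four parts (made-up values).
x = [0.1, 0.2, 0.3, 0.4]
scaled = [10 * xi for xi in x]  # same ratios, different reported values

b_original = balance(x, [0, 1], [2])
b_scaled = balance(scaled, [0, 1], [2])  # agrees with b_original up to rounding

v = balance_vector(len(x), [0, 1], [2])  # fourth loading is zero
```

Rescaling the composition leaves the balance unchanged, which reflects the statement that only the ratios carry information. The loading vector has unit length and sums to zero, so a set of mutually orthogonal vectors of this kind forms an orthonormal basis of the kind the abstract refers to.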

Keywords:
principal component analysis; compositional data; isometric logratio transformation; sparseness


"Official" electronic version of the publication (accessed through its Digital Object Identifier - DOI)
http://dx.doi.org/10.1177/1471082X14535525


Created from the Publication Database of the Vienna University of Technology.