Research Overview
Robust PCA
There are several ways to approach robust principal component analysis (RPCA). In early work [] we noted that PCA constructs a linear subspace that minimizes the least-squares reconstruction error of the data. Since least squares is not robust to outliers, neither is standard PCA. We reformulated the reconstruction term robustly and solved for the subspace that minimized a robust error term. The approach worked well but was computationally expensive.
As the collection of large datasets becomes increasingly automated, the occurrence of outliers will increase -- big data implies big outliers. As a scalable approach to robust PCA we propose to compute the average subspace spanned by the data; this can then be made robust by computing robust averages [].
We note that in a zero-mean dataset, each observation spans a one-dimensional subspace, giving a point on the Grassmann manifold. We show that the average of these subspaces corresponds to the leading principal component for Gaussian data. We provide a simple algorithm for computing this Grassmann Average (GA), and show that the subspace estimate is less sensitive to outliers than PCA for general distributions.

Because averages can be computed efficiently, we immediately gain scalability. We exploit robust averaging to formulate the Robust Grassmann Average (RGA) as a form of robust PCA. The resulting Trimmed Grassmann Average (TGA) is well suited to computer vision because it is robust to pixel-level outliers. The algorithm has linear computational complexity and minimal memory requirements.

We demonstrate TGA for background modeling, video restoration, and shadow removal, and we show scalability by performing robust PCA on the entire Star Wars IV movie, a task beyond any current method.
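The averaging idea above can be sketched in a few lines of NumPy. This is an illustrative sketch, not the authors' released code: it sign-aligns each observation with the current subspace estimate and re-averages, and replacing the plain mean with a per-coordinate trimmed mean gives a TGA-style robust variant. The function name and the `trim` parameter are assumptions for this sketch.

```python
import numpy as np

def grassmann_average(X, trim=0.0, n_iter=100, seed=0):
    """Estimate the dominant 1-D subspace of zero-mean data X (n x d).

    Sketch of the Grassmann Average idea: each row of X spans a
    one-dimensional subspace; we average those subspaces by flipping
    each row's sign to align it with the current estimate q, then
    averaging. With trim > 0, a per-coordinate trimmed mean is used
    instead, in the spirit of the Trimmed Grassmann Average.
    """
    rng = np.random.default_rng(seed)
    q = rng.standard_normal(X.shape[1])
    q /= np.linalg.norm(q)
    for _ in range(n_iter):
        s = np.sign(X @ q)                  # align each observation with q
        s[s == 0] = 1.0
        A = s[:, None] * X                  # sign-flipped observations
        if trim > 0.0:                      # drop tails in every coordinate
            k = int(A.shape[0] * trim)
            A = np.sort(A, axis=0)[k:A.shape[0] - k]
        q_new = A.mean(axis=0)
        q_new /= np.linalg.norm(q_new)
        if np.allclose(q_new, q):           # converged (up to tolerance)
            return q_new
        q = q_new
    return q
```

For Gaussian data this iteration behaves like a power iteration on the sample covariance, so the estimate approaches the leading principal component; with trimming, a few extreme pixel values cannot dominate any coordinate of the average, which is what makes the approach attractive for image data.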
This page provides source code and video material for the paper Grassmann Averages for Scalable Robust PCA.