Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018;12(1):1150-1180.
doi: 10.1214/18-EJS1421. Epub 2018 Mar 27.

Supervised multiway factorization

Affiliations

Supervised multiway factorization

Eric F Lock et al. Electron J Stat. 2018.

Abstract

We describe a probabilistic PARAFAC/CANDECOMP (CP) factorization for multiway (i.e., tensor) data that incorporates auxiliary covariates, SupCP. SupCP generalizes the supervised singular value decomposition (SupSVD) for vector-valued observations, to allow for observations that have the form of a matrix or higher-order array. Such data are increasingly encountered in biomedical research and other fields. We use a novel likelihood-based latent variable representation of the CP factorization, in which the latent variables are informed by additional covariates. We give conditions for identifiability, and develop an EM algorithm for simultaneous estimation of all model parameters. SupCP can be used for dimension reduction, capturing latent structures that are more accurate and interpretable due to covariate supervision. Moreover, SupCP specifies a full probability distribution for a multiway data observation with given covariate values, which can be used for predictive modeling. We conduct comprehensive simulations to evaluate the SupCP algorithm. We apply it to a facial image database with facial descriptors (e.g., smiling / not smiling) as covariates, and to a study of amino acid fluorescence. Software is available at https://github.com/lockEF/SupCP.

Keywords: Faces in the wild; dimension reduction; latent variables; parafac/candecomp; singular value decomposition; tensors.

PubMed Disclaimer

Figures

Fig 1
Fig 1
Rank-1 components corresponding to Phenylalin, Tryptophan and Tryosine. The bottom two rows give the loadings for emission and excitation wavelengths, and the top row gives their resulting product.
Fig 2
Fig 2
Examples of frontalized faces.
Fig 3
Fig 3
Log-likelihood for the estimated SupCP and SupSVD models, for different ranks, on the training images used to fit the model and the test images.
Fig 4
Fig 4
Log-likelihood for the model over 2000 EM iterations. The first 500 iterations (a) incorporate random annealing to avoid local modes.
Fig 5
Fig 5
Constructed faces under different attribute covariates: SupCP.
Fig 6
Fig 6
Constructed faces under different attribute covariates: SupSVD.
Fig 7
Fig 7
True rank vs. estimated rank for 100 randomly generated simulations, under different levels of noise variance. A small amount of jitter is added to show multiple points with the same coordinates.

References

    1. Allen G. Sparse higher-order principal components analysis. International Conference on Artificial Intelligence and Statistics. 2012:27–36.
    1. Allen GI. Computational Advances in Multi-Sensor Adaptive Processing (CAM-SAP), 2013 IEEE 5th International Workshop on. IEEE; 2013. Multi-way functional principal components analysis; pp. 220–223.
    1. Andersen CM, Bro R. Practical aspects of PARAFAC modeling of fluorescence excitation-emission data. Journal of Chemometrics. 2003;17:200–215.
    1. Bair E, Hastie T, Paul D, Tibshirani R. Prediction by supervised principal components. Journal of the American Statistical Association. 2006;101:119–137.
    1. Bauckhage C. Image Analysis and Recognition. Springer; 2007. Robust tensor classifiers for color object recognition; pp. 352–363.

LinkOut - more resources