Comparison of Canonical Correlation and Partial Least Squares analyses of simulated and empirical data

07/14/2021
by   Anthony R. McIntosh, et al.
0

In this paper, we compared the general forms of CCA and PLS on three simulated and two empirical datasets, all having large sample sizes. We took successively smaller subsamples of these data to evaluate sensitivity, reliability, and reproducibility. In null data having no correlation within or between blocks, both methods showed equivalent false positive rates across sample sizes. Both methods also showed equivalent detection in data with weak but reliable effects until sample sizes drop below n=50. In the case of strong effects, both methods showed similar performance unless the correlations of items within one data block were high. For PLS, the results were reproducible across sample sizes for strong effects, except at the smallest sample sizes. On the contrary, the reproducibility for CCA declined when the within-block correlations were high. This was ameliorated if a principal components analysis (PCA) was performed and the component scores used to calculate the cross-block matrix. The outcome of our examination gives three messages. First, for data with reasonable within and between block structure, CCA and PLS give comparable results. Second, if there are high correlations within either block, this can compromise the reliability of CCA results. This known issue of CCA can be remedied with PCA before cross-block calculation. This, however, assumes that the PCA structure is stable for a given sample. Third, null hypothesis testing does not guarantee that the results are reproducible, even with large sample sizes. This final outcome suggests that both statistical significance and reproducibility be assessed for any data.

READ FULL TEXT

page 11

page 14

page 17

page 19

page 22

page 26

page 28

page 30

research
12/30/2019

B-Value and Empirical Equivalence Bound: A New Procedure of Hypothesis Testing

In this study, we propose a two-stage procedure for hypothesis testing, ...
research
07/12/2019

Can Bayes Factors "Prove" the Null Hypothesis?

It is possible to obtain a large Bayes Factor (BF) favoring the null hyp...
research
11/15/2020

MixTwice: large-scale hypothesis testing for peptide arrays by variance mixing

Peptide microarrays have emerged as a powerful technology in immunoprote...
research
09/29/2017

Comparison of PCA with ICA from data distribution perspective

We performed an empirical comparison of ICA and PCA algorithms by applyi...
research
06/04/2018

Topic Modelling of Empirical Text Corpora: Validity, Reliability, and Reproducibility in Comparison to Semantic Maps

Using the 6,638 case descriptions of societal impact submitted for evalu...
research
11/22/2021

Using prior information to boost power in correlation structure support recovery

Hypothesis testing of structure in correlation and covariance matrices i...

Please sign up or login with your details

Forgot password? Click here to reset