A Fast and Effective Large-Scale Two-Sample Test Based on Kernels

10/07/2021
by   Hoseung Song, et al.
0

Kernel two-sample tests have been widely used and the development of efficient methods for high-dimensional large-scale data is gaining more and more attention as we are entering the big data era. However, existing methods, such as the maximum mean discrepancy (MMD) and recently proposed kernel-based tests for large-scale data, are computationally intensive to implement and/or ineffective for some common alternatives for high-dimensional data. In this paper, we propose a new test that exhibits high power for a wide range of alternatives. Moreover, the new test is more robust to high dimensions than existing methods and does not require optimization procedures for the choice of kernel bandwidth and other parameters by data splitting. Numerical studies show that the new approach performs well in both synthetic and real world data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/12/2020

Generalized Kernel Two-Sample Tests

Kernel two-sample tests have been widely used for multivariate data in t...
research
12/31/2021

Kernel Two-Sample Tests in High Dimension: Interplay Between Moment Discrepancy and Dimension-and-Sample Orders

Motivated by the increasing use of kernel-based metrics for high-dimensi...
research
02/21/2023

Boosting the Power of Kernel Two-Sample Tests

The kernel two-sample test based on the maximum mean discrepancy (MMD) i...
research
06/14/2023

MMD-FUSE: Learning and Combining Kernels for Two-Sample Testing Without Data Splitting

We propose novel statistics which maximise the power of a two-sample tes...
research
06/03/2020

Learning Kernel Tests Without Data Splitting

Modern large-scale kernel-based tests such as maximum mean discrepancy (...
research
02/02/2022

KSD Aggregated Goodness-of-fit Test

We investigate properties of goodness-of-fit tests based on the Kernel S...
research
06/27/2023

A new classification framework for high-dimensional data

Classification is a classic problem but encounters lots of challenges wh...

Please sign up or login with your details

Forgot password? Click here to reset