Unweighted estimation based on optimal sample under measurement constraints

10/08/2022
by   Jing Wang, et al.
0

To tackle massive data, subsampling is a practical approach to select the more informative data points. However, when responses are expensive to measure, developing efficient subsampling schemes is challenging, and an optimal sampling approach under measurement constraints was developed to meet this challenge. This method uses the inverses of optimal sampling probabilities to reweight the objective function, which assigns smaller weights to the more important data points. Thus the estimation efficiency of the resulting estimator can be improved. In this paper, we propose an unweighted estimating procedure based on optimal subsamples to obtain a more efficient estimator. We obtain the unconditional asymptotic distribution of the estimator via martingale techniques without conditioning on the pilot estimate, which has been less investigated in the existing subsampling literature. Both asymptotic results and numerical results show that the unweighted estimator is more efficient in parameter estimation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/08/2018

More Efficient Estimation for Logistic Regression with Optimal Subsample

Facing large amounts of data, subsampling is a practical technique to ex...
research
07/17/2019

Optimal Sampling for Generalized Linear Models under Measurement Constraints

Suppose we are using a generalized linear model to predict a scalar outc...
research
05/23/2019

Divide-and-Conquer Information-Based Optimal Subdata Selection Algorithm

The information-based optimal subdata selection (IBOSS) is a computation...
research
06/16/2013

Local case-control sampling: Efficient subsampling in imbalanced data sets

For classification problems with significant class imbalance, subsamplin...
research
04/05/2023

Batch mode active learning for efficient parameter estimation

For many tasks of data analysis, we may only have the information of the...
research
10/25/2021

Nonuniform Negative Sampling and Log Odds Correction with Rare Events Data

We investigate the issue of parameter estimation with nonuniform negativ...
research
07/06/2020

Surprise sampling: improving and extending the local case-control sampling

Fithian and Hastie (2014) proposed a new sampling scheme called local ca...

Please sign up or login with your details

Forgot password? Click here to reset