Efficient Estimation of the number of neighbours in Probabilistic K Nearest Neighbour Classification

05/05/2013
by   Ji Won Yoon, et al.

Probabilistic k-nearest neighbour (PKNN) classification was introduced to improve on the original k-nearest neighbour (KNN) classification algorithm by explicitly modelling uncertainty in the classification of each feature vector. However, both KNN and PKNN share a common problem: selecting the optimal number of neighbours, k. The contribution of this paper is to incorporate the uncertainty in k into the decision making and, in so doing, to use Bayesian model averaging to provide improved classification. Indeed, assessing the uncertainty in k can be viewed as a statistical model selection problem, one of the most important technical issues in statistics and machine learning. In this paper, a new functional approximation algorithm is proposed to reconstruct the density of the model (order) without relying on time-consuming Monte Carlo simulations. In addition, the algorithm avoids cross-validation by adopting a Bayesian framework. The algorithm performed very well on several real experimental datasets.
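To make the idea of averaging over k concrete, the following is a minimal sketch and not the functional approximation algorithm proposed in the paper. It assumes one common form of the PKNN pseudo-likelihood, in which the probability of class c for a point is proportional to exp(beta / k times the number of its k nearest neighbours labelled c). It also assumes a fixed interaction parameter beta, a uniform prior over a small grid of candidate k values, and brute-force enumeration of that grid. All function names and the toy data are hypothetical.

```python
import numpy as np

def class_log_probs(neigh_labels, k, beta, n_classes):
    """Log class probabilities from the labels of each point's sorted neighbours."""
    nearest = neigh_labels[:, :k]
    counts = np.stack([(nearest == c).sum(axis=1) for c in range(n_classes)], axis=1)
    logits = beta * counts / k
    # Numerically stable log-softmax over classes
    m = logits.max(axis=1, keepdims=True)
    return logits - (m + np.log(np.exp(logits - m).sum(axis=1, keepdims=True)))

def posterior_over_k(X, y, ks, beta, n_classes):
    """Normalised weights over k from a leave-one-out pseudo-likelihood (uniform prior assumed)."""
    d = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=2)
    np.fill_diagonal(d, np.inf)  # a point is never its own neighbour
    neigh_labels = y[np.argsort(d, axis=1)]
    log_post = np.array([
        class_log_probs(neigh_labels, k, beta, n_classes)[np.arange(len(y)), y].sum()
        for k in ks
    ])
    w = np.exp(log_post - log_post.max())
    return w / w.sum()

def predict_bma(X, y, X_new, ks, beta, n_classes):
    """Average the per-k predictive class probabilities using the weights over k."""
    weights = posterior_over_k(X, y, ks, beta, n_classes)
    d = ((X_new[:, None, :] - X[None, :, :]) ** 2).sum(axis=2)
    neigh_labels = y[np.argsort(d, axis=1)]
    probs = sum(
        w * np.exp(class_log_probs(neigh_labels, k, beta, n_classes))
        for w, k in zip(weights, ks)
    )
    return probs, weights

# Toy data (hypothetical): two well-separated Gaussian clusters
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 0.5, (20, 2)), rng.normal(3.0, 0.5, (20, 2))])
y = np.array([0] * 20 + [1] * 20)
X_new = np.array([[0.0, 0.0], [3.0, 3.0]])

probs, weights = predict_bma(X, y, X_new, ks=[1, 3, 5, 7], beta=2.0, n_classes=2)
print(weights)  # weight assigned to each candidate k
print(probs)    # model-averaged class probabilities for X_new
```

The grid enumeration here stands in for whatever estimate of the density over k is available. The point of the sketch is only that the final class probabilities are a weighted mixture across several values of k rather than the output of a single k chosen by cross-validation.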

research
04/08/2008

On the underestimation of model uncertainty by Bayesian K-nearest neighbors

When using the K-nearest neighbors method, one often ignores uncertainty...
research
03/25/2023

Measuring Classification Decision Certainty and Doubt

Quantitative characterizations and estimations of uncertainty are of fun...
research
12/13/2018

Local Probabilistic Model for Bayesian Classification: a Generalized Local Classification Model

In Bayesian classification, it is important to establish a probabilistic...
research
12/29/2021

Model Averaging for Support Vector Machine by J-fold Cross-Validation

Support vector machine (SVM) is a classical tool to deal with classifica...
research
05/23/2023

Clustering Indices based Automatic Classification Model Selection

Classification model selection is a process of identifying a suitable mo...
research
02/26/2018

Estimation of Local Degree Distributions via Local Weighted Averaging and Monte Carlo Cross-Validation

Owing to their capability of summarising interactions between elements o...
research
04/09/2020

k-Nearest Neighbour Classifiers – 2nd Edition

Perhaps the most straightforward classifier in the arsenal or machine le...
