KNN classifier algorithm not working for all cases

2.2k Views Asked by user1691399 At 06 June 2025 at 02:40

To effectively find n nearest neighbors of a point in d-dimensional space, I selected the dimension with greatest scatter (i.e. in this coordinate differences between points are largest). The whole range from minimal to maximal value in this dimension was split into k bins. Each bin contains points which coordinates (in this dimensions) are within the range of that bin. It was ensured that there are at least 2n points in each bin. The algorithm for finding n nearest neighbors of point x is following:

Identify bin kx,in which point x lies(its projection to be precise).
Compute distances between x and all the points in bin kx.
Sort computed distances in ascending order.
Select first n distances. Points to which these distances were measured are returned as n nearest neighbors of x.

This algorithm is not working for all cases. When algorithm can fail to compute nearest neighbors? Can anyone propose modification of the algorithm to ensure proper operation for all cases?

Original Q&A

There are 3 best solutions below

Dino On 14 November 2014 at 22:55

This is failing because (any of) the k nearest neighbors of x could be in a different bin than x.

Vihari Piratla On 24 December 2014 at 12:14

What do you mean by "not working"? You do understand that, what you are doing is only an approximate method.
Try normalising the data and then choosing the dimension, else scatter makes no sense.

The best vector for discrimination or for clustering may not be one of the original dimensions, but any combination of dimensions.
Use PCA (Principal Component Analysis) or LDA (Linear Discriminant Analysis), to identify a discriminative dimension.

Ravi On 27 July 2018 at 23:05

Where KNN failure:

If the data is a jumble of all different classes then knn will fail because it will try to find k nearest neighbours but all points are random
outliers points

Let's say you have two clusters of different classes. Then if you have a outlier point as query, knn will assign one of the classes even though the query point is far away from both clusters.

KNN classifier algorithm not working for all cases

There are 3 best solutions below

Related Questions in ALGORITHM

Related Questions in CLASSIFICATION

Related Questions in PATTERN-RECOGNITION

Related Questions in KNN

Trending Questions

Popular # Hahtags

Popular Questions