BigList = rand(20, 3)
LittleList = rand(5, 3)
I'd like to find for each row in the big list the 'closest' row in the little list, as defined by the euclidean norm (i.e. sum of squared distances between the corresponding values in the k=3 dimension).
I can see how to do this using two loops, but it seems like there ought to be a better way to do this using built in matrix operations.
Approach #1
There is a built in MATLAB function
pdist2
which finds"Pairwise distance between two sets of observations"
. With it, you can calculate the euclidean distance matrix and then find indices of minimum values along the appropriate dimension in the distance matrix that would represent the "closest" for each row ofbigList
inlittleList
.Here's the one-liner with it -
Approach #2
If you care about performance, here's a method that leverages
fast matrix multiplication in MATLAB
and most of the code presented here is taken from this smart solution.Benchmarking
Benchmarking Code -
Benchmark results -
Quick conclusions: The runtimes with Shai's second approach that was a combination of
bsxfun
and matrix multiplication were very close with the one based onpdist2
and no clear winner could be decided between those two.