1.I had two numpy arrays which are data_test and data_train respectively
data_partial_test = data_test[:2000,:]
test_lable = label_test
print(test_lable.shape)
print(data_partial_test[0].shape)
print(data_train[0].shape)
dis = (( data_partial_test- data_train[:21000,])**2).sum(axis=1)
2.The shape of data_test is (21000,784) and the shape of data_train is(2000,784). When I run this code it said :operands could not be broadcast together with shapes (2000,784) (21000,784)
When you perform arrays subtraction, like arr_1 - arr_2, then actually Numpy attempts:
This scheme works as long as both arrays have the same number of rows and columns.
There are 3 exceptions to this rule:
Read about broadcasting in Numpy to have more detailed view on this.
In your case no of the above sitiations takes place. Both arrays have the same number of columns, but the number of rows is different. The consequence is that the above broadcasting can not be performed and the whole operation fails.
Possible solution
Maybe each row in the first array (with smaller number of rows) can be "paired up" with a row in the second array, e.g. based on some key field. Such operation can be performed in Pandas. See for join method in Pandas.
Then you can:
Then you can: