img = Image.open(path)
data_origin = np.asarray( img, dtype='uint8' )
data = np.array(data_origin) # 300 x 300 x 4
1)for i in range(data.shape[0]): # invert colors
for j in range(data.shape[1]):
for k in range(3): # only first three dims r g b
data[i,j,k] = abs(data[i,j,k] - 255)
2)data[:,:,:3] = np.abs(data[:,:,:3] - 255) # why does it work worse?
''''
applying max pooling
...
end
When I transform the array into an image and look at it, the image, that have been inverted by first method, has better quality then inverted by second one. And I cannot understand why. Can you help me?
One bug to look out for performing both methods on the same
data
object. However, I've tried your methods on my own images, and the results were incorrect.You can fix the behavior in method two by rephrasing the calculation to
255 - data[:,:,:3]
. This solution assumes that your images have a max values of (255, 255, 255) for (R,G,B).You can verify this method by running the code
If you want to understand why the code is behaving this way, it's because your code
data_origin
array is datatypenp.uint8
.uint8
is an unsigned integer with 8 bits meaning it can only be values from [0,255]. Subtracting a uint8 by 255 doesn't result in a negative number, but an overflow of x - 255.For example,
Even though
np.abs(data[:,:,:3] - 255)
should behave like255 - data[:,:,:3]
(becausef(x) = abs(x-255)
equalsf(x) = 255 -x
for domain in range [0, 255]), the datatype makes this conversion incorrect.Another fix to this code would be to replace
dtype='uint8'
todtype='int32'
(because int32 allows for negative values). However, I don't recommend this solution because int32 is much larger than uint8.