Scikit-image read satellite image from tif and drop all rows/columns with a nodata value

757 Views Asked by At

I'm attempting to run a thresholding algorithm on a satellite-derived raster stored as a tif. The no data values (-3.40282306e+38) that occur on the borders of the scene are being considered in the algorithm and is causing undesired behaviour. I'm looking for a way to remove all rows/columns with a no data value prior to running the thresholding algorithm.

I've attempted to set all cells with the no data value to np.nan then dropping them but I'm not getting the results I'm looking for.

My code:

import numpy as np
from skimage import filters
from skimage import exposure
from skimage.io import imread, show
from skimage.filters import try_all_threshold

no_data_value = -3.40282306e+38

ndwi = imread(<'my.tif'>)
ndwi[ndwi == no_data_value] = np.nan
ndwi = ndwi[~np.isnan(ndwi).any(axis=1)]
val = filters.threshold_otsu(ndwi)
3

There are 3 best solutions below

0
On BEST ANSWER

filters.threshold_otsu does not care about the shape of the array passed in, so you can also do:

threshold_otsu(z[z > -1])
0
On

the values of my raster range from -1 to 1 so changing

ndwi[ndwi == no_data_value] = np.nan

to

ndwi[ndwi <= -1] = np.nan

gives me my desired results. Seems a bit hack but works.

1
On

Two things,

It is possible that your values are not exactly -3.40282306e+38. If nothing else is that large of a negative value I would maybe change the filter to be something like -3e+38.

Also I think you are missing your array at the beginning of the "is not null" filter. This line right here "ndwi = nwdi[~np.isnan(ndwi)]"

no_data_value = -3.0e+38
ndwi = imread(<my.tif>)
ndwi[ndwi <= no_data_value] = np.nan
ndwi = nwdi[~np.isnan(ndwi)]
val = filters.threshold_otsu(ndwi)