I supposed that
data[data.agefm.isnull()]
and
data[data.agefm == numpy.nan]
are equivalent. But no, the first truly returns rows where agefm
is NaN, but the second returns an empty DataFrame. I thank that omitted values are always equal to np.nan
, but it seems wrong.
agefm
column has float64 dtype:
(Pdb) data.agefm.describe()
count 2079.000000
mean 20.686388
std 5.002383
min 10.000000
25% 17.000000
50% 20.000000
75% 23.000000
max 46.000000
Name: agefm, dtype: float64
What does data[data.agefm == np.nan]
mean exactly?
np.nan
is not comparable tonp.nan
... directly.While
Could also do
examples
Filters nothing because nothing is equal to
np.nan
Filters out the null
Use odd comparison behavior to get what we want anyway. If
np.nan != np.nan
isTrue
thenJust
dropna