Counting all words in a column of a pandas dataset

279 Views Asked by Adekunle Agboke At 17 August 2025 at 20:42

I am carrying out EDA on a dataset and want to count the total number of words in a column, before and after deleting duplicates.

Here is my code:

print(train_dataset['text'].apply(lambda x: len(x.split(' '))).sum())

It is throwing this error:

AttributeError: 'float' object has no attribute 'split'


There is 1 best solution below

gremur On 05 March 2022 at 19:53

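The error suggests that at least one value in the column is a float rather than a string, most likely a missing value (NaN). You could convert the column values to strings before splitting: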

train_dataset['text'] = train_dataset['text'].astype(str)
train_dataset['text'].apply(lambda x: len(x.split())).sum()
# or
train_dataset['text'].apply(lambda x: len(str(x).split())).sum()
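
One caveat with the conversion above: str() turns a missing value into the text 'nan', which then gets counted as one word. If that matters for your totals, a minimal sketch along the following lines may help. It uses the pandas .str accessor, which leaves missing values as NaN so they can be counted as zero words, and it computes the total before and after dropping duplicates, as asked in the question. The sample DataFrame and the helper name total_words are made up for illustration; the real train_dataset is not shown in the question.

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for train_dataset; the real data is not shown in the question.
train_dataset = pd.DataFrame(
    {"text": ["hello world", "hello world", np.nan, "one  two three"]}
)


def total_words(column: pd.Series) -> int:
    """Sum whitespace-separated word counts, treating missing values as zero words."""
    # .str.split() with no argument splits on any run of whitespace;
    # missing values stay NaN instead of raising AttributeError.
    counts = column.str.split().str.len()
    return int(counts.fillna(0).sum())


# Total over every row, duplicates included.
words_before = total_words(train_dataset["text"])

# Total after removing duplicate rows of the column.
words_after = total_words(train_dataset["text"].drop_duplicates())

print(words_before, words_after)
```

Whether missing rows should count as zero words or be dropped first is a judgment call that depends on the dataset; calling dropna() on the column before counting would give the same totals here.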
