This question came up while I was following the RAPIDS tutorial at https://docs.rapids.ai/api/cudf/nightly/user_guide/10min.html.
I have a dataframe loaded from a CSV file with the following structure:
x_tick.head()
                LocalTime     Ask     Bid  Spread
0 2004-10-25 00:01:01.975  86.837  86.877    0.04
1 2004-10-25 00:01:19.300  86.791  86.891    0.10
2 2004-10-25 00:01:30.759  86.812  86.842    0.03
3 2004-10-25 00:01:41.798  86.801  86.831    0.03
4 2004-10-25 00:01:42.213  86.794  86.824    0.03
x_tick.dtypes
LocalTime    datetime64[ns]
Ask                 float64
Bid                 float64
Spread              float64
dtype: object
My goal is to run statistical tests and graph analysis on this data, but when I follow the RAPIDS tutorial, the following call fails with the error shown below it:
x_tick.Spread.mean().compute()
TypeError: 'sub' operator not supported between <class 'cudf.core.column.numerical.NumericalColumn'> and <class 'cudf.core.column.string.StringColumn'>
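Since the error mentions a StringColumn, one thing that could be checked is whether some value in the Spread column of the CSV fails to parse as a number. This is only a guess at the cause, not something I have confirmed. A minimal sketch of such a check, using only the Python standard library; the CSV excerpt, the `n/a` value, and the helper name `find_non_numeric` are all hypothetical and only for illustration:

```python
import csv
import io

# Hypothetical CSV excerpt; the last Spread value is deliberately
# non-numeric to imitate a row that would be read as text.
raw = io.StringIO(
    "LocalTime,Ask,Bid,Spread\n"
    "2004-10-25 00:01:01.975,86.837,86.877,0.04\n"
    "2004-10-25 00:01:19.300,86.791,86.891,0.10\n"
    "2004-10-25 00:01:30.759,86.812,86.842,n/a\n"
)


def find_non_numeric(handle, column):
    """Return (line_number, value) pairs whose value does not parse as a float."""
    bad = []
    # start=2 because line 1 of the file is the header row
    for line_number, row in enumerate(csv.DictReader(handle), start=2):
        try:
            float(row[column])
        except (TypeError, ValueError):
            # TypeError covers a missing field (None), ValueError covers text
            bad.append((line_number, row[column]))
    return bad


bad_values = find_non_numeric(raw, "Spread")
print(bad_values)
```

With a real file, the `io.StringIO` object would be replaced by `open(path, newline="")`. An empty result would suggest the problem lies elsewhere.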
What's happening?
Thank you