I am reading a large CSV file. To set the data types properly, I first read a sample of just 5 rows and look at the dtypes pandas has inferred. I then want to hand-edit these dtypes and pass them to read_csv when reading the full file.
However, when I run df1.dtypes.to_dict(), pandas produces this:
{'Invoice Date': dtype('O'),
'Invoice ID': dtype('O'),
'Item ID': dtype('float64'),
'Line Amount': dtype('float64'),
'Line Amount Tax': dtype('float64')
}
I don't want the dtype(...) wrapper, because I get an error when I paste this output into a temporary dict that I then edit. So I copy the output into VS Code and use a regex to extract the following:
{'Invoice Date': O,
'Invoice ID': O,
'Item ID': float64,
'Line Amount': float64,
'Line Amount Tax': float64
}
Is there any way to get this directly in pandas?
You can convert the values to strings and, for object, use Series.replace:
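A minimal sketch of that approach, assuming a small in-memory CSV in place of the real file (the sample data and the names csv_text and dtype_names are made up for illustration):

```python
import io

import pandas as pd

# Hypothetical sample data standing in for the real CSV file.
csv_text = (
    "Invoice Date,Invoice ID,Item ID,Line Amount,Line Amount Tax\n"
    "2020-01-01,A1,1.0,10.5,1.05\n"
    "2020-01-02,A2,,20.0,2.00\n"
)

# Sample read of the first few rows, as in the question.
df1 = pd.read_csv(io.StringIO(csv_text), nrows=5)

# Turn each inferred dtype into its string name, then shorten "object" to "O".
dtype_names = df1.dtypes.astype(str).replace("object", "O").to_dict()
print(dtype_names)

# The edited dict can be passed back to read_csv for the full read.
df = pd.read_csv(io.StringIO(csv_text), dtype=dtype_names)
```

The values are plain strings such as 'float64' rather than bare names, so the printed dict can be pasted and edited as valid Python, and read_csv accepts these strings in its dtype argument. The exact name reported for text columns may depend on the pandas version.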