I read multiple available questions about this topic, but still do not understand my problem.
I am trying to build a regression, using PyCaret
:
from pycaret.regression import *
fooPy = setup(data = foo, target = 'pts', session_id = 123)
I receive error:
TypeError: unsupported operand type(s) for +: 'int' and 'str'
Not sure where is the problem, because I do not see any strings in my structure:
pts_500 float64
pts_500_p float64
OBP_avg float64
SLG_avg float64
SB_avg float64
RBI_avg float64
R_avg float64
home int64
first_time_pitcher int32
park_ratio_OBP float64
park_ratio_SLG float64
order float64
SO_avg_p float64
pts_500_parkadj_p float64
pts_500_parkadj float64
SLG_avg_parkadj float64
OPS_avg_parkadj float64
SLG_avg_parkadj_p float64
OPS_avg_parkadj_p float64
pts_BxP float64
SLG_BxP float64
OPS_BxP float64
whip_SO_BxP float64
whip_SO_B float64
whip_SO_B_parkadj float64
order float64
ops x pts_500 order15 float64
ops x pts_500 parkadj float64
ops23 x pts_500 float64
ops x pts_500 orderadj float64
whip_p float64
whip_SO_p float64
whip_SO_parkadj_p float64
whip_parkadj_p float64
pts float64
dtype: object
home
and first_time_pitcher
are integers.
Full error looks like:
Appreciate any tips!
I found the answer myself and it was very trivial and embarassing.
Order
variable was included twice in the dataset. I checked the correlation and got 1.0 correlation between the same variables.