I'm doing an econometrics project in Stata where I'm looking at the impact of an individual's BMI (body mass index) and height on their earnings. The height variable is called height5 because I divided height by 5, so that I can interpret it by saying "being 5 inches taller increases your probability of being in earnings category Y by X percentage points". The data set doesn't have a continuous earnings variable, only a categorical one; for example, category 3 is >$30000 and <$40000. I have found that height increases the likelihood of someone being in the highest income category. However, I want to know whether this relationship is stronger for men, using an interaction variable, and I'm not sure how to add one.
Currently my oprobit looks like this
oprobit earncat height5 BMI age i.sex i.mrd i.cworker i.race
I'm unsure what to add to the end to create an interaction, and furthermore, how to interpret it.
Ideally I would be able to say something like "being taller as a man is more advantageous than being a woman in terms of likelihood to land in the higher earnings category" or something of the sort.
Once I can do this, I can add further interactions, such as looking at whether the relationship between height and earnings varies across occupations.
It's been a while since I studied ordered and multinomial probits, so I may be missing something, but I think that, programmatically at least, this is easily handled by Stata's factor variable notation (see `help factor variables`). In the example below, I create an interaction between the categorical variable `foreign` (analogous to your `sex`) and the continuous variable `mpg` (analogous to your `height5`).

Do you then need to use the `margins` command to get interpretable marginal effects? See `help margins` or these slides from Richard Williams. As I say, it's been a while since I looked at these models, so I don't know if this is necessary or correct in your context, but I'd think that one way to get these marginal effects, if you need them, would be to use `margins, dydx(mpg) at(foreign=(0 1))` after estimating the probit, so that you can get the marginal effect for each category separately for domestic and foreign. Another way to do this, which gives different results, is `margins, dydx(mpg) over(foreign)`. As to which method is better suited to your context, you can find some discussion here.
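For reference, here is a minimal sketch of the kind of example described above, using Stata's built-in `auto` data. The choice of `rep78` as the ordered outcome is my own assumption for illustration, and the commented-out lines at the end are a hypothetical analogue for the model in the question, assuming `sex` is coded 0/1.

```stata
* Load Stata's built-in automobile example data
sysuse auto, clear

* Ordered probit with an interaction between a continuous variable (mpg)
* and a categorical variable (foreign).
* c.mpg##i.foreign expands to: mpg, foreign, and the mpg x foreign interaction.
* rep78 (repair record, 1-5) is used as the ordered outcome purely for illustration.
oprobit rep78 c.mpg##i.foreign

* Average marginal effect of mpg on the outcome probabilities,
* computed with foreign set to 0 and then to 1 for every observation
margins, dydx(mpg) at(foreign=(0 1))

* Alternative: average marginal effect of mpg computed separately
* within the domestic and the foreign subsamples
margins, dydx(mpg) over(foreign)

* Hypothetical analogue for the question's model (not run here;
* assumes sex is coded 0/1):
* oprobit earncat c.height5##i.sex BMI age i.mrd i.cworker i.race
* margins, dydx(height5) at(sex=(0 1))
```

Depending on your Stata version, `margins` may report only one outcome category by default; if so, adding something like `predict(outcome(5))` should let you pick the category you care about (for example, the highest earnings category).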