In my data, there is a string variable "type" that can take on values "A", "B", "C", or "D". I have encoded the "type" variable to be numeric (encode type, gen(nytpe)). I have also created dummy variables "typeA", "typeB", "typeC", and "typeD" that take on values of 0 or 1 based on the value of "type".
The regression output of the following three models is the same:
Why are the average marginal effects different? What is the correct way to treat this variable if I am interested in average marginal effects?
Additionally, later in my analysis, I will need to include complete interaction effects between "type" and a continuous variable in my dataset ("employment"), and I will need to view these marginal effects, as well.
Thanks in advance!
The regression output of the following three models is the same:
- probit depvar i.ntype covariate1 i.covariate2 i.covariate3, cl(cluster_var)
- probit depvar typeB typeC typeD covariate1 i.covariate2 i.covariate3, cl(cluster_var)
- probit depvar i.typeB i.typeC i.typeD covariate1 i.covariate2 i.covariate3, cl(cluster_var)
Why are the average marginal effects different? What is the correct way to treat this variable if I am interested in average marginal effects?
Additionally, later in my analysis, I will need to include complete interaction effects between "type" and a continuous variable in my dataset ("employment"), and I will need to view these marginal effects, as well.
Thanks in advance!
Comment