hello everyone,
I am unsure which type of regression to use. I have a dataset sorted by firm and a dependent variable, that is called "engagement" - the variable has multiple possible categories: no engagement; customer, supplier or partner or a combination of any two or three of the outcomes. Here are two examples:
Firm: BOSTON SCIENTIFIC CORP
Engagement: customers; suppliers
Firm: GOODYEAR TIRE & RUBBER CO
Engagement: partners
The initial idea was an mlogit (the engagements are not naturally ordered) where "no engagement" was the base outcome, where I had 7 categories (as there are 7 different outcome combinations of the Enagement variable), but as some categories overlapped, the IIA assumption was violated.
I have now separated the outcomes in the following way:
gen customers = strpos(Engagement, "customers") > 0
gen partners = strpos(Engagement, "partner") > 0
gen suppliers = strpos(Engagement, "suppliers") > 0
gen number_partners = customers + partners + suppliers
I thought of using a nested model or a mixed logit model, but I am not sure how to actually integrate the engagement variable as my dependent variable, as I would like to understand in the end which category and which amount of partners, and maybe also which combination of partners has the highest rrr for my independent variable (which is binary).
I appreciate any advice or code, thanks a lot
Kim
I am unsure which type of regression to use. I have a dataset sorted by firm and a dependent variable, that is called "engagement" - the variable has multiple possible categories: no engagement; customer, supplier or partner or a combination of any two or three of the outcomes. Here are two examples:
Firm: BOSTON SCIENTIFIC CORP
Engagement: customers; suppliers
Firm: GOODYEAR TIRE & RUBBER CO
Engagement: partners
The initial idea was an mlogit (the engagements are not naturally ordered) where "no engagement" was the base outcome, where I had 7 categories (as there are 7 different outcome combinations of the Enagement variable), but as some categories overlapped, the IIA assumption was violated.
I have now separated the outcomes in the following way:
gen customers = strpos(Engagement, "customers") > 0
gen partners = strpos(Engagement, "partner") > 0
gen suppliers = strpos(Engagement, "suppliers") > 0
gen number_partners = customers + partners + suppliers
I thought of using a nested model or a mixed logit model, but I am not sure how to actually integrate the engagement variable as my dependent variable, as I would like to understand in the end which category and which amount of partners, and maybe also which combination of partners has the highest rrr for my independent variable (which is binary).
I appreciate any advice or code, thanks a lot

Kim
Comment