The Question I'm solving for: How does the proportion of immigrants in each US state impact the national market share of companies in an industry based in that state from 2002-2015? In other words, do companies that are located in states with a higher proportion of immigrants have a larger market share of their industry?
Description of my data: Panel Data at the state level
Column 1: ID– a company's unique ID code (integer)
Column 2: fyear– fiscal year (so a company's unique ID code is the same across years, years 2002-2015 for each company) (integer)
Column 3: state– state the company is located in (integer)
Column 4: sic– a two digit code that represents company's industry (integer)
Column 5: propImmigrants– that state population's proportion of immigrants in that year (float)
Column 6: marketShare– a company's market share in its industry in a specific year (float)
My Current Regression:
Right now, I first create fixed effects variables for each fyear, state and sic. Then, I run:
(this regresses a state's proportion of immigrants on the market share of a company, with the _I* representing all the fixed effects for fyear, state and sic)
MY QUESTION:
In my current regression, I'm just seeing the general impact of the proportion of immigrants on a firm's market share. However, I want to see if the proportion of immigrants are disproportionally impacting large firms or small firms. So, is there a way I can run the regression 3 times separately? Once for "small" firms, once for "medium" firms and once for "large" firms? I'm thinking the way I would define small/medium/large is how big the companys were in the initial year 2002. So, something like "only if a firm's market share is below the 33rd percentile for that industry in 2002, then run the above regression for all those firms from 2002-2015 (which I would assume you'd need to match the unique ID code somehow)". I would want to do the same for firms "with a market share between the 34th and 66th percentile in 2002" and firms "with a market share above the 66th percentile in 2002".
Thanks and let me know if something isn't clear enough.
Description of my data: Panel Data at the state level
Column 1: ID– a company's unique ID code (integer)
Column 2: fyear– fiscal year (so a company's unique ID code is the same across years, years 2002-2015 for each company) (integer)
Column 3: state– state the company is located in (integer)
Column 4: sic– a two digit code that represents company's industry (integer)
Column 5: propImmigrants– that state population's proportion of immigrants in that year (float)
Column 6: marketShare– a company's market share in its industry in a specific year (float)
My Current Regression:
Right now, I first create fixed effects variables for each fyear, state and sic. Then, I run:
Code:
reg marketShare propImmigrants _I*
MY QUESTION:
In my current regression, I'm just seeing the general impact of the proportion of immigrants on a firm's market share. However, I want to see if the proportion of immigrants are disproportionally impacting large firms or small firms. So, is there a way I can run the regression 3 times separately? Once for "small" firms, once for "medium" firms and once for "large" firms? I'm thinking the way I would define small/medium/large is how big the companys were in the initial year 2002. So, something like "only if a firm's market share is below the 33rd percentile for that industry in 2002, then run the above regression for all those firms from 2002-2015 (which I would assume you'd need to match the unique ID code somehow)". I would want to do the same for firms "with a market share between the 34th and 66th percentile in 2002" and firms "with a market share above the 66th percentile in 2002".
Thanks and let me know if something isn't clear enough.
Comment