Announcement

Collapse
No announcement yet.
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • Problem with specifying count data models

    Dear Statalists,

    I am performing nonlinear regression analysis and despite using several books on microeconometrics and count data my econometric knowlegde has reached its limits.
    My topic: I want to estimate the causal effect of increased alcohol availability on suicides. I have suicide counts for all counties in one U.S state for several years. My explanatory variable for alcohol availability is the number of on-premise licenses per 1,000 population in each county and year. Further control variables for each county and year are divorce rate, real income per capita, percentage of male population, percentage of white population, percentage of population belonging to age group 15-24, percentage of population belonging to age group 35-44, population density and a categorical variable for the urbanicity level of the county. I also have dummies for county-FE, year-FE, county-specific FE (county x year) and use clustered standard errors. On-premise licenses as proxy for alcohol availability is endogenous, so I plan to use an instrument from a natural experiment which causes exogenous increases in the number of licences for my first stage. I adapted the instrument and parts of my model specification from "Wet laws, drinking Establishments and violent crime by Anderson et al. (2016). Before I include the instrumented on-premise variable, I wanted to discuss the eligible count models for the dependent suicide variable in order to justify why I chose a specific count model for my second stage. After that, I wanted to think about how to combine the IV approach with my count model of choice.
    My starting point is a regular Poisson-MLE:
    poisson suicides on_premise1_per1000 unemployment realinc_1000 divorcerate per_male per_white per_age1 per_age2 urb2013 pop_density i.year i.county_fips county_FE* [pweight = totalpop], cluster(county_fips)
    “totalpop” is a variable for the county population in each year.
    I then perform the overdispersion test from Cameron and Trivedi which indicates overdispersion. Overdispersion was also evident from descriptive statistics and histograms already.
    Therefore, I perform Poisson-quasi-MLE with robust standard errors:
    poisson suicides on_premise1_per1000 pop_density unemployment realinc_1000 divorcerate per_male per_white per_age1 per_age2 urb2013 i.year i.county_fips county_FE* [pweight = totalpop], vce(cluster county_fips)
    First question: Both, Poisson-MLE and Poisson quasi-MLE regressions result in exactly the same outputs, e.g. same coefficients, standard errors and p-values and I am asking myself how this is possible when one regression uses robust standard errors (vce) and the other not or if my commands are wrong.
    Because of overdispersion I tried to perform a NegBin-MLE and a NegBin quasi-MLE with robust errors to compare the results and model fit with Poisson quasi-MLE:
    nbreg suicides on_premise1_per1000 pop_density unemployment realinc_1000 divorcerate per_male per_white per_age1 per_age2 urb2013 i.year i.county_fips county_FE* [pweight = totalpop], cluster(county_fips)
    nbreg suicides on_premise1_per1000 pop_density unemployment realinc_1000 divorcerate per_male per_white per_age1 per_age2 urb2013 i.year i.county_fips county_FE* [pweight = totalpop], vce(cluster county_fips)
    I had to break the commands because the iterations did not stop. I already figured out that this is caused by the inclusion of the three FE dummies.
    Second question: From other threads I partially understood that the nbreg with FE dummies could suffer from incidential parameter problem and inconsistent estimation while xtnbreg does not really consider FE in the right way. One recommendation was to stick with Poisson quasi-MLE and to forgo these two negbin models. Is this right and does this justify to not perform nbreg or xtnbreg apart from that the regressions do not work anyway?
    Third question: So far I used pooled regression but does it make sense to consider panel estimators such as xtpoisson in general? I was asking myself the same question in case of zero-inflated or Hurdle models, does it make sense to apply these models in my context?

    I appreciate any suggestion or advice you might have, thank you in advance!




  • #2
    Dear Joana Bennet,

    First question - You are actually using clustered standard errors in both specifications and that is why the results are the same.
    Second question - From what you describe of what you want to do, I would just use Poisson regression with FE and suitable standard errors; it is much more robust and reliable than the NB-based estimators.
    Third question - From what you describe, it looks as if you are already using FE by including the dummies in the model; xtpoisson with fe should give you the same results. Only you can know whether using FE is the right thing to do in your case.

    Best wishes,

    Joao

    Comment


    • #3
      Thank you for your reply Joao!
      I will have to think about your suggestions now.

      Comment


      • #4
        Hello Joao,

        Your recommendations gave me the right direction, I think I could clear my confusion on count models in combination with panel structure and FE or RE. I will use the poisson command with fixed effect dummies as you recommended.
        I have a further question on my model specification and would appreciate your advice if possible. I am thinking about whether to use a model with or without county-specific time trends (additionally to the county-FE and time-FE) and whether to perform weighted regressions or not. The authors of the work I am relying to weighted all their regressions and descriptive statistics with the county population. I understand the idea behind that but also read in Cameron&Trivedi (2008) that weighting should be used when efficiency can be improved and that microeconemtric analysis often do not use weighting. How can I assess if county-specific time trends and weighting should be used? Are log-likelohood, AIC and BIC appropriate measures to decide between this model versions? I performed all analysis with and without weighting and with and without county-specific time trends and achieve different outputs for all versions, so it seams relevant to decide on this question.

        Thank you very much,
        Joana

        Comment

        Working...
        X