Announcement

Collapse
No announcement yet.
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • Levels Problem in Regression

    I hope you are doing well.

    I am facing a problem with levels in model specification. I have an independent variable at the restaurant level and an outcome variable at the health district level. There are many restaurants at the health district level. Is there any statistical approach/regression model that corrects for such levels issue? I would appreciate your help.

    Moeen

  • #2
    are restaurants nested within health district levels (i.e., does each restaurant appear in one and only one health district)?

    Comment


    • #3
      Moeen:
      we need to know a bit more about your dataset.
      I'm not clear if your data can be analysed with OLS invoking a clustered standard errors at health district level or if you can take the whole stuff a notch up with -mixed- (if you have a time variable).
      Kind regards,
      Carlo
      (Stata 19.0)

      Comment


      • #4
        Originally posted by Rich Goldstein View Post
        are restaurants nested within health district levels (i.e., does each restaurant appear in one and only one health district)?
        Yes, restaurants are nested within health districts.

        Comment


        • #5
          Originally posted by Carlo Lazzaro View Post
          Moeen:
          we need to know a bit more about your dataset.
          I'm not clear if your data can be analyzed with OLS invoking a clustered standard errors at health district level or if you can take the whole stuff a notch up with -mixed- (if you have a time variable).
          My dataset comprises restaurants level inspections. The restaurants are located in health districts. There are total 20,000 restaurants located in 26 health districts. My independent variable is restaurant inspections (i.e., number of times a restaurant is inspected per year). My outcome variable is Foodborne illnesses, which is at the health district level. Therefore there is a level issue between my independent and outcome variables.

          Comment


          • #6
            Moeen:
            do your data refer to one wave of data (that is, the 20K restaurants are inspected only once during a given year; cross-sectional data) or to different ones (that is the same restaurants are inspected in different occasions, equally spaced, more or less, across time (panel data)?
            Kind regards,
            Carlo
            (Stata 19.0)

            Comment


            • #7
              Hi Carlo, I have four years of data (panel data). Restaurants can be inspected multiple times per year. Some restaurants are inspected once per year, whereas some are inspected three or more times per year.

              Comment


              • #8
                Moeen:
                assuming that your regressand is categorical (Foodborne illnesses yes/no), you may want to consider -xtlogit- with (conditional) fixed effect.
                As Stata is likely to complain about -repeated time values- you can -xtset- your data with -panelid- only (restaurant, I assume), being aware that this fix comes at the cost of making time-series operators (such as lags and leads) unavailable.
                Kind regards,
                Carlo
                (Stata 19.0)

                Comment

                Working...
                X