Hi everyone,
I’m conducting an analysis on the impact that receiving a Michelin star has on the opening of new business ventures associated with a restaurant. The database covers the full history of each restaurant, from its opening year (which varies by restaurant) to the present. It includes the number of associated businesses in each year, the year (if any) in which the restaurant received a star, its location, and the type of ownership. There’s also a 1:1 control group consisting of restaurants that were never included in the guide but are otherwise similar to the treated ones.
At the moment, I’m considering a Difference-in-Differences (DiD) approach. I started with a classical 2x2 DiD, using a window of two years before (to account for potential anticipation effects) and five years after the treatment for each restaurant. However, this approach is overly simplistic since the year of treatment (i.e., when the star is awarded) varies across restaurants, which introduces well-known identification issues. I'm therefore considering the Callaway and Sant’Anna ATT estimator, which allows for an event-study-style analysis and better handles the staggered nature of the treatment.
My main concerns revolve around
• the staggered timing
• the unbalanced nature of the panel some restaurants have data covering the full observation period (e.g., one opened in 1960 with treatment in 2009 has data from 2000 to 2024), while others like one opened in 2007 lack earlier years. I can't simply fill in missing pre-opening years with zeros for diversification, as that would bias the analysis.)
• the dependent variable: the number of business ventures is cumulative, meaning it either increases or remains constant. One possible solution is to use the year-over-year difference, but the numbers are very small, and I’m worried about losing meaningful signals.
Any suggestions or references to similar work would be very welcome.
I’m conducting an analysis on the impact that receiving a Michelin star has on the opening of new business ventures associated with a restaurant. The database covers the full history of each restaurant, from its opening year (which varies by restaurant) to the present. It includes the number of associated businesses in each year, the year (if any) in which the restaurant received a star, its location, and the type of ownership. There’s also a 1:1 control group consisting of restaurants that were never included in the guide but are otherwise similar to the treated ones.
At the moment, I’m considering a Difference-in-Differences (DiD) approach. I started with a classical 2x2 DiD, using a window of two years before (to account for potential anticipation effects) and five years after the treatment for each restaurant. However, this approach is overly simplistic since the year of treatment (i.e., when the star is awarded) varies across restaurants, which introduces well-known identification issues. I'm therefore considering the Callaway and Sant’Anna ATT estimator, which allows for an event-study-style analysis and better handles the staggered nature of the treatment.
My main concerns revolve around
• the staggered timing
• the unbalanced nature of the panel some restaurants have data covering the full observation period (e.g., one opened in 1960 with treatment in 2009 has data from 2000 to 2024), while others like one opened in 2007 lack earlier years. I can't simply fill in missing pre-opening years with zeros for diversification, as that would bias the analysis.)
• the dependent variable: the number of business ventures is cumulative, meaning it either increases or remains constant. One possible solution is to use the year-over-year difference, but the numbers are very small, and I’m worried about losing meaningful signals.
Any suggestions or references to similar work would be very welcome.
Comment