Dear all,
I'm working on my master's thesis and am looking at the effects of culture (individualism vs. collectivism) on vaccination acceptance across European countries. I am using panel data and a random effects model and data ranges from februari 2020 - februari 2022. This is due to the fact that I use both static (cultural) and dynamic (COVID_19 indicators) variables in my research. My dependent variable is "fully vaccinated people per hundred". When looking at its histogram and quantile, it seems to be heavy-tailed.. I am wondering what I can do to make sure my results make sense. Can I transform the dependent variable in any way? Or do you recommend something else?
See my histogram and quantile below:

My random effects model can be seen below:
Individualism: Every country is rated between 0-100 (Hofstede index) (static)
Excess mortality: Logged, weekly data, calculated as: excess deaths = reported deaths - expected deaths
GDP per capita: Logged (static)
Population: Logged (static)
Government trust: Percentage (static)
Median age: Absolute number (static)
DO-FILE:
clear
// ssc inst asdoc
import excel "/Users/Administrator/Documents/MASTERTHESISDATA.xlsx", sheet("TRY-OUT") firstrow
// set time
gen sdate = date(date, "YMD")
format sdate %td
rename location country
encode country, gen(scountry)
xtset scountry sdate
// change variables
gen logpopulation = ln(population)
gen loggdppercapita = ln(gdp_per_capita)
rename total_vaccinations_per_hundred vaccinationsperhundred
gen logexcessmortality = ln(excess_mortality)
// xtline
xtline vaccinationsperhundred
xtline vaccinationsperhundred, overlay
// reg
xtreg people_fully_vaccinated_per_hund Individualism logexcessmortality loggdppercapita logpopulation Governmenttrust median_age, re
Please mind: I am still finding my way when it comes to this analysis. Other suggestions regarding other variables / research methods are welcome as well.
I'm working on my master's thesis and am looking at the effects of culture (individualism vs. collectivism) on vaccination acceptance across European countries. I am using panel data and a random effects model and data ranges from februari 2020 - februari 2022. This is due to the fact that I use both static (cultural) and dynamic (COVID_19 indicators) variables in my research. My dependent variable is "fully vaccinated people per hundred". When looking at its histogram and quantile, it seems to be heavy-tailed.. I am wondering what I can do to make sure my results make sense. Can I transform the dependent variable in any way? Or do you recommend something else?
See my histogram and quantile below:
My random effects model can be seen below:
Individualism: Every country is rated between 0-100 (Hofstede index) (static)
Excess mortality: Logged, weekly data, calculated as: excess deaths = reported deaths - expected deaths
GDP per capita: Logged (static)
Population: Logged (static)
Government trust: Percentage (static)
Median age: Absolute number (static)
DO-FILE:
clear
// ssc inst asdoc
import excel "/Users/Administrator/Documents/MASTERTHESISDATA.xlsx", sheet("TRY-OUT") firstrow
// set time
gen sdate = date(date, "YMD")
format sdate %td
rename location country
encode country, gen(scountry)
xtset scountry sdate
// change variables
gen logpopulation = ln(population)
gen loggdppercapita = ln(gdp_per_capita)
rename total_vaccinations_per_hundred vaccinationsperhundred
gen logexcessmortality = ln(excess_mortality)
// xtline
xtline vaccinationsperhundred
xtline vaccinationsperhundred, overlay
// reg
xtreg people_fully_vaccinated_per_hund Individualism logexcessmortality loggdppercapita logpopulation Governmenttrust median_age, re
Please mind: I am still finding my way when it comes to this analysis. Other suggestions regarding other variables / research methods are welcome as well.

Comment