Hi all,
I am deciding what regression model to use for my project. I am working with a panel dataset that contains deaths by suicide for 41 counties over 5 decades, as well as the occupational structure of those counties and the percentage of population that is urban in the given years. I am aiming to observe the effect of different occupations on the suicide rate as well as the level of urban population. However, my dependent variable is expressed as a percentage, so either percentage of all deaths that are from suicide, or percentage of the total population that committed suicide in a given year. From my own research, it looks like I should use a Logit regression model because my dependent variable is bounded but I am confused by this because the outcome is not either 0 or 1. If using the percentage of all deaths by suicide as the dependent variable, the maximum value of my dependent variable is 3.571429 and the minimum is 0.
If this is incorrect, could I do a linear regression model where my dependent variable is the total amount of suicides in a county, but control for the total amount of deaths or total population as I have data on both?
Any advice here? Thanks in advance!
I am deciding what regression model to use for my project. I am working with a panel dataset that contains deaths by suicide for 41 counties over 5 decades, as well as the occupational structure of those counties and the percentage of population that is urban in the given years. I am aiming to observe the effect of different occupations on the suicide rate as well as the level of urban population. However, my dependent variable is expressed as a percentage, so either percentage of all deaths that are from suicide, or percentage of the total population that committed suicide in a given year. From my own research, it looks like I should use a Logit regression model because my dependent variable is bounded but I am confused by this because the outcome is not either 0 or 1. If using the percentage of all deaths by suicide as the dependent variable, the maximum value of my dependent variable is 3.571429 and the minimum is 0.
If this is incorrect, could I do a linear regression model where my dependent variable is the total amount of suicides in a county, but control for the total amount of deaths or total population as I have data on both?
Any advice here? Thanks in advance!
Comment