Hi everyone
I'm working with the attached database: https://drive.google.com/open?id=1j0...6beozV2ShTmAvB
I'm analysing the factors that affect vehicle accidents in different regions.
The first 52 variables are regions (communes in this county): each one =1 if the accident occurred in that region (and of course the other 51 are 0).
The next 11 are the factors of the accident:
alcoh =1 if accident is caused by a drunk driver
impr =1 if accident is caused by an imprudent driver
rdef =1 if accident is caused by road deficiencies
drugs =1 if accident is caused by a drugged driver
The 4 factors above are the main causes and are mutually exclusive: if the accident was caused by alcoh (=1), then impr, rdef and drugs are 0.
The remaining ones are independent of each other:
uzone =1 if accident occurred in urban zone
sexm =1 if male
age age of the driver
popul population per commune
numlic number of driver licences per commune
invexpn investment expenditures per commune
dlight =1 if accident occurred in daylight
My questions are:
a) I want to create a table in Stata where I can analyse or show the 52 regions against the 11 factors. How can I do this?
I tried with one region (santiago), but Stata reports this:
reg santiago alcoh impr rdef drugs uzone sexm age popul numlic invexpn dlight
note: rdef omitted because of collinearity
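In case it helps, here is a minimal sketch of how I think the collinearity could be checked, assuming (as described above) that every accident has exactly one of the four main causes. ncauses is just a helper variable name I made up for this check; all other names are from my dataset.

```stata
* Count how many of the four main-cause dummies equal 1 in each observation
* (ncauses is a hypothetical helper variable, not part of the dataset)
egen ncauses = rowtotal(alcoh impr rdef drugs)
tabulate ncauses

* If ncauses is 1 for every observation, the four dummies sum to the constant,
* so one of them cannot be estimated alongside the intercept.
* Leaving rdef out of the variable list makes it the base category
* and keeps all observations in the sample:
regress santiago alcoh impr drugs uzone sexm age popul numlic invexpn dlight
```

If that is what is happening, I would expect the coefficients on alcoh, impr and drugs to be read relative to accidents caused by road deficiencies, but I am not sure this is the right interpretation.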
b) Should I drop the variable rdef and the observations that have rdef = 1?
Thanks