Hi,
I'm currently working with nationally representative, but confidential, household survey data which looks at the level of ICT access and usage for randomly selected individuals within a given population. It has 1771 unique observations and can be disaggregated across various sub-criteria.
For the purposes of my study however, I would like to restrict this dataset to only look at the urban poor (according to a national income poverty line) in order to estimate their probability of being digitally poor. Given that I am effectively truncating my data and analysing a non-randomly selected sample, I was wondering if there is there any way in which I can perform a regression analysis without producing biased estimates ?
Although I am aware of the truncreg command, I'm not sure it's appropriate to use in this case since my dependent variable is not the variable I am truncating. The dependent variable for my study is a categorical variable for digital poverty, and I am truncating the sample to only include those individuals with a monthly per capita income of less than or equal to 758.
I would ideally like to run a generalised ordered logistic (gologit2) regression, but I don't want to provide misleading results. Therefore, if there is any way in which I can control for this sample selection bias, I would be extremely grateful for any guidance on how to achieve it.
Many thanks in advance for any advice provided!
I'm currently working with nationally representative, but confidential, household survey data which looks at the level of ICT access and usage for randomly selected individuals within a given population. It has 1771 unique observations and can be disaggregated across various sub-criteria.
For the purposes of my study however, I would like to restrict this dataset to only look at the urban poor (according to a national income poverty line) in order to estimate their probability of being digitally poor. Given that I am effectively truncating my data and analysing a non-randomly selected sample, I was wondering if there is there any way in which I can perform a regression analysis without producing biased estimates ?
Although I am aware of the truncreg command, I'm not sure it's appropriate to use in this case since my dependent variable is not the variable I am truncating. The dependent variable for my study is a categorical variable for digital poverty, and I am truncating the sample to only include those individuals with a monthly per capita income of less than or equal to 758.
I would ideally like to run a generalised ordered logistic (gologit2) regression, but I don't want to provide misleading results. Therefore, if there is any way in which I can control for this sample selection bias, I would be extremely grateful for any guidance on how to achieve it.
Many thanks in advance for any advice provided!
Comment