Dear Statalist,
I'm working with a panel data set consisting of 1.134.384 individuals tracked over different periods of time between 1970 and 2002 among different nations (ex. France, Germany Belgium and so on). I have a variable (6 values) about i's church attendence. I generated a variable which for each year and for each nations, records the percentage of individuals that declare to attend church functions several times in a week (value 1 of variable) once a week (value 2 of variable) and so on. So, considering two year, one nation (ex. France) my data set looks like that:
year nation2 churchat pc_churchat
1970 france SEV TIME 3.763441
1970 france SEV TIME 3.763441
1970 france SEV TIME 3.763441
1970 france SEV TIME 3.763441
1970 france SEV TIME 3.763441
.
.
.
1970 france ONCE A W 19.25709
1970 france ONCE A W 19.25709
1970 france ONCE A W 19.25709
1970 france ONCE A W 19.25709
1970 france ONCE A W 19.25709
.
.
.
1971 france SEV TIME 1.24105
1971 france SEV TIME 1.24105
1971 france SEV TIME 1.24105
1971 france SEV TIME 1.24105
1971 france SEV TIME 1.24105
.
.
.
1971 france ONCE A W 18.47255
1971 france ONCE A W 18.47255
1971 france ONCE A W 18.47255
1971 france ONCE A W 18.47255
1971 france ONCE A W 18.47255
I have two problems:
1) I want to creare a table that for each year (row imput) and for each nation (column imput) represents the cumulative probability of variable churchat for the only first two values (several time and once a w.). In the example,
year France
1970 3.763441+19.25709= 23.02
1971 1.24105 +18.47255= 19.71
.
.
.
2) For each nation, I want to graph the cumulative probability function of the only first two values of variable (several time and once a w.) over year. In the other terms, I want to graph the column of the previous table.
Thank for the attention.
I'm working with a panel data set consisting of 1.134.384 individuals tracked over different periods of time between 1970 and 2002 among different nations (ex. France, Germany Belgium and so on). I have a variable (6 values) about i's church attendence. I generated a variable which for each year and for each nations, records the percentage of individuals that declare to attend church functions several times in a week (value 1 of variable) once a week (value 2 of variable) and so on. So, considering two year, one nation (ex. France) my data set looks like that:
year nation2 churchat pc_churchat
1970 france SEV TIME 3.763441
1970 france SEV TIME 3.763441
1970 france SEV TIME 3.763441
1970 france SEV TIME 3.763441
1970 france SEV TIME 3.763441
.
.
.
1970 france ONCE A W 19.25709
1970 france ONCE A W 19.25709
1970 france ONCE A W 19.25709
1970 france ONCE A W 19.25709
1970 france ONCE A W 19.25709
.
.
.
1971 france SEV TIME 1.24105
1971 france SEV TIME 1.24105
1971 france SEV TIME 1.24105
1971 france SEV TIME 1.24105
1971 france SEV TIME 1.24105
.
.
.
1971 france ONCE A W 18.47255
1971 france ONCE A W 18.47255
1971 france ONCE A W 18.47255
1971 france ONCE A W 18.47255
1971 france ONCE A W 18.47255
I have two problems:
1) I want to creare a table that for each year (row imput) and for each nation (column imput) represents the cumulative probability of variable churchat for the only first two values (several time and once a w.). In the example,
year France
1970 3.763441+19.25709= 23.02
1971 1.24105 +18.47255= 19.71
.
.
.
2) For each nation, I want to graph the cumulative probability function of the only first two values of variable (several time and once a w.) over year. In the other terms, I want to graph the column of the previous table.
Thank for the attention.
Comment