Hi everybody,
I'm currently calculating a model where I have three time points at which the same event can happen (panel data): T1 T2 T3
At each time point the observations/persons can go from 0 => 1 (latent change model)
So we have a pattern like:
T1 T2 T3
1 0 0
0 1 0
0 0 1
1 1 0
0 1 1
0 0 0
The cases that interest me and which I need for the rest of my analyses are the ones where the persons of observation go from 0 => 1.
I also created a variable that distinguishes whether the person has had a "1" in the previous years, so that the 0 => 1 is unique (if there was a 1 before, the cases are set to .)
What I did was lay the time frame between T2 and T3 over the time frame between T1 and T2 because this is for subsequent years and basically nothing changes (same events can happen, only political climate might be a control factor which for this analysis is irrelevant) - the whole purpose of this analysis was because data for T1 + T2 was insufficient a few years back and now we wanted to check whether adding the new phase would yield more stable results.
So I appended the data so that I only have two time points now but with data from both phases (T1->T2 & T2->T3)
The only problem I have now is duplicates. Maybe it's a logical problem I'm having, but I obviously got duplicate cases because the person numbers from the panel data are doubled since I have two time frames with the same persons in it (only additional observations/cases can happen for the 2nd phase). Since the data is made up in a way that to me only the 0 => 1 case is interesting (I want to observe change) I thought it wouldn't be a problem, because it can only happen once, but I need to identify the unique new cases where 0 => 1 happened in the 2nd time frame and definitely not drop them, while finding a way / creating a variable that tells me which cases are duplicates / drop the cases where I have double person numbers and basically compare the same persons from two different phases against each other while pretending they're in the same.
I thought about bringing in a variable that captures the years and maybe there's a way to easily drop those cases that would be double and harmful to the analysis and keep the ones which have happened uniquely in both T1+T2 & T2+T3 so that I can cleanly compare them...
Should anybody know how to deal with this and know a solution to the problem (if it is really problematic), I'd be very glad.
Kind regards
Kornelius
I'm currently calculating a model where I have three time points at which the same event can happen (panel data): T1 T2 T3
At each time point the observations/persons can go from 0 => 1 (latent change model)
So we have a pattern like:
T1 T2 T3
1 0 0
0 1 0
0 0 1
1 1 0
0 1 1
0 0 0
The cases that interest me and which I need for the rest of my analyses are the ones where the persons of observation go from 0 => 1.
I also created a variable that distinguishes whether the person has had a "1" in the previous years, so that the 0 => 1 is unique (if there was a 1 before, the cases are set to .)
What I did was lay the time frame between T2 and T3 over the time frame between T1 and T2 because this is for subsequent years and basically nothing changes (same events can happen, only political climate might be a control factor which for this analysis is irrelevant) - the whole purpose of this analysis was because data for T1 + T2 was insufficient a few years back and now we wanted to check whether adding the new phase would yield more stable results.
So I appended the data so that I only have two time points now but with data from both phases (T1->T2 & T2->T3)
The only problem I have now is duplicates. Maybe it's a logical problem I'm having, but I obviously got duplicate cases because the person numbers from the panel data are doubled since I have two time frames with the same persons in it (only additional observations/cases can happen for the 2nd phase). Since the data is made up in a way that to me only the 0 => 1 case is interesting (I want to observe change) I thought it wouldn't be a problem, because it can only happen once, but I need to identify the unique new cases where 0 => 1 happened in the 2nd time frame and definitely not drop them, while finding a way / creating a variable that tells me which cases are duplicates / drop the cases where I have double person numbers and basically compare the same persons from two different phases against each other while pretending they're in the same.
I thought about bringing in a variable that captures the years and maybe there's a way to easily drop those cases that would be double and harmful to the analysis and keep the ones which have happened uniquely in both T1+T2 & T2+T3 so that I can cleanly compare them...
Should anybody know how to deal with this and know a solution to the problem (if it is really problematic), I'd be very glad.
Kind regards
Kornelius
Comment