Hi,
I am trying to fill in a variable (state) over IDs IF the state is consistent for the year before and year after. My data for example looks like this:
ID_child year state
5032 1983 Tennessee
5032 1984 Tennessee
5032 1985 Tennessee
5032 1986 Tennessee
5032 1987 Tennessee
5032 1988 Tennessee
5032 1989 Tennessee
5032 1990 Tennessee
5032 1991 Tennessee
5032 1992 Tennessee
5032 1993 Tennessee
5032 1994 Tennessee
5032 1995 Tennessee
5032 1996 Tennessee
5032 1997 Tennessee
5032 1998 .
5032 1999 Tennessee
5032 2000 .
5032 2001 Tennessee
5032 2002 .
5032 2003 Tennessee
5032 2004 .
5032 2005 Tennessee
5032 2008 .
5032 2009 Illinois
5032 2010
I want it to look like this:
ID_child year state
5032 1983 Tennessee
5032 1984 Tennessee
5032 1985 Tennessee
5032 1986 Tennessee
5032 1987 Tennessee
5032 1988 Tennessee
5032 1989 Tennessee
5032 1990 Tennessee
5032 1991 Tennessee
5032 1992 Tennessee
5032 1993 Tennessee
5032 1994 Tennessee
5032 1995 Tennessee
5032 1996 Tennessee
5032 1997 Tennessee
5032 1998 Tennessee
5032 1999 Tennessee
5032 2000 Tennessee
5032 2001 Tennessee
5032 2002 Tennessee
5032 2003 Tennessee
5032 2004 Tennessee
5032 2005 Tennessee
5032 2008 .
5032 2009 Illinois
5032 2010 Illinois
For example, the state in year 2008 should be missing because we know that the child moved sometime between 2005 and 2009 but we do not know exactly when so we cannot assume that the state in year 2008 is Tennessee or Illinois. I have the code for filling in missing values based on state[_n+1] and state[_n-1] but when I run this code, it fills in 2008 as Illinois when it should really be missing. How do I go about replace the state as missing for those observations in which the state in the previous year that is not missing (for example in this it is 2005) and the year after this is not missing (for example 2009)?
Thank you for your help and time.
Surya
I am trying to fill in a variable (state) over IDs IF the state is consistent for the year before and year after. My data for example looks like this:
ID_child year state
5032 1983 Tennessee
5032 1984 Tennessee
5032 1985 Tennessee
5032 1986 Tennessee
5032 1987 Tennessee
5032 1988 Tennessee
5032 1989 Tennessee
5032 1990 Tennessee
5032 1991 Tennessee
5032 1992 Tennessee
5032 1993 Tennessee
5032 1994 Tennessee
5032 1995 Tennessee
5032 1996 Tennessee
5032 1997 Tennessee
5032 1998 .
5032 1999 Tennessee
5032 2000 .
5032 2001 Tennessee
5032 2002 .
5032 2003 Tennessee
5032 2004 .
5032 2005 Tennessee
5032 2008 .
5032 2009 Illinois
5032 2010
I want it to look like this:
ID_child year state
5032 1983 Tennessee
5032 1984 Tennessee
5032 1985 Tennessee
5032 1986 Tennessee
5032 1987 Tennessee
5032 1988 Tennessee
5032 1989 Tennessee
5032 1990 Tennessee
5032 1991 Tennessee
5032 1992 Tennessee
5032 1993 Tennessee
5032 1994 Tennessee
5032 1995 Tennessee
5032 1996 Tennessee
5032 1997 Tennessee
5032 1998 Tennessee
5032 1999 Tennessee
5032 2000 Tennessee
5032 2001 Tennessee
5032 2002 Tennessee
5032 2003 Tennessee
5032 2004 Tennessee
5032 2005 Tennessee
5032 2008 .
5032 2009 Illinois
5032 2010 Illinois
For example, the state in year 2008 should be missing because we know that the child moved sometime between 2005 and 2009 but we do not know exactly when so we cannot assume that the state in year 2008 is Tennessee or Illinois. I have the code for filling in missing values based on state[_n+1] and state[_n-1] but when I run this code, it fills in 2008 as Illinois when it should really be missing. How do I go about replace the state as missing for those observations in which the state in the previous year that is not missing (for example in this it is 2005) and the year after this is not missing (for example 2009)?
Thank you for your help and time.
Surya
Comment