Hey guys,
I have an issue with respect to structuring my Lending Club dataset in a panel data format. I have a dataset with loan applications from the Lending Club Peer-to-Peer lending platform. My intention is to run an xtlogit regression to find out whether loan applications from US states with certain characteristics (e.g. startup activity) have a higher probability of being funded. As there are a lot loan applications per day, I created a date_time2 variable with clocktime notation. I would like to base my panel on the different states within the US, as such I'm trying to run the following command:
xtset State_Code date_time2, delta (1 second)
Unfortunately, STATA returns "repeated time values within panel" r (451). This is because STATA still recognizes a lot of these values as duplicates. Below you find a screenshot, which shows how I have structured my data. For State_Code==1 the last loans in my dataset were made on December 31st 2015. I distinguish them by seconds. Accordingly the first loan in my dataset for State==2 was made on September 23rd of 2007.
If more information is required please let me know. Hope you guys can help.
Thanks in advance!

I have an issue with respect to structuring my Lending Club dataset in a panel data format. I have a dataset with loan applications from the Lending Club Peer-to-Peer lending platform. My intention is to run an xtlogit regression to find out whether loan applications from US states with certain characteristics (e.g. startup activity) have a higher probability of being funded. As there are a lot loan applications per day, I created a date_time2 variable with clocktime notation. I would like to base my panel on the different states within the US, as such I'm trying to run the following command:
xtset State_Code date_time2, delta (1 second)
Unfortunately, STATA returns "repeated time values within panel" r (451). This is because STATA still recognizes a lot of these values as duplicates. Below you find a screenshot, which shows how I have structured my data. For State_Code==1 the last loans in my dataset were made on December 31st 2015. I distinguish them by seconds. Accordingly the first loan in my dataset for State==2 was made on September 23rd of 2007.
If more information is required please let me know. Hope you guys can help.
Thanks in advance!
Comment