Dear Statalisters,
I am running a panel data with 328 million records on my 2 year old dell laptop (stata 13). I was doing a simple
bysort year and industry: egen total_x=total(x) statement, after a long time, stata stopped responding .
The data set is 6.7 GB. My question to you is : is there some settings I can change to decrease the chance stata froze on me.
I stopped the execution, and am now trying to break the data into single years (my panel starts 1975 and ends 2012),
second approach:
code:
use master, clear
forvalue i=1975/2012 {
keep if year==`x'
save data`x'
}
I then plan to rerun egen for each year's data.
not sure if this will somehow bypass the huge size issue. right now, stata has not shown any results yet.
If you could, please comment or suggest better ways.
thanks,
Rochelle
I am running a panel data with 328 million records on my 2 year old dell laptop (stata 13). I was doing a simple
bysort year and industry: egen total_x=total(x) statement, after a long time, stata stopped responding .
The data set is 6.7 GB. My question to you is : is there some settings I can change to decrease the chance stata froze on me.
I stopped the execution, and am now trying to break the data into single years (my panel starts 1975 and ends 2012),
second approach:
code:
use master, clear
forvalue i=1975/2012 {
keep if year==`x'
save data`x'
}
I then plan to rerun egen for each year's data.
not sure if this will somehow bypass the huge size issue. right now, stata has not shown any results yet.
If you could, please comment or suggest better ways.
thanks,
Rochelle
Comment