Hey folks, I need some help on how to manipulate/reshape a dataset, and I'm not sure how to do this using the standard reshape command. Any help would be appreciated.
I have data on natural hazard events for, in this example code, 5 sites. Each observation is a site-hazard combination, so a site can occupy multiple lines if it experienced multiple hazards. I also have data on the number of days each site experienced each hazard, as well as the number of 1-, 2- and 3- day gaps between hazards at each site. The data presently looks like this:
Code:
* Example generated by -dataex-. For more info, type help dataex clear input str15 site str12 type byte(total_days sum_day1 sum_day2 sum_day3) "Jefferson Park" "Closure" 12 0 0 0 "Washington Park" "Bac Advisory" 4 5 4 4 "Washington Park" "HAB Advisory" 25 9 9 7 "Madison Park" "Bac Advisory" 7 11 4 0 "Eisenhower Park" "Closure" 15 0 0 0 "Eisenhower Park" "Bac Advisory" 5 3 3 3 "Eisenhower Park" "HAB Advisory" 19 7 6 5 "Adams Park" "Closure" 24 0 0 0 "Adams Park" "HAB Advisory" 8 14 12 11 end
I need to get the data into a format in which each observation is a site (instead of a site-hazard combination), and for each site the variables include the number of hazard days for each hazard, as well as the number of "gap" days for the non-closure hazards.
Here is an example of the format I need, hich corresponds to the above code:
Code:
* Example generated by -dataex-. For more info, type help dataex clear input str15 site byte(closuredays habdays hab_day1 hab_day2 hab_day3 bacdays bac_day1 bac_day2 bac_day3) "Jefferson Park" 12 0 0 0 0 0 0 0 0 "Washington Park" 0 25 9 9 7 4 5 4 4 "Madison Park" 0 0 0 0 0 7 11 4 0 "Eisenhower Park" 15 19 7 6 5 5 3 3 3 "Adams Park" 24 8 14 12 11 0 0 0 0 end
Comment