  • Ideas for Identifying Only "Important" Variables in a Dataset

    Hi all,

    I have a dataset with 250 variables and roughly 2 million cases, which makes it a bit unwieldy. I analyze it with a .do file of roughly 800 lines that uses only a portion of those variables. Ideally, I could slim the analysis dataset down to just the variables the .do file uses.

    I know I could do this manually by stepping through the .do file and recording the variables it uses, but I was hoping someone had a way to keep track of them "automatically" as the .do file runs. Pipe dream?

    Thank you for any thoughts you might have.

    Ben


    Ben Hoen
    Berkeley Lab




  • #2
    Hi all,

    Just curious if anyone had any thoughts on this "odd" problem?

    Ben
