Announcement

Collapse
No announcement yet.
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • To construct a composite variable from multiple categorical variables

    My question may be simple. I tried to find a possible answer to my question, but failed to do so.

    I am trying to construct a composite variable from multiple categorical variables. I want to get a single value (e.g., mean) from multiple categorical variables.
    For instance, I want to estimate the effect of self-directed learning (SDL) on academic achievement. Both are categorical variables. There are five questions that are intended to assess the extent of SDL ability. So I would like to combine the five questions into one variable.

    How can I do this in STATA?
    egen, concat or egen, group are not what I am looking for because they do not give me a single value(number).
    If they are continuous variables, egen, rowmean will work.

    Or simply combining multiple categorical variables into a single variable is problematic, isn't it?
    Please help me with this.

  • #2
    Maybe use a simple sumscore of the five questions' response? Or perhaps something a little more involved, such as a MIMIC-like structural equation model using gsem. See the help file and user's manual for examples.

    Comment


    • #3
      An intermediate option to the ones mentioned by Joseph Coveney is a sheaf coefficient, see ssc desc sheafcoef
      ---------------------------------
      Maarten L. Buis
      University of Konstanz
      Department of history and sociology
      box 40
      78457 Konstanz
      Germany
      http://www.maartenbuis.nl
      ---------------------------------

      Comment


      • #4
        And if what Joseph and Maarten are proposing is not what you are looking for, please provide a data sample using -dataex-, and explain how exactly this combined variable that you desire has to look like.

        Comment


        • #5
          Originally posted by Maarten Buis View Post
          An intermediate option to the ones mentioned by Joseph Coveney is a sheaf coefficient, see ssc desc sheafcoef
          This is an exceptionally useful concept. Do you have any paper/report using this command showing a real-life use, just to get a better sense of its application. Thank you.
          Last edited by Sonnen Blume; 25 Sep 2020, 08:00.

          Comment

          Working...
          X