Announcement

Collapse
No announcement yet.
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • Normalising a variable between 0 and 1

    I am running a regression where my dependent variable is a 0-6 scale Likert item, and I am planning to run the regression via OLS before I compare the results to an ordered logit model. In my OLS model, I've normalized my dependent variable to be a scale between 0 and 1 (which will hopefully get around the issue that running OLS on Likert scale items will produce biased and inconsistent results). How do I interpret my regression output in this case? if I receive a coefficient of 0.04 for example, can I say that this is a 4% increase in the dependent variable for a unit increase in the independent variable?

  • #2
    Originally posted by Guest
    I've normalized my dependent variable to be a scale between 0 and 1 (which will hopefully get around the issue that running OLS on Likert scale items will produce biased and inconsistent results).
    What makes you think that?

    I don't have a reference handy, but recall reading somewhere that with upwards of seven ordered categories—and with 0 to 6, you seem to have that—you can often get away with treating the variable as interval scaled and fit a reasonably interpretable linear regression model to the unit-incremented values, using OLS or whatever.
    Last edited by sladmin; 26 Apr 2021, 15:18. Reason: anonymize original poster

    Comment


    • #3
      I fail to see how dividing by 6 can remove any bias or inconsistency.

      Besides, the values 0 ... 6 should mean something to you and other people working with the same data, as would a prediction of 3.45 or whatever. It isn't going to be easier to think about a predicted value of 3.45/6 = 0.575.



      Comment


      • #4
        Dear Nick,

        Thank you for your reply, this makes sense to me, and my original plan was to do what Joseph recommends. Where my confusion comes from is that I have been told that:

        "Typically, a scale made up of several 7-point Likert items is not appropriate for OLS. One issue is that the data are necessarily limited to the left (all people answer "1") and to the right (all people answer "7"). Another issue is that you are assuming that the responses can be summed to mean in a meaningful way. That is, you are assuming that an answer of "2" and an answer of "4" average out to an answer of "3". For most data, this is not true. In that case using OLS may give you biased and inconsistent results."

        Could you comment on this and add to what Joseph said? Would running OLS on this type of data be reasonable?

        Comment


        • #5
          Originally posted by Guest
          " . . .a scale made up of several 7-point Likert items . . ."
          You have several such items? If so, then what does a factor analysis (see example below) tell you? If the factor loadings are all about the same, then you might be able get by with linear regression of their sumscore. You can get an idea of what passes for "all about the same" from inspection of the factor loadings in the example below.

          .ÿ
          .ÿversionÿ16.1

          .ÿ
          .ÿclearÿ*

          .ÿ
          .ÿsetÿseedÿ`=strreverse("1604102")'

          .ÿ
          .ÿprogramÿdefineÿshowEm
          ÿÿ1.ÿÿÿÿÿÿÿÿÿversionÿ16.1
          ÿÿ2.ÿÿÿÿÿÿÿÿÿsyntaxÿanything(name=eqn)
          ÿÿ3.ÿ
          .ÿÿÿÿÿÿÿÿÿlocalÿtestÿ=ÿsubstr("`eqn'",ÿstrpos("`eqn'",ÿ":")ÿ+ÿ1,ÿ.)
          ÿÿ4.ÿÿÿÿÿÿÿÿÿquietlyÿtestÿ`test'
          ÿÿ5.ÿÿÿÿÿÿÿÿÿdisplayÿinÿsmclÿasÿtextÿ"Coefficientÿ±ÿSE:ÿ"ÿ///
          >ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ%04.2fÿ_b[`eqn']ÿ"ÿ±ÿ"ÿ%04.2fÿ_se[`eqn']
          ÿÿ6.ÿÿÿÿÿÿÿÿÿdisplayÿinÿsmclÿasÿtextÿ"Pÿ=ÿ"ÿ%04.2fÿr(p)
          ÿÿ7.ÿend

          .ÿ
          .ÿtempnameÿCorr

          .ÿmatrixÿdefineÿ`Corr'ÿ=ÿJ(5,ÿ5,ÿ0.7)ÿ+ÿI(5)ÿ*ÿ0.3

          .ÿquietlyÿdrawnormÿl1ÿl2ÿl3ÿl4ÿl5,ÿdoubleÿcorr(`Corr')ÿn(250)

          .ÿforeachÿvarÿofÿvarlistÿl?ÿ{
          ÿÿ2.ÿÿÿÿÿÿÿÿÿlocalÿnewÿ:ÿsubinstrÿlocalÿvarÿ"l"ÿ"item"
          ÿÿ3.ÿÿÿÿÿÿÿÿÿegenÿbyteÿ`new'ÿ=ÿcut(`var'),ÿgroup(7)
          ÿÿ4.ÿ}

          .ÿ
          .ÿgenerateÿdoubleÿpreÿ=ÿruniform(-0.5,ÿ0.5)

          .ÿ
          .ÿ*
          .ÿ*ÿBeginÿhere
          .ÿ*
          .ÿgsemÿ(item?ÿ<-ÿF,ÿoprobit),ÿvariance(F@1)ÿnodvheaderÿnocnsreportÿnolog

          GeneralizedÿstructuralÿequationÿmodelÿÿÿÿÿÿÿÿÿÿÿNumberÿofÿobsÿÿÿÿÿ=ÿÿÿÿÿÿÿÿ250
          Logÿlikelihoodÿ=ÿ-2016.4953

          ------------------------------------------------------------------------------
          ÿÿÿÿÿÿÿÿÿÿÿÿÿ|ÿÿÿÿÿÿCoef.ÿÿÿStd.ÿErr.ÿÿÿÿÿÿzÿÿÿÿP>|z|ÿÿÿÿÿ[95%ÿConf.ÿInterval]
          -------------+----------------------------------------------------------------
          item1ÿÿÿÿÿÿÿÿ|
          ÿÿÿÿÿÿÿÿÿÿÿFÿ|ÿÿÿ1.530575ÿÿÿ.1494107ÿÿÿÿ10.24ÿÿÿ0.000ÿÿÿÿÿ1.237735ÿÿÿÿ1.823415
          -------------+----------------------------------------------------------------
          item2ÿÿÿÿÿÿÿÿ|
          ÿÿÿÿÿÿÿÿÿÿÿFÿ|ÿÿÿ1.473228ÿÿÿ.1431126ÿÿÿÿ10.29ÿÿÿ0.000ÿÿÿÿÿ1.192732ÿÿÿÿ1.753724
          -------------+----------------------------------------------------------------
          item3ÿÿÿÿÿÿÿÿ|
          ÿÿÿÿÿÿÿÿÿÿÿFÿ|ÿÿÿ1.785881ÿÿÿ.1797322ÿÿÿÿÿ9.94ÿÿÿ0.000ÿÿÿÿÿ1.433613ÿÿÿÿÿ2.13815
          -------------+----------------------------------------------------------------
          item4ÿÿÿÿÿÿÿÿ|
          ÿÿÿÿÿÿÿÿÿÿÿFÿ|ÿÿÿ1.585869ÿÿÿ.1563639ÿÿÿÿ10.14ÿÿÿ0.000ÿÿÿÿÿ1.279402ÿÿÿÿ1.892337
          -------------+----------------------------------------------------------------
          item5ÿÿÿÿÿÿÿÿ|
          ÿÿÿÿÿÿÿÿÿÿÿFÿ|ÿÿÿ1.805861ÿÿÿ.1814921ÿÿÿÿÿ9.95ÿÿÿ0.000ÿÿÿÿÿ1.450143ÿÿÿÿ2.161579
          -------------+----------------------------------------------------------------
          /item1ÿÿÿÿÿÿÿ|
          ÿÿÿÿÿÿÿÿcut1ÿ|ÿÿ-1.990027ÿÿÿ.1952606ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ-2.372731ÿÿÿ-1.607323
          ÿÿÿÿÿÿÿÿcut2ÿ|ÿÿ-1.038565ÿÿÿ.1554137ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ-1.34317ÿÿÿ-.7339599
          ÿÿÿÿÿÿÿÿcut3ÿ|ÿÿ-.3298704ÿÿÿ.1421652ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ-.608509ÿÿÿ-.0512317
          ÿÿÿÿÿÿÿÿcut4ÿ|ÿÿÿ.3235942ÿÿÿ.1424777ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ.044343ÿÿÿÿ.6028454
          ÿÿÿÿÿÿÿÿcut5ÿ|ÿÿÿ1.044843ÿÿÿ.1557403ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ.7395972ÿÿÿÿ1.350088
          ÿÿÿÿÿÿÿÿcut6ÿ|ÿÿÿÿ1.94363ÿÿÿ.1898226ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ1.571585ÿÿÿÿ2.315676
          -------------+----------------------------------------------------------------
          /item2ÿÿÿÿÿÿÿ|
          ÿÿÿÿÿÿÿÿcut1ÿ|ÿÿ-1.933566ÿÿÿ.1870726ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ-2.300222ÿÿÿ-1.566911
          ÿÿÿÿÿÿÿÿcut2ÿ|ÿÿ-1.016727ÿÿÿ.1512807ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ-1.313231ÿÿÿ-.7202217
          ÿÿÿÿÿÿÿÿcut3ÿ|ÿÿ-.3158609ÿÿÿ.1383859ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ-.5870923ÿÿÿ-.0446296
          ÿÿÿÿÿÿÿÿcut4ÿ|ÿÿÿ.3078309ÿÿÿ.1382479ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ.03687ÿÿÿÿ.5787918
          ÿÿÿÿÿÿÿÿcut5ÿ|ÿÿÿ.9998912ÿÿÿÿ.150549ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ.7048205ÿÿÿÿ1.294962
          ÿÿÿÿÿÿÿÿcut6ÿ|ÿÿÿ1.897502ÿÿÿ.1849521ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ1.535002ÿÿÿÿ2.260001
          -------------+----------------------------------------------------------------
          /item3ÿÿÿÿÿÿÿ|
          ÿÿÿÿÿÿÿÿcut1ÿ|ÿÿ-2.256066ÿÿÿ.2285317ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ-2.70398ÿÿÿ-1.808152
          ÿÿÿÿÿÿÿÿcut2ÿ|ÿÿ-1.194789ÿÿÿ.1800433ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ-1.547668ÿÿÿ-.8419111
          ÿÿÿÿÿÿÿÿcut3ÿ|ÿÿ-.3399177ÿÿÿÿÿÿ.1596ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ-.6527278ÿÿÿ-.0271075
          ÿÿÿÿÿÿÿÿcut4ÿ|ÿÿÿ.4046611ÿÿÿÿ.159693ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ.0916685ÿÿÿÿ.7176537
          ÿÿÿÿÿÿÿÿcut5ÿ|ÿÿÿÿ1.18248ÿÿÿÿÿ.17698ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ.8356053ÿÿÿÿ1.529354
          ÿÿÿÿÿÿÿÿcut6ÿ|ÿÿÿÿ2.17104ÿÿÿ.2201924ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ1.739471ÿÿÿÿ2.602609
          -------------+----------------------------------------------------------------
          /item4ÿÿÿÿÿÿÿ|
          ÿÿÿÿÿÿÿÿcut1ÿ|ÿÿ-2.058657ÿÿÿ.2005686ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ-2.451764ÿÿÿÿ-1.66555
          ÿÿÿÿÿÿÿÿcut2ÿ|ÿÿ-1.119233ÿÿÿ.1636716ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ-1.440023ÿÿÿ-.7984422
          ÿÿÿÿÿÿÿÿcut3ÿ|ÿÿ-.3387851ÿÿÿ.1469229ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ-.6267486ÿÿÿ-.0508216
          ÿÿÿÿÿÿÿÿcut4ÿ|ÿÿÿ.3429487ÿÿÿÿ.146498ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ.055818ÿÿÿÿ.6300795
          ÿÿÿÿÿÿÿÿcut5ÿ|ÿÿÿ1.085634ÿÿÿ.1615663ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ.76897ÿÿÿÿ1.402298
          ÿÿÿÿÿÿÿÿcut6ÿ|ÿÿÿ2.020882ÿÿÿ.1985468ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ1.631737ÿÿÿÿ2.410026
          -------------+----------------------------------------------------------------
          /item5ÿÿÿÿÿÿÿ|
          ÿÿÿÿÿÿÿÿcut1ÿ|ÿÿ-2.269245ÿÿÿ.2297271ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ-2.719502ÿÿÿ-1.818988
          ÿÿÿÿÿÿÿÿcut2ÿ|ÿÿ-1.195828ÿÿÿ.1803253ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ-1.549259ÿÿÿ-.8423965
          ÿÿÿÿÿÿÿÿcut3ÿ|ÿÿ-.3683673ÿÿÿ.1600543ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ-.6820679ÿÿÿ-.0546667
          ÿÿÿÿÿÿÿÿcut4ÿ|ÿÿÿ.3617866ÿÿÿÿ.160268ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ.0476671ÿÿÿÿ.6759062
          ÿÿÿÿÿÿÿÿcut5ÿ|ÿÿÿ1.177931ÿÿÿ.1788058ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ.8274781ÿÿÿÿ1.528384
          ÿÿÿÿÿÿÿÿcut6ÿ|ÿÿÿ2.207101ÿÿÿ.2247487ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ1.766602ÿÿÿÿ2.647601
          -------------+----------------------------------------------------------------
          ÿÿÿÿÿÿÿvar(F)|ÿÿÿÿÿÿÿÿÿÿ1ÿÿ(constrained)
          ------------------------------------------------------------------------------

          .ÿestimatesÿstoreÿFree

          .ÿ
          .ÿquietlyÿgsemÿ(item1@aÿitem2@aÿitem3@aÿitem4@aÿitem5@aÿ<-ÿF@1,ÿoprobit),ÿ///
          >ÿÿÿÿÿÿÿÿÿvariance(F@1)ÿnodvheaderÿnocnsreportÿnolog

          .ÿlrtestÿFree

          Likelihood-ratioÿtestÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿLRÿchi2(4)ÿÿ=ÿÿÿÿÿÿ4.31
          (Assumption:ÿ.ÿnestedÿinÿFree)ÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿProbÿ>ÿchi2ÿ=ÿÿÿÿ0.3658

          .ÿ
          .ÿquietlyÿgsemÿ(item?ÿ<-ÿF,ÿoprobit)ÿ(Fÿ<-ÿc.pre),ÿnodvheaderÿnocnsreportÿnolog

          .ÿshowEmÿF:pre
          Coefficientÿ±ÿSE:ÿ0.25ÿ±ÿ0.35
          Pÿ=ÿ0.48

          .ÿ
          .ÿegenÿbyteÿsscÿ=ÿrowmean(item?)

          .ÿquietlyÿregressÿsscÿc.pre

          .ÿshowEmÿpre
          Coefficientÿ±ÿSE:ÿ0.26ÿ±ÿ0.38
          Pÿ=ÿ0.49

          .ÿ
          .ÿexit

          endÿofÿdo-file


          .


          But, if you've already done the factor analysis, then you might as well go ahead and add a structural component to the measurement model (as illustrated above) in order to fit a regression model then and there.

          Anyway, how to handle ordered-categorical responses has been a long-standing debate among those who use Likert scales and their individual items. You can get a flavor of the points to consider by Googling likert-scale sum-score and taking what you encounter as a second opinion to what you've been told.
          Last edited by sladmin; 26 Apr 2021, 15:19. Reason: anonymize original poster

          Comment


          • #6
            I would say that the advice quoted in #4 makes sense but puts the emphasis in the wrong place. It's not the fault of OLS if it doesn't know where the data come from or what their limits are. Similarly, saying that OLS provides a biased and inconsistent solution is puzzling as the whole point of the outcome being just ordered codes is that any kind of weighted average is at best arbitrary so no other estimator could find correct estimates of the parameters either because correct values aren't even defined in principle.
            Last edited by Nick Cox; 18 Apr 2021, 02:58.

            Comment

            Working...
            X