Announcement

Collapse
No announcement yet.
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • Multinomial logistic regression - RRR is too big

    Hi,

    I am running a regression where a firm performance measure ( a categorical variable) is regressed on certain predictors using multinomial regression model.

    - Q4 is the firm performance variable that has 4 categories. Rest of the variables are explanatory variables.
    - The estimated coefficients in Stata are relative risk ratios (RRR).

    One of the comments from my reviewer is that the RRR coefficient of import capital intensity is TOO LARGE. I estimated the model again by:
    a. Using winsorized values of import capital intensity at 1% and 5% level
    b. Using winsorized values of all variables at 1% and 5% level

    However, nothing seem to work and the coefficient is still too large. I am looking for suggestions.

    Click image for larger version

Name:	Untitled.png
Views:	1
Size:	106.6 KB
ID:	1478528


    Thanks!



  • #2
    How is the variable in question scaled? What is the basis for saying the effect is too large? Coefficients are affected by how variables are measured, e.g. the effect of one dollar will be less than the effect of a thousand dollars. You can't really say whether an effect is too large without knowing how it is measured.
    -------------------------------------------
    Richard Williams
    Professor Emeritus of Sociology
    University of Notre Dame
    StataNow Version: 19.5 MP (2 processor)

    EMAIL: [email protected]
    WWW: https://academicweb.nd.edu/~rwilliam/

    Comment


    • #3
      1. The variable is defined as a ratio -

      Import capital intensity = Imported capital/Operating Expense

      2. Below is the summary statistics for this variable:

      Click image for larger version

Name:	Capture.JPG
Views:	1
Size:	44.3 KB
ID:	1478546



      So certainly outliers are not influencing the RRR coefficient of the regression


      (It is a panel data - the data is given for firms and years.)

      Originally posted by Richard Williams View Post
      How is the variable in question scaled? What is the basis for saying the effect is too large? Coefficients are affected by how variables are measured, e.g. the effect of one dollar will be less than the effect of a thousand dollars. You can't really say whether an effect is too large without knowing how it is measured.

      Comment


      • #4
        3. Also, if i take a log (import capital ratio), then these coefficients do not look that big. But, my guide suggests that it is not ideal to take a log transformation of a ratio.

        Comment


        • #5
          A coefficient tells you the effect of a one unit increase in an independent variable. In this case a 1 unit increase would be a super huge increase. I suggest multiplying the variable by 100, maybe even 1,000.

          Based on what you have said, I don't see any basis for saying the RRR is too big. Unless maybe other studies have measured variables the same way and found far smaller effects.
          -------------------------------------------
          Richard Williams
          Professor Emeritus of Sociology
          University of Notre Dame
          StataNow Version: 19.5 MP (2 processor)

          EMAIL: [email protected]
          WWW: https://academicweb.nd.edu/~rwilliam/

          Comment


          • #6
            Originally posted by Richard Williams View Post
            A coefficient tells you the effect of a one unit increase in an independent variable. In this case a 1 unit increase would be a super huge increase. I suggest multiplying the variable by 100, maybe even 1,000.

            Based on what you have said, I don't see any basis for saying the RRR is too big. Unless maybe other studies have measured variables the same way and found far smaller effects.
            I see.

            Multiplying it by 100 or 1000 resolves the issue. Thanks for your help

            Comment


            • #7
              Originally posted by Richard Williams View Post
              A coefficient tells you the effect of a one unit increase in an independent variable. In this case a 1 unit increase would be a super huge increase. I suggest multiplying the variable by 100, maybe even 1,000.

              Based on what you have said, I don't see any basis for saying the RRR is too big. Unless maybe other studies have measured variables the same way and found far smaller effects.
              I looked at percentiles of this variable. The median of this variable across different quartiles in really small which could explain why the estimated coefficient is so big.



              Click image for larger version

Name:	Capture.JPG
Views:	1
Size:	50.6 KB
ID:	1478804



              Comment


              • #8
                Hello,
                I have panel data which was observed one week. I have run multinomial logit regression with xtmlogit command with stating repeated observation. I obtained the following outcome. But, I have a question and I am confused some extreme coefficient and RRR values. My distance1 distance2 and distance3 (they are binary variables) variables seem to have very high coefficient values as you can see attached file. When I converted them into RRR values, one of them (distance1) is over 1000. SO, I would like to ask you that it is normal ? and do you have any suggestions to fix this problem. By the way, I only could attach the results for second category due to large table size. Thanks in advance.

                Click image for larger version

Name:	Result.jpg
Views:	1
Size:	191.4 KB
ID:	1644110

                Comment


                • #9
                  No, it is not usual to see RRRs that large for discrete variables--it is rare for anything to be that strongly associated to anything else in real life. My best guess, though it is only a guess without seeing the data, is that your base outcome 1 is a very rare event in your data. If outcome 1 almost never occurs in the estimation sample, then the coefficients of other variables can be very large. I suggest you take a look at the output of -tab decison distance1 if e(sample)- (and similarly with distance 2 and 3 to get a feel for what is going on. I suspect you are going to find some very tiny cells in the decision = 1 rows.

                  Comment


                  • #10
                    Originally posted by Gurpreet Singh View Post

                    I see.

                    Multiplying it by 100 or 1000 resolves the issue. Thanks for your help
                    I don't understand why multiplying 100 or 1000. Also, multiply with the very big RRR value variable?
                    Thank you for your explanation.

                    Comment

                    Working...
                    X