probit regression

jan dhoore

Join Date: May 2019

Posts: 4
#1

probit regression

26 May 2019, 13:38

When conducting the probit regression, i get multiple omitted variables. How do I solve this? By regressing each omitted variable as a dependent variables or is there an other way to account for these omitted variables?

Also when I try to export this stata output into an excel file using / outreg2 variables, replace e(all) label excel alpha(0.001, 0.01, 0.05) symbol(***, **, *), i get an error saying "option e() not allowed". How can I solve this problem?

Thanks in advance!!

Last edited by jan dhoore; 26 May 2019, 13:47.
Tags: None
William Lisowski

Join Date: Dec 2014

Posts: 10150
#2

26 May 2019, 13:51

Welcome to Statalist.

Without seeing your probit command and its output, it's difficult to be very helpful. And it's not clear what you mean by "solving" the problem.

You've apparently already seen the advice in the Stata FAQ at

https://www.stata.com/support/faqs/s...ted-variables/

based on your proposal to regress omitted variables as dependent variables. The result of doing this tells you what variables each omitted variable is collinear with, perhaps, but there's no way to get the omitted variable back into the model without excluding some other variable, so I don't know that this is a "solution" of the sort you are looking for.

In general, the most common reason for omitted variables is inclusion of a group of indicator variables ("dummy variables") that cover every category. For example, if everyone in your population has a gender of either female or male, and you have separate indicator variables for both female and male, then you have to exclude either the female or the male indicator variable, since together they add up to 1, which is collinear with the constant term. This is often called the "dummy variable trap" and is discussed in Wikipedia at the following location,

https://en.wikipedia.org/wiki/Dummy_...my_independent

Again, there's no "solution" to this.

For a more thorough discussion, should these pointers not prove helpful, you might take a few moments to review the Statalist FAQ linked to from the top of the page, as well as from the Advice on Posting link on the page you used to create your post. Note especially sections 9-12 on how to best pose your question. It's particularly helpful to copy commands and output from your Stata Results window and paste them into your Statalist post using code delimiters [CODE] and [/CODE], as described in section 12 of the FAQ.

The more you help others understand your problem, the more likely others are to be able to help you solve your problem.
Comment
jan dhoore

Join Date: May 2019

Posts: 4
#3

26 May 2019, 14:03

Okay, thanks for the information I'm new on this platform so i dont reqlly know the usual interaction. However my data is present below, maybe not in the way it's best readable, but i think it is still cleqr to read. As you can see, I have three omitted variables (all of them are dummies). All of these variables make up for the completness of a white paper, So it is logical the ommited variables are highly correlated with the variable complete white paper. For every problem, there was a solution as well, so this correlation is 1 and. Furthermore, both are found back in the whole sample so it makes sense that they are omitted. However, financial details are not always found back but is still omitted in my output with makes it kind of strange for me. So if I understand you well I have to delete all omitted variables except for one, or am I wrong?

----------------------+----------------------------------------------------------------
tokensdistributed | -1.824454 .8578954 -2.13 0.033 -3.505898 -.1430096
hardcaplog | .1131385 .1253064 0.90 0.367 -.1324576 .3587345
Patentactivitydummy | .3764257 .3852203 0.98 0.328 -.3785922 1.131444
Completewhitepaper | -.5022828 .3432484 -1.46 0.143 -1.175037 .1704717
Problemdummy | 0 (omitted)
Solutiondummy | 0 (omitted)
Technicalaspectdummy | .5402596 .6296035 0.86 0.391 -.6937406 1.77426
Financialdetailsdummy | 0 (omitted)
GitHubdummy | .5708313 .414692 1.38 0.169 -.2419502 1.383613
numberoftotalcommits | .0000674 .0000442 1.52 0.128 -.0000193 .0001541
Diversityindex | 1.150011 1.88543 0.61 0.542 -2.545365 4.845386
utilitydummy | .6478353 .6807837 0.95 0.341 -.6864763 1.982147
Tokensupplylog | -.0149982 .0711659 -0.21 0.833 -.1544809 .1244844
durationindays | -.0018485 .0033664 -0.55 0.583 -.0084466 .0047496
Country_n | .0348053 .0202925 1.72 0.086 -.0049673 .0745779
WPpages | .0042897 .0129377 0.33 0.740 -.0210678 .0296471
_cons | -1.142926 2.110682 -0.54 0.588 -5.279786 2.993934]

Thanks in advance

Last edited by jan dhoore; 26 May 2019, 14:10.
Comment
Phil Bromiley

Join Date: Apr 2014

Posts: 4348
#4

28 May 2019, 14:14

Jan,
You'll increase your chances of a useful answer by following the FAQ on asking questions - provide Stata code in code delimiters, readable Stata output (fixed spacing fonts help), and sample data using dataex. Assume we are not from your area. We don't even know what the columns are in the results you posted or what estimator (I assume an estimator) produced the results.

As William pointed out, there is probably a problem that your dummies add to one. Try regressing one of the dummies (say ProblemDummy) on the other rhs variables to try to understand what is going on .
Comment

Announcement

Comment

Comment

Comment