<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/">
	<channel>
		<title>Statalist - Forums</title>
		<link>https://www.statalist.org/forums/</link>
		<description>vBulletin Forums</description>
		<language>en</language>
		<lastBuildDate>Tue, 09 Jun 2026 00:12:32 GMT</lastBuildDate>
		<generator>vBulletin</generator>
		<ttl>60</ttl>
		<image>
			<url>images/misc/rss.png</url>
			<title>Statalist - Forums</title>
			<link>https://www.statalist.org/forums/</link>
		</image>
		<item>
			<title><![CDATA[Command multproc keeps returning &amp;quot;p ambiguous abbreviation&amp;quot;]]></title>
			<link>https://www.statalist.org/forums/forum/general-stata-discussion/general/1786277-command-multproc-keeps-returning-p-ambiguous-abbreviation</link>
			<pubDate>Mon, 08 Jun 2026 14:38:07 GMT</pubDate>
			<description>Dear all,  
 
my analysis is very similar to the second example here SMILEPLOT: Stata module to create plots for use with multiple significance tests...</description>
			<content:encoded><![CDATA[Dear all, <br />
<br />
my analysis is very similar to the second example here <a href="https://www.researchgate.net/publication/4794304_SMILEPLOT_Stata_module_to_create_plots_for_use_with_multiple_significance_tests" target="_blank">SMILEPLOT: Stata module to create plots for use with multiple significance tests</a> but I keep running into the same error.<br />
<br />
This is the code I'm using:<br />
logistic event2 sodium_5 pot_5 calc_5 mag_5 pho_5 iron_5 copper_5 zinc_5 chlo_5 retinol_5 carotene_5 vitd_5 thiamin_5 ribo_5 niacin_5 trypto_5 vitc_5 vite_5 pyrid_5 b12_5 folic_tot_5 panto_5 biotin_5 iodine_5 mn_5 retequ_5 se_5 sfa_5 n6pufa_5 n3pufa_5 mfa_5 pufa_5 transfa_5 cholest_5<br />
<br />
parmest,format(estimate min95 max95 %8.2f p %8.1e) list(,)<br />
<br />
multproc, method(simes) puncor(.05)<br />
<br />
But every time I run this code, I get &quot;p ambiguous abbreviation&quot; error message. I've also tried multproc, pvalue(p) method(simes) puncor(.05) as seen in other examples but it doesn't address the initial issue as the error message I receive is &quot;p ambiguous abbreviation (error in option pvalue())&quot;<br />
<br />
I feel like I've tried everything and I'd appreciate any help, thank you.<br />
 ]]></content:encoded>
			<category domain="https://www.statalist.org/forums/forum/general-stata-discussion/general">General</category>
			<dc:creator>Kay Vachova</dc:creator>
			<guid isPermaLink="true">https://www.statalist.org/forums/forum/general-stata-discussion/general/1786277-command-multproc-keeps-returning-p-ambiguous-abbreviation</guid>
		</item>
		<item>
			<title>Anderson-Rubin P-Values and Standard Errors</title>
			<link>https://www.statalist.org/forums/forum/general-stata-discussion/general/1786273-anderson-rubin-p-values-and-standard-errors</link>
			<pubDate>Sun, 07 Jun 2026 15:59:48 GMT</pubDate>
			<description>Hello,  
I am working on a revision to a manuscript where the editor asked us to report Anderson-Rubin p-values as a robustness test. When obtaining...</description>
			<content:encoded><![CDATA[Hello, <br />
I am working on a revision to a manuscript where the editor asked us to report Anderson-Rubin p-values as a robustness test. When obtaining Anderson-Rubin p-values, will the standard errors also change (i.e., should my standard errors be different from the standard two-stage least squares estimates)?]]></content:encoded>
			<category domain="https://www.statalist.org/forums/forum/general-stata-discussion/general">General</category>
			<dc:creator>Zachariah Rutledge</dc:creator>
			<guid isPermaLink="true">https://www.statalist.org/forums/forum/general-stata-discussion/general/1786273-anderson-rubin-p-values-and-standard-errors</guid>
		</item>
		<item>
			<title>save commands</title>
			<link>https://www.statalist.org/forums/forum/general-stata-discussion/general/1786269-save-commands</link>
			<pubDate>Fri, 05 Jun 2026 18:41:54 GMT</pubDate>
			<description>Can someone show me 
how do you use the saved commands that vary by one variable (e.g. facilitykey), see below? Thank you  
 
sum iss if...</description>
			<content:encoded><![CDATA[Can someone show me<br />
how do you use the saved commands that vary by one variable (e.g. facilitykey), see below? Thank you <br />
<br />
sum iss if facilitykey==, detail<br />
tab iss_15 if facilitykey==<br />
tab died if facilitykey==<br />
tab died if iss_15==1 &amp; facilitykey==<br />
sum hlos if facilitykey==, detail<br />
sum age if facilitykey==, detail<br />
tab age_65 if facilitykey==<br />
tab gender if facilitykey==<br />
tab penetr if facilitykey==<br />
tab gcs_8 if facilitykey==<br />
tab eddispo if iss_15==1 &amp; facilitykey==<br />
tab level if facilitykey==<br />
tab teachingstatus if facilitykey==<br />
tab bedsize if facilitykey==<br />
tab transfu if facilitykey==<br />
tab hcontrol if facilitykey==<br />
tab hhcontrol if facilitykey==<br />
tab angio if facilitykey==]]></content:encoded>
			<category domain="https://www.statalist.org/forums/forum/general-stata-discussion/general">General</category>
			<dc:creator>Narong Kul</dc:creator>
			<guid isPermaLink="true">https://www.statalist.org/forums/forum/general-stata-discussion/general/1786269-save-commands</guid>
		</item>
		<item>
			<title>xtabond2/xtdpdgmm: Estimation of dynamic simultaneous system of equations using GMM</title>
			<link>https://www.statalist.org/forums/forum/general-stata-discussion/general/1786267-xtabond2-xtdpdgmm-estimation-of-dynamic-simultaneous-system-of-equations-using-gmm</link>
			<pubDate>Fri, 05 Jun 2026 13:49:43 GMT</pubDate>
			<description>Hi, 
I want to estimate the effect of X on Y in formal sector and then what is effect of Y of the formal sector on the Y of Informal Sector. For...</description>
			<content:encoded><![CDATA[Hi,<br />
I want to estimate the effect of X on Y in formal sector and then what is effect of Y of the formal sector on the Y of Informal Sector. For instance,<br />
What is the effect of TFP (which is endogenous) on the employment of formal sector and what is the effect of employment of formal sector (due to TFP change) on the employment of informal sector. More specifically,<br />
<br />
<a href="filedata/fetch?filedataid=1786268">Array </a><br />
<br />
I have industry-region panel data. Four Time points. Approx 3500 observations. Can I estimate this system of equations simultaneously using GMM by using xtabond2 or xtdpdgmm? Do I need to manually stack the instruments for the equations in order to estimate the system simultaneously using the GMM command? However, I do not think this approach will reveal the mechanism through which Y in the formal sector affects Y in the informal sector. My objective is to estimate the system jointly so that I can identify the transmission mechanism, namely:<br />
<br />
TFP in the formal sector → Employment in the formal sector → Employment in the informal sector.<br />
<br />
 ]]></content:encoded>
			<category domain="https://www.statalist.org/forums/forum/general-stata-discussion/general">General</category>
			<dc:creator>lakhi narayan</dc:creator>
			<guid isPermaLink="true">https://www.statalist.org/forums/forum/general-stata-discussion/general/1786267-xtabond2-xtdpdgmm-estimation-of-dynamic-simultaneous-system-of-equations-using-gmm</guid>
		</item>
		<item>
			<title><![CDATA[problem including never-treated with Sun &amp;amp; Abraham(2021) eventstudyinteract]]></title>
			<link>https://www.statalist.org/forums/forum/general-stata-discussion/general/1786265-problem-including-never-treated-with-sun-abraham-2021-eventstudyinteract</link>
			<pubDate>Thu, 04 Jun 2026 21:08:52 GMT</pubDate>
			<description>I am using the IW estimate and the eventstudyinteract command from Sun and Abraham (2021) ...</description>
			<content:encoded><![CDATA[I am using the IW estimate and the eventstudyinteract command from <a href="https://www.sciencedirect.com/science/article/pii/S030440762030378X" target="_blank">Sun and Abraham (2021) </a>with 'treated' (variable turns on when unit is treated) and 'never_treated' units, and with pre and post variables, as seen: <br />
<br />
CODE: <br />
               [eventstudyinteract DEPVAR month_pre6 month_pre5 month_pre4 month_pre3 month_pre2 month_post0 month_post1 month_post2 month_post3 month_post4 month_post5 month_post6, cohort(treated) control_cohort(never_treated) absorb(i.m_y##i.gov hhid) vce(cluster Code)] <br />
<br />
Since the estimator uses never treated units, I assumed that they would be included in the estimation sample, however, only treated units are in the estimation sample.<br />
<br />
Is this because there are three steps in the estimation procedure and the last step is creating a weighted sum of the first step estimates - which already incorporated the never treated units? I can't think of another reason that the never treated units woutd not be included in the estimation sample. In their paper and other papers that use the estimator, authors are clear that never treated units can be part of the control group. <br />
<br />
Any feedback would be much appreciated. ]]></content:encoded>
			<category domain="https://www.statalist.org/forums/forum/general-stata-discussion/general">General</category>
			<dc:creator>Anna DSouza</dc:creator>
			<guid isPermaLink="true">https://www.statalist.org/forums/forum/general-stata-discussion/general/1786265-problem-including-never-treated-with-sun-abraham-2021-eventstudyinteract</guid>
		</item>
		<item>
			<title>Psacalc to estimate unobservable ability in regression capturing effect of formal vocational training</title>
			<link>https://www.statalist.org/forums/forum/general-stata-discussion/general/1786264-psacalc-to-estimate-unobservable-ability-in-regression-capturing-effect-of-formal-vocational-training</link>
			<pubDate>Thu, 04 Jun 2026 19:02:50 GMT</pubDate>
			<description>I am regressing the binary variable NEET(not in education or training) on past vocational training including controls and state fixed effects. Since...</description>
			<content:encoded><![CDATA[I am regressing the binary variable NEET(not in education or training) on past vocational training including controls and state fixed effects. Since the vocational training variable can be affected by ability, I want to estimate the selection on unobservables, the criterion used by Oster (2019). <br />
<br />

<div class="bbcode_container">
	<div class="bbcode_description">Code:</div>
	<pre class="bbcode_code">regress neet_t365_july i.urban i.male mpce i.unmarried i.religion i.social_group i.state i.edulevel##i.pastvoc365 [pweight = pop_weight] if majorstate==1, vce(cluster fsu)

psacalc beta 1.pastvoc365, delta(1) rmax(`rmax_val') * 4. Calculate the breakdown delta (the selection required to drive the treatment effect to zero) psacalc delta 1.pastvoc365, rmax(`rmax_val')</pre>
</div>IS the process correct? How do I interpret the results? I am getting a negative delta]]></content:encoded>
			<category domain="https://www.statalist.org/forums/forum/general-stata-discussion/general">General</category>
			<dc:creator>Sayoree Gooptu</dc:creator>
			<guid isPermaLink="true">https://www.statalist.org/forums/forum/general-stata-discussion/general/1786264-psacalc-to-estimate-unobservable-ability-in-regression-capturing-effect-of-formal-vocational-training</guid>
		</item>
		<item>
			<title>Testing parallel trends for continuous treatment</title>
			<link>https://www.statalist.org/forums/forum/general-stata-discussion/general/1786258-testing-parallel-trends-for-continuous-treatment</link>
			<pubDate>Thu, 04 Jun 2026 06:57:00 GMT</pubDate>
			<description>Hello, I am trying to understand how I could show the parallel trends for continuous treatment. I have a standard DiD with treatment intensity...</description>
			<content:encoded>Hello, I am trying to understand how I could show the parallel trends for continuous treatment. I have a standard DiD with treatment intensity instead of a binary treatment. As treatment intensity is zero in the pre-period, I was not clear how to show the parallel trend. Really appreciate your help in advance. </content:encoded>
			<category domain="https://www.statalist.org/forums/forum/general-stata-discussion/general">General</category>
			<dc:creator>Myat Thida Win</dc:creator>
			<guid isPermaLink="true">https://www.statalist.org/forums/forum/general-stata-discussion/general/1786258-testing-parallel-trends-for-continuous-treatment</guid>
		</item>
		<item>
			<title>Latest (03 June 2026) updates to StataNow 19</title>
			<link>https://www.statalist.org/forums/forum/general-stata-discussion/general/1786255-latest-03-june-2026-updates-to-statanow-19</link>
			<pubDate>Wed, 03 Jun 2026 17:05:01 GMT</pubDate>
			<description>A new update to StataNow 19 is now available (as of 03 June 2026). This update includes new features for financial statistics and more. You can read...</description>
			<content:encoded><![CDATA[A new update to StataNow 19 is now available (as of 03 June 2026). This update includes new features for financial statistics and more. You can read about all of the new features in our latest <a href="https://blog.stata.com/2026/06/03/a-new-update-to-statanow-has-just-been-released-4/" target="_blank">blog post</a>.]]></content:encoded>
			<category domain="https://www.statalist.org/forums/forum/general-stata-discussion/general">General</category>
			<dc:creator>Kristin MacDonald (StataCorp)</dc:creator>
			<guid isPermaLink="true">https://www.statalist.org/forums/forum/general-stata-discussion/general/1786255-latest-03-june-2026-updates-to-statanow-19</guid>
		</item>
		<item>
			<title>CIVREG: A Stata package for synthetic (coplanar) instrumental variables estimation</title>
			<link>https://www.statalist.org/forums/forum/general-stata-discussion/general/1786254-civreg-a-stata-package-for-synthetic-coplanar-instrumental-variables-estimation</link>
			<pubDate>Wed, 03 Jun 2026 10:36:55 GMT</pubDate>
			<description>Dear Statalist users, 
 
I would like to announce civreg, a new Stata package for instrumental variables estimation based on the Synthetic...</description>
			<content:encoded><![CDATA[Dear Statalist users,<br />
<br />
I would like to announce <b>civreg</b>, a new Stata package for instrumental variables estimation based on the Synthetic Instrumental Variables (SIV) methodology.<br />
<br />
The package implements the approach proposed by Dzhumashev and Tursunalieva (2025), which addresses endogeneity without requiring external instruments. Instead, valid instruments are constructed directly from the observed data by exploiting the coplanar structure of the regression system and a data-driven Dual Tendency (DT) condition. The method identifies both the synthetic instrument and the direction of endogeneity from the data itself<br />
<br />
To install, type:<br />

<div class="bbcode_container">
	<div class="bbcode_description">HTML Code:</div>
	<pre class="bbcode_code">net install civreg, from(&quot;https://raw.githubusercontent.com/ManhHB94/civreg/main/&quot;)</pre>
</div>Key features of civreg include:<ul><li>Estimation without requiring external instruments.</li>
<li>Support for models with or without exogenous control regressors.</li>
<li>Support for both homoskedastic and heteroskedastic SIV identification procedures.</li>
<li>Automatic determination of the direction of endogeneity.</li>
<li>Compatibility with fixed-effects and two-way fixed-effects settings for panel data.</li>
</ul>The example below replicates the SIV results presented in Table 1 of Dzhumashev and Tursunalieva (2025):<br />

<div class="bbcode_container">
	<div class="bbcode_description">HTML Code:</div>
	<pre class="bbcode_code">. . webuse mroz, clear

. . civreg hours (lwage = ) educ age kidslt6 kidsge6 nwifeinc , hete(0) reps(49) small rcode

------------------------------------------------------------------------------
Coplanar instrumental variables (CIV) regression
------------------------------------------------------------------------------
Dual Tendency:  Homoscedastic
Effects:        None
Reference:      Dzhumashev and Tursunalieva (2025)

IV (2SLS) estimation
--------------------

Estimates efficient for homoskedasticity only
Statistics consistent for homoskedasticity only

                                                      Number of obs =      428
                                                      F(  6,   421) =    18.26
                                                      Prob &gt; F      =   0.0000
Total (centered) SS     =  257311019.9                Centered R2   =  -1.3833
Total (uncentered) SS   =    983895094                Uncentered R2 =   0.3767
Residual SS             =  613254452.5                Root MSE      =     1207

------------------------------------------------------------------------------
       hours | Coefficient  Std. err.      t    P&gt;|t|     &#91;95% conf. interval&#93;
-------------+----------------------------------------------------------------
       lwage |    1369.47   138.4851     9.89   0.000     1097.261    1641.678
        educ |  -159.1529   30.84987    -5.16   0.000    -219.7919   -98.51395
         age |  -10.44129    8.83987    -1.18   0.238    -27.81706    6.934493
     kidslt6 |   -225.613   160.0919    -1.41   0.159     -540.292    89.06597
     kidsge6 |  -55.12937   49.49267    -1.11   0.266    -152.4129    42.15416
    nwifeinc |  -8.687533   5.853077    -1.48   0.138    -20.19243    2.817362
       _cons |   2396.561   544.0511     4.41   0.000     1327.166    3465.956
------------------------------------------------------------------------------
Underidentification test (Anderson canon. corr. LM statistic):         167.557
                                                   Chi-sq(1) P-val =    0.0000
------------------------------------------------------------------------------
Weak identification test (Cragg-Donald Wald F statistic):              270.853
Stock-Yogo weak ID test critical values: 10% maximal IV size             16.38
                                         15% maximal IV size              8.96
                                         20% maximal IV size              6.66
                                         25% maximal IV size              5.53
Source: Stock-Yogo (2005).  Reproduced by permission.
------------------------------------------------------------------------------
Sargan statistic (overidentification test of all instruments):           0.000
                                                 (equation exactly identified)
------------------------------------------------------------------------------
Instrumented:         lwage
Included instruments: educ age kidslt6 kidsge6 nwifeinc
Excluded instruments: civ_lwage
------------------------------------------------------------------------------</pre>
</div><br />
Comments, suggestions, and bug reports are welcome.<br />
<br />
Reference:<br />
Dzhumashev, R., Tursunalieva, A. 2025.  A synthetic instrumental variable method: Using the dual tendency condition for coplanar instruments. <a href="https://doi.org/10.48550/arXiv.2512.17301" target="_blank">https://doi.org/10.48550/arXiv.2512.17301</a>.<br />
 ]]></content:encoded>
			<category domain="https://www.statalist.org/forums/forum/general-stata-discussion/general">General</category>
			<dc:creator>Manh Hoang Ba</dc:creator>
			<guid isPermaLink="true">https://www.statalist.org/forums/forum/general-stata-discussion/general/1786254-civreg-a-stata-package-for-synthetic-coplanar-instrumental-variables-estimation</guid>
		</item>
		<item>
			<title>Performing multiple imputation regression with pre-imputated variables (IPUMS INCIMP1-5)</title>
			<link>https://www.statalist.org/forums/forum/general-stata-discussion/general/1786249-performing-multiple-imputation-regression-with-pre-imputated-variables-ipums-incimp1-5</link>
			<pubDate>Tue, 02 Jun 2026 17:34:00 GMT</pubDate>
			<description>Hello All, 
 
I am using the multiple imputated variables incimp1 through incimp5 for my logit regression. I am wondering if anyone is familiar with...</description>
			<content:encoded><![CDATA[Hello All,<br />
<br />
I am using the multiple imputated variables incimp1 through incimp5 for my logit regression. I am wondering if anyone is familiar with doing this in Stata and could provide feedback on my code.<br />
<br />
It successfully generated a logit regression output, but I wanted to check and make sure that this is an appropriate use of these commands since is my first time using them. The Stata documentation is mainly geared toward data you impute yourself, not ones where imputation is already provided.<br />
<br />
Some of the variables used in the logit regression are cleaned and named differently than the IPUMS website, so I am hoping you can tell by looking at the code:<br />
<br />

<div class="bbcode_container">
	<div class="bbcode_description">Code:</div>
	<pre class="bbcode_code"> /* 
The variables incimp1-5 are imputated based on incfam97on2. I renamed incfam97on2 
incimp so it matches the imputated variables and removed missing data
 */
gen incimp=incfam97on2
replace incimp=. if incfam97on2&gt;95

* Tell Stata which variables are imputated and which variable they were imputated from

mi import wide, imputed(incimp=incimp1 incimp2 incimp3 incimp4 incimp5) clear

* Check conversion - &quot;incimp&quot; is noted as being imputed
mi describe

* Run logit regression using multiple imputation
mi estimate: logit CVD i.birthcohort10 c.age c.age#c.age i.female i.incimp if insample==1</pre>
</div><br />
I had previously posted on IPUMS forum, and they directed me to Statalist: <a href="https://forum.ipums.org/t/stata-logit-regression-with-multiple-imputated-variables-incimp1-5/7083?u=lilian_brusic" target="_blank">https://forum.ipums.org/t/stata-logi...=lilian_brusic</a>]]></content:encoded>
			<category domain="https://www.statalist.org/forums/forum/general-stata-discussion/general">General</category>
			<dc:creator>Lily Brusic</dc:creator>
			<guid isPermaLink="true">https://www.statalist.org/forums/forum/general-stata-discussion/general/1786249-performing-multiple-imputation-regression-with-pre-imputated-variables-ipums-incimp1-5</guid>
		</item>
		<item>
			<title>Suggestion about my model</title>
			<link>https://www.statalist.org/forums/forum/forum-help/sandbox/1786241-suggestion-about-my-model</link>
			<pubDate>Tue, 02 Jun 2026 00:19:25 GMT</pubDate>
			<description>Hi everyone, 
I am an undergraduate economics student working on this model. I am posting here not just to get answers, but genuinely to learn and...</description>
			<content:encoded><![CDATA[<span style="font-size:16px">Hi everyone,<br />
I am an undergraduate economics student working on this model. I am posting here not just to get answers, but genuinely to learn and test my own understanding. Any feedback, criticism, or suggestions are welcome.<br />
The primary objective of this model is to isolate and quantify the effect of meteorological drought on annual barley production. ΔCultivatedArea is included strictly as a control variable.<br />
The empirical model is specified as follows:<br />
<br />
<a href="filedata/fetch?filedataid=1786242">Array </a><br />
Where:<br />
n=26(due to differencing of cultivatedarea<br />
t= year</span>  <span style="font-size:16px">PRODUCTION: Annual barley production (tonnes)</span><br />
  <span style="font-size:16px">SPEI_7: 7-month SPEI index for August</span><br />
  <span style="font-size:16px">ΔCultivatedArea: First difference of barley cultivated area (hectares)</span><br />
 <br />
<span style="font-size:16px">What are the steps I should follow, in order, to properly estimate and validate this model?<br />
<br />
So far I have completed the following steps:</span><ul><li><span style="font-size:16px">ADF Unit Root Tests</span></li>
<li><span style="font-size:16px">Pearson Correlation Matrix (Multicollinearity Check)</span></li>
<li><span style="font-size:16px">OLS Estimation</span></li>
<li><span style="font-size:16px">Breusch-Godfrey Test (Autocorrelation)</span></li>
<li><span style="font-size:16px">Breusch-Pagan-Godfrey Test (Heteroskedasticity)</span></li>
<li><span style="font-size:16px">Jarque-Bera and Shapiro-Wilk Tests (Normality of Residuals)</span></li>
<li><span style="font-size:16px">Ramsey RESET Test (Functional Form)</span></li>
</ul>]]></content:encoded>
			<category domain="https://www.statalist.org/forums/forum/forum-help/sandbox">Sandbox</category>
			<dc:creator>Huseyin Gormus</dc:creator>
			<guid isPermaLink="true">https://www.statalist.org/forums/forum/forum-help/sandbox/1786241-suggestion-about-my-model</guid>
		</item>
		<item>
			<title>OLS Time Series: Sufficient Diagnostics or Missing Steps</title>
			<link>https://www.statalist.org/forums/forum/general-stata-discussion/general/1786240-ols-time-series-sufficient-diagnostics-or-missing-steps</link>
			<pubDate>Mon, 01 Jun 2026 22:57:28 GMT</pubDate>
			<description>Hi everyone, I am an undergraduate economics student working on this model, I am posting here not just to get answers, but genuinely to learn and...</description>
			<content:encoded><![CDATA[<span style="font-size:16px">Hi everyone, I am an undergraduate economics student working on this model, I am posting here not just to get answers, but genuinely to learn and test my own understanding of the methodology I applied. Any feedback, criticism, or suggestions are welcome.I want to understand where I might be wrong. The primary objective of this model is to isolate and quantify the effect of meteorological drought, measured by the SPEI_7 index, on annual barley production. ΔCultivatedArea is included strictly as a control variable to prevent the drought coefficient from absorbing the effect of physical land changes, not as a variable of independent interest<br />
<br />
Here is my setup<br />
<br />
Model: Production_t = β0 + β1SPEI7_t + β2ΔCultivatedAreat + ε_t (n=26).(due to differencing)<br />
<br />
Where:</span><ul><li><span style="font-size:16px">PRODUCTION: Annual barley production (tonnes)</span></li>
<li><span style="font-size:16px">SPEI_7: 7-month SPEI index for August</span></li>
<li><span style="font-size:16px">ΔCultivatedArea: First difference of barley cultivated area </span></li>
</ul> <span style="font-size:16px"><b>Steps followed:</b></span><ul><li><span style="font-size:16px"><b>ADF unit root tests</b> (intercept for PRODUCTION and SPEI_7; intercept+trend for CultivatedArea due to visible deterministic trend)</span></li>
<li><span style="font-size:16px">First-differenced CultivatedArea to achieve stationarity</span></li>
<li><span style="font-size:16px"><b>Pearson correlation matrix to check multicollinearity</b> (r = -0.081 between SPEI_7 and ΔCultivatedArea)</span></li>
<li><span style="font-size:16px">OLS estimation</span></li>
<li><span style="font-size:16px"><b>Breusch-Godfrey test </b>for autocorrelation (lag=1)</span></li>
<li><span style="font-size:16px"><b>Breusch-Pagan-Godfrey test</b> for heteroskedasticity</span></li>
<li><span style="font-size:16px"><b>Jarque-Bera and Shapiro-Wilk tests</b> for normality of residuals</span></li>
<li><span style="font-size:16px"><b>Ramsey RESET</b> test for functional form (F p=0.8856)</span></li>
</ul> <b>Results:</b><br />
<span style="font-size:16px">SPEI_7: <b>β=874,320, p=0.0021</b> (significant at 1%)</span><br />
<span style="font-size:16px">ΔCultivatedArea: <b>β=1.983, p=0.0188</b> (significant at 5%)</span><br />
<span style="font-size:16px"><b>R²=0.453</b>, Adjusted <b>R²=0.401, F p=0.0014</b></span><br />
<span style="font-size:16px">All diagnostic tests passed (no autocorrelation, no heteroskedasticity, normality satisfied, correct functional form</span><br />
<br />
<span style="font-size:20px"><b>MY QUESTIONS:</b></span><br />
<br />
<span style="font-size:16px">Two of the diagnostic tests produced borderline results that I would like to highlight:<br />
<br />
<b>1. Breusch-Godfrey Test (Autocorrelation)</b></span><ul><li><span style="font-size:16px">Chi-Square <b>p = 0.0691</b></span></li>
<li><span style="font-size:16px">F p = <b>0.0874</b></span></li>
<li><span style="font-size:16px">Both values exceed the 0.05 threshold, so the null hypothesis of no autocorrelation cannot be rejected. However, the margin is relatively narrow. I am wondering whether this should be a concern or whether it is simply a consequence of the small sample size (n=26).</span></li>
</ul> <br />
<span style="font-size:16px"><b>2. Shapiro-Wilk Test (Normality of Residuals)</b></span><ul><li><span style="font-size:16px">p = <b>0.0532</b></span></li>
<li><span style="font-size:16px">The null hypothesis of normality cannot be rejected, but the result is marginally above the critical value. Again, I suspect this may be related to the limited number of observations.</span></li>
</ul> <br />
<span style="font-size:16px">With only n=26 observations, ADF unit root tests are known to have low power. Is there a more appropriate test for this sample, and should I run both for robustness?</span><br />
<br />
<span style="font-size:16px">While I argue that SPEI_7 is strictly exogenous, the same argument does not hold for ΔCultivatedArea, as annual planting decisions may be correlated with omitted socioeconomic variables such as input costs or government subsidies. However, since the correlation between SPEI_7 and ΔCultivatedArea is negligible (r=-0.081, p=0.73), I argue that even if the ΔCultivatedArea coefficient is biased, this does not contaminate the SPEI7 estimate. Is this reasoning valid, or should I be more concerned about the potential endogeneity of ΔCultivatedArea?</span>]]></content:encoded>
			<category domain="https://www.statalist.org/forums/forum/general-stata-discussion/general">General</category>
			<dc:creator>Huseyin Gormus</dc:creator>
			<guid isPermaLink="true">https://www.statalist.org/forums/forum/general-stata-discussion/general/1786240-ols-time-series-sufficient-diagnostics-or-missing-steps</guid>
		</item>
		<item>
			<title>SPSS Setup file</title>
			<link>https://www.statalist.org/forums/forum/general-stata-discussion/general/1786238-spss-setup-file</link>
			<pubDate>Mon, 01 Jun 2026 16:21:36 GMT</pubDate>
			<description><![CDATA[I have downloaded several  
spss setup files.  Is there a straightforward way to open them in  
Stata?  The manual doesn't give me answer. 
 
Ric...]]></description>
			<content:encoded><![CDATA[I have downloaded several <br />
spss setup files.  Is there a straightforward way to open them in <br />
Stata?  The manual doesn't give me answer.<br />
<br />
Ric Uslaner<br />
<br />
<br />
<br />
<br />
<br />
stata?  I tried both Stat ]]></content:encoded>
			<category domain="https://www.statalist.org/forums/forum/general-stata-discussion/general">General</category>
			<dc:creator>Euslaner</dc:creator>
			<guid isPermaLink="true">https://www.statalist.org/forums/forum/general-stata-discussion/general/1786238-spss-setup-file</guid>
		</item>
		<item>
			<title>DRLATE: Module for doubly robust estimation of LATE and LATT</title>
			<link>https://www.statalist.org/forums/forum/general-stata-discussion/general/1786236-drlate-module-for-doubly-robust-estimation-of-late-and-latt</link>
			<pubDate>Mon, 01 Jun 2026 15:51:16 GMT</pubDate>
			<description>Hello Statalisters! I would like to announce that drlate, a Stata module for doubly robust estimation of the local average treatment effect (LATE)...</description>
			<content:encoded><![CDATA[Hello Statalisters! I would like to announce that drlate, a Stata module for doubly robust estimation of the local average treatment effect (LATE) and the local average treatment effect on the treated (LATT), is now available in SSC.<br />
<br />
drlate provides estimators of the local average treatment effect (LATE) and the local average treatment effect on the treated (LATT), including inverse-probability-weighted regression adjustment (IPWRA), inverse probability weighting (IPW), augmented inverse probability weighting (AIPW), and regression adjustment (RA). Outcome and treatment models can be specified as linear, logistic, or Poisson regressions. The instrument propensity score is specified as a logistic regression (logit) and can be estimated by maximum likelihood (ML), covariate balancing propensity score (CBPS), or inverse probability tilting (IPT). The instrument must be binary.<br />
<br />
The estimators implemented by drlate are described in the paper by Słoczyński, Uysal, and Wooldridge, &quot;Doubly Robust Estimation of Local Average Treatment Effects Using Inverse Probability Weighted Regression Adjustment,&quot; available here: <a href="https://urldefense.com/v3/__https:/arxiv.org/abs/2208.01300__;!!HXCxUKc!1JsBnCb5HMQzAC8whX_75RP7Ttlfy9TCXAVKn3MNAGG9RbtcObmEy28idOMpH5BZ_jDjvOfjSaDa1AFL2d4$" target="_blank">https://arxiv.org/abs/2208.01300</a>. We expect to have a new draft of this paper in the next few months.<br />
<br />
&quot;Dr. Late&quot; can provide a cure when the treatment is confounded, you want to allow heterogeneity, and you want to take functional form seriously!<br />
<br />
Let us know if you have comments or suggestions about the package.<br />
 ]]></content:encoded>
			<category domain="https://www.statalist.org/forums/forum/general-stata-discussion/general">General</category>
			<dc:creator>Jeff Wooldridge</dc:creator>
			<guid isPermaLink="true">https://www.statalist.org/forums/forum/general-stata-discussion/general/1786236-drlate-module-for-doubly-robust-estimation-of-late-and-latt</guid>
		</item>
		<item>
			<title>Unexpected result from test command used with xtgee</title>
			<link>https://www.statalist.org/forums/forum/general-stata-discussion/general/1786233-unexpected-result-from-test-command-used-with-xtgee</link>
			<pubDate>Sun, 31 May 2026 15:56:16 GMT</pubDate>
			<description>I am using xtgee to estimate risk ratios for a data set with repeated measures. I want to test an interaction between two factors, 1 with 2 levels...</description>
			<content:encoded><![CDATA[I am using xtgee to estimate risk ratios for a data set with repeated measures. I want to test an interaction between two factors, 1 with 2 levels (arm1) and the other with 3 levels (indication): i.e. 2 interaction parameters. I had expected that using the  test command would yield a chi-squared statistic (2df) equal to the difference between the chi-squared statistics for models with and without the interaction. That isn't what I get. The Wald chi(2) (5 df) for the model with the interaction = 10.21. The Wald chi(2) (3df) for the model without the interaction is 8.44 a difference of 1.77. The chi squared value produced by the the test command is 1.60. Can anyone explain the difference? Output below.<br />
<br />
<span style="font-family:courier new"> xtgee prim_outcome i.arm1##i.indication if arm1!=3, family(binomial) link(log) corr(exch) vce(robust) eform<br />
<br />
GEE population-averaged model                        Number of obs    =  6,749<br />
Group variable: participan~m                         Number of groups =  6,287<br />
Family: Binomial                                     Obs per group:  <br />
Link:   Log                                                       min =      1<br />
Correlation: exchangeable                                         avg =    1.1<br />
                                                                  max =      4<br />
                                                     Wald chi2(5)     =  10.21<br />
Scale parameter = 1                                  Prob &gt; chi2      = 0.0696<br />
<br />
                                                            (Std. err. adjusted for clustering on participantnum)<br />
-----------------------------------------------------------------------------------------------------------------<br />
                                                |             Semirobust<br />
                                   prim_outcome |     exp(b)   std. err.      z    P&gt;|z|     [95% conf. interval]<br />
------------------------------------------------+----------------------------------------------------------------<br />
                                           arm1 |<br />
                                    Dexa 4x6mg  |   .8965813   .1559133    -0.63   0.530     .6376286    1.260699<br />
                                                |<br />
                                     indication |<br />
           Preterm labor with intact membranes  |   .6763322   .1428367    -1.85   0.064     .4470871    1.023123<br />
                              Planned delivery  |   .9138782   .1248478    -0.66   0.510     .6992025    1.194466<br />
                                                |<br />
                                arm1#indication |<br />
Dexa 4x6mg#Preterm labor with intact membranes  |   1.168356   .3464885     0.52   0.600     .6533442    2.089335<br />
                   Dexa 4x6mg#Planned delivery  |   1.278983   .2505847     1.26   0.209     .8711508    1.877745<br />
                                                |<br />
                                          _cons |   .1040274   .0124361   -18.93   0.000     .0822981     .131494<br />
-----------------------------------------------------------------------------------------------------------------<br />
<br />
.     test 2.arm1#2.indication 2.arm1#3.indication<br />
<br />
 ( 1)  2.arm1#2.indication = 0<br />
 ( 2)  2.arm1#3.indication = 0<br />
<br />
           chi2(  2) =    1.60<br />
         Prob &gt; chi2 =    0.4493<br />
<br />
. xtgee prim_outcome i.arm1 i.indication if arm1!=3, family(binomial) link(log) corr(exch) vce(robust) eform<br />
<br />
GEE population-averaged model                        Number of obs    =  6,749<br />
Group variable: participan~m                         Number of groups =  6,287<br />
Family: Binomial                                     Obs per group:  <br />
Link:   Log                                                       min =      1<br />
Correlation: exchangeable                                         avg =    1.1<br />
                                                                  max =      4<br />
                                                     Wald chi2(3)     =   8.44<br />
Scale parameter = 1                                  Prob &gt; chi2      = 0.0378<br />
<br />
                                                 (Std. err. adjusted for clustering on participantnum)<br />
------------------------------------------------------------------------------------------------------<br />
                                     |             Semirobust<br />
                        prim_outcome |     exp(b)   std. err.      z    P&gt;|z|     [95% conf. interval]<br />
-------------------------------------+----------------------------------------------------------------<br />
                                arm1 |<br />
                         Dexa 4x6mg  |   1.083794   .0823057     1.06   0.289     .9339095    1.257734<br />
                                     |<br />
                          indication |<br />
Preterm labor with intact membranes  |   .7305089   .1081895    -2.12   0.034     .5464637    .9765394<br />
                   Planned delivery  |   1.035454   .1013354     0.36   0.722     .8547268    1.254394<br />
                                     |<br />
                               _cons |   .0945754    .009167   -24.33   0.000      .078212    .1143624<br />
------------------------------------------------------------------------------------------------------</span><br />
 ]]></content:encoded>
			<category domain="https://www.statalist.org/forums/forum/general-stata-discussion/general">General</category>
			<dc:creator>Simon Cousens</dc:creator>
			<guid isPermaLink="true">https://www.statalist.org/forums/forum/general-stata-discussion/general/1786233-unexpected-result-from-test-command-used-with-xtgee</guid>
		</item>
	</channel>
</rss>
