ci for categorial variables - tabci?

Marcos Almeida

Join Date: Apr 2014

Posts: 4047
#1

ci for categorial variables - tabci?

30 Sep 2019, 09:55

I'm just wondering why Stata gives CIs for binary variables - ci proportions varname - but, as far as I tried to find, we cannot have results for categorical variables.

Code:

. sysuse auto (1978 Automobile Data) . ci proportions foreign -- Binomial Exact -- Variable | Obs Proportion Std. Err. [95% Conf. Interval] -------------+--------------------------------------------------------------- foreign | 74 .2972973 .0531331 .196584 .4148353 . ci proportions rep78 no binary (0/1) variables found; nothing to compute

Am I missing something, is it technically impossible or we just don't have a user-written program to estimate these proportions with CIs in Stata?

If it can be done, and considering (and maybe I'm wrong) we don't have it so far, maybe this thread could foster somebody to create, say, a SSC (or SJ) - tabci - ado file.

Best regards,

Marcos
Tags: None

1 like

Carlo Lazzaro

Join Date: Apr 2014
Posts: 17682

30 Sep 2019, 12:04

Marcos:
I think that -rep78- takes on positive numeric integers only (ie, the Repair Record during 1978).
Hence, I would calculate the 95% CI for the mean via:

Code:

ci means rep78, poisson
. ci means rep78, poisson

                                                         -- Poisson  Exact --
    Variable |   Exposure        Mean    Std. Err.       [95% Conf. Interval]
-------------+---------------------------------------------------------------
       rep78 |         69    3.405797    .2221697        2.984238    3.870221

Kind regards,
Carlo
(Stata 19.0)

Comment

Nick Cox

Join Date: Mar 2014

Posts: 35481
#3

30 Sep 2019, 12:12

Marcos Almeida I think this comes under at least one heading, multinomial confidence intervals, and many methods have been suggested, but fewer (no???) Stata implementations.
1 like
Comment
Mike Lacy

Join Date: Apr 2014

Posts: 2405
#4

30 Sep 2019, 19:13

*If* (a big if) you satisfied with CIs for each category's proportion treated as independent of the others, you can run -mlogit- without any explanatory variables, and follow it with -margins-. This amounts to just a convenient way to get the asymptotic CIs you would get from running -ci proportions- on a binary variable for each category.

Last edited by Mike Lacy; 30 Sep 2019, 19:15.
2 likes
Comment
Carlo Lazzaro

Join Date: Apr 2014

Posts: 17682
#5

01 Oct 2019, 01:06

Marcos:
if you did not come across it yet, there's something on the topic you're interested in at: https://www.statalist.org/forums/for...al-proportions

Kind regards,
Carlo
(Stata 19.0)
1 like
Comment
Marcos Almeida

Join Date: Apr 2014

Posts: 4047
#6

01 Oct 2019, 05:03

Nick Cox Carlo Lazzaro Mike Lacy : thank you for the remarks and suggestions.

Hopefully we'll have a Stata implementation concerning this issue.

Best regards,

Marcos
Comment
Mike Lacy

Join Date: Apr 2014

Posts: 2405
#7

01 Oct 2019, 15:24

I tried bootstrapping this problem, which gave me essentially the same results as -mlogit-. I do know there are various fancier solutions to this, per Nick's comment, which I should think would give different results than mlogit, but I'd think of the bootstrap as the gold standard. Is there anything conceptually wrong with just bootstrapping the following?

Code:

clear cap prog drop mp prog mp, rclass syntax varlist tempname f quiet tab `varlist', matcell(`f') mat `f' = `f'/r(N) forval i = 1/`=rowsof(`f')' { return scalar p`i' = `f'[`i',1] } end
Comment

Announcement

ci for categorial variables - tabci?

Comment

Comment

Comment

Comment

Comment

Comment