Hi everyone,
I'm currently trying to fit a generalized beta distribution for every individuals that I have in a survey in order to take interquartile range of the responses as proxy for uncertainty levels. The formulation derives from a famous paper of Engelberg and Manski (2009) that is called "Comparing the Point Predictions and Subjective Probability Distributions of Professional Forecasters", widely used in the surveys of the fed and the ecb.
I was wondering for a way to derive the distribution's percentiles directly from STATA, that is not something that I found up to now (maybe I'm wrong).
In particular, the distribution is fitted on the outcomes that individuals give to a probabilistic question where they can attach different probability (summed up to 100) from 0 to 100 to different bins of inflation expectations, in my case I have:
\[
\min_{\substack{a > 1,\, b > 1,\\ l > L,\, r > r}} \sum_{i=1}^{8} \left[ B(x_i; a, b, l, r) - F(x_i) \right]^2
\]
The distribution parameters (a,b,l,r)(a, b, l, r)(a,b,l,r) are estimated by minimizing the squared distance between the cumulative beta distribution and the respondent’s empirical distribution over the 8 bins.
if you have any idea on how to proceed, or if you know whether someone already implemented this methodology on STATA, I would be very glad.
Thanks,
best regards,
Riccardo
I'm currently trying to fit a generalized beta distribution for every individuals that I have in a survey in order to take interquartile range of the responses as proxy for uncertainty levels. The formulation derives from a famous paper of Engelberg and Manski (2009) that is called "Comparing the Point Predictions and Subjective Probability Distributions of Professional Forecasters", widely used in the surveys of the fed and the ecb.
I was wondering for a way to derive the distribution's percentiles directly from STATA, that is not something that I found up to now (maybe I'm wrong).
In particular, the distribution is fitted on the outcomes that individuals give to a probabilistic question where they can attach different probability (summed up to 100) from 0 to 100 to different bins of inflation expectations, in my case I have:
- Increase by 8% or more
- Increase by 4% to less than 8%
- Increase by 2% to less than 4%
- Increase by 0% to less than 2%
- Decrease by more than 0% to less than 2%
- Decrease by 2% to less than 4%
- Decrease by 4% to less than 8%
- Decrease by 8% or more
\[
\min_{\substack{a > 1,\, b > 1,\\ l > L,\, r > r}} \sum_{i=1}^{8} \left[ B(x_i; a, b, l, r) - F(x_i) \right]^2
\]
The distribution parameters (a,b,l,r)(a, b, l, r)(a,b,l,r) are estimated by minimizing the squared distance between the cumulative beta distribution and the respondent’s empirical distribution over the 8 bins.
- a,b>1 are shape parameters.
- l,r are location parameters, constrained by lower (L) and upper (R) bounds if the outer bins are selected (i.e., bins 1 or 8).
- F(x_i) represents the cumulative empirical distribution based on the respondent’s probability allocations.
- If neither bin 1 nor bin 8 is used → only a and b are estimated; l and r are fixed.
- If one of bin 1 or 8 is used → estimate a,b and the corresponding location parameter.
- If both outer bins are used → all four parameters (a,b,l,r) are estimated.
- Initial parameter guesses: a=b=2, l=−12, r=12
- Constraints: L=−38, R=+38
if you have any idea on how to proceed, or if you know whether someone already implemented this methodology on STATA, I would be very glad.
Thanks,
best regards,
Riccardo
Comment