Simulating bivariate normal distributed data using conditional normal distribution

Adriaan Hoogendoorn

Join Date: Aug 2014

Posts: 11
#1

Simulating bivariate normal distributed data using conditional normal distribution

26 Oct 2017, 08:26

Dear all,

Although I am quite aware of Stata's powerful drawnorm function to generate bivariate normal distributed data, I tried using a property of bivariate normal distributions concerning the conditional distribution of X₂ given X₁= x₁.

If X₁ and X₂ have a bivariate normal distribution with means m₁and m₂, variances s₁² and s₂² and correlation r, then the conditional distribution of X₂ given X₁ = x₁ is itself normal distributed with mean = m₂ + r(s₂/s₁)(x₁ - m₁) and variance = (1 - r^₂)s₂² (see e.g. Bickel and Doksum, 1977, page 26). In the case that the marginal distributions are standard normal distributions (i.e. m₁ = m₂ = 0 and s₁ = s₂ = 1) this implies that the conditional distribution of X₂ given X₁ = x₁ is normal distributed with mean = rx₁and variance = 1 - r^₂.

However, when using the following code:

Code:

clear set seed 123 set obs 1000000 local rho=0.7 gen double x1 = rnormal(0,1) gen double x2 = rnormal(`rho' * x1, 1-`rho'^2) summ x* pwcorr x1 x2

I end up with X₂ having standard deviation of 0.87 (instead of 1) and X₁ and X₂having a correlation of 0.81 (instead of 0.7).
Am I missing something? Can anyone explain what is wrong here?

Kind regards, Adriaan Hoogendoorn

REFERENCE: Bickel, Peter J & Doksum, Kjell A (1977) Mathematical Statistics: Basic Ideas and Selected Topics, Holden-Day Inc., Oakland, California.
Tags: None
Clyde Schechter

Join Date: Apr 2014

Posts: 30117
#2

26 Oct 2017, 09:00

The second parameter of the -rnormal()- function is the SD, not the variance.
1 like
Comment

Adriaan Hoogendoorn

Join Date: Aug 2014
Posts: 11

26 Oct 2017, 09:09

Thank you, Clyde!

The code:

Code:

clear
set seed 123
set obs 1000000
local rho=0.7
gen double x1 = rnormal(0,1)
gen double x2 = rnormal(`rho' * x1, sqrt(1-`rho'^2))
summ x*
pwcorr x1 x2

works just fine!

Comment

Tiago Pereira

Join Date: Jan 2016

Posts: 389
#4

26 Oct 2017, 15:08

Is there a simple extension for the multivariate case? For example, I have x1 and want to generate x2, x3 and x4 with a specific correlation matrix.
Comment

German Rodriguez

Join Date: Feb 2017
Posts: 169

26 Oct 2017, 15:52

Yes, you can use the Cholesky decomposition R = CC' where C is lower-diagonal. Here is an example with three variables

Code:

 clear

. set seed 6201

. matrix define R = (1, 0.7, 0.5 \ 0.7, 1, 0.4 \ 0.5, 0.4, 1)

. matrix C = cholesky(R)

. matrix list C

C[3,3]
           c1         c2         c3
r1          1          0          0
r2         .7  .71414284          0
r3         .5    .070014  .86319062

. set obs 10000
number of observations (_N) was 0, now 10,000

. gen x1 = C[1,1] * rnormal(0,1)

. gen x2 = C[2,1] * x1 + C[2,2] * rnormal(0,1)

. gen x3 = C[3,1] * x1 + C[3,2] * x2  + C[3,3] * rnormal(0,1)

. corr x1 x2 x3
(obs=10,000)

             |       x1       x2       x3
-------------+---------------------------
          x1 |   1.0000
          x2 |   0.7009   1.0000
          x3 |   0.5320   0.4132   1.0000

Or you can do the same thing in Mata:

Code:

. mata:
------------------------------------------------- mata (type end to exit) ------
: R = (1, 0.7, 0.5 \ 0.7, 1, 0.4 \ 0.5,  0.4, 1)

: C = cholesky(R)

: rseed(3415)

: Z = rnormal(10000, 3, 0, 1)

: corr(variance(Z*C'))
[symmetric]
                 1             2             3
    +-------------------------------------------+
  1 |            1                              |
  2 |  .7025435966             1                |
  3 |  .5024883249   .3999212716             1  |
    +-------------------------------------------+

: end
--------------------------------------------------------------------------------

Added: when generating x1 the coefficient C[1,1] is 1 for correlation matrices, but I am leaving the general case for clarity.

Last edited by German Rodriguez; 26 Oct 2017, 16:22.

Comment

German Rodriguez

Join Date: Feb 2017
Posts: 169

26 Oct 2017, 17:17

A correction to the above post #5: the Stata code should be multiplying standard normals by C, just like the Mata code does. Here's a revised version

Code:

. set seed 3415

. set obs 10000
number of observations (_N) was 0, now 10,000

. matrix define R = (1, 0.7, 0.5 \ 0.7, 1, 0.4 \ 0.5, 0.4, 1)

. matrix C = cholesky(R)

. matrix list C

C[3,3]
           c1         c2         c3
r1          1          0          0
r2         .7  .71414284          0
r3         .5    .070014  .86319062

. set obs 10000
number of observations (_N) was 10,000, now 10,000

. gen z1 = rnormal(0,1)

. gen z2 = rnormal(0,1)

. gen z3 = rnormal(0,1)

. gen x1 = C[1,1] * z1

. gen x2 = C[2,1] * z1 + C[2,2] * z2

. gen x3 = C[3,1] * z1 + C[3,2] * z2  + C[3,3] * z3

. corr x1 x2 x3
(obs=10,000)

            |       x1       x2       x3
-------------+---------------------------
          x1 |   1.0000
          x2 |   0.7079   1.0000
          x3 |   0.5027   0.3982   1.0000

I was also wondering why the Stata and Mata simulations were not identical.I think Mata fills a matrix of random numbers by rows, because generating a 3 by 10000 matrix and then transposing (instead of generating a 10000 by 3 matrix) yields exactly the same result as Stata:

Code:

. mata:
------------------------------------------------- mata (type end to exit) ------
: rseed(3415)

: R = (1, 0.7, 0.5 \ 0.7, 1, 0.4 \ 0.5, 0.4, 1)

: C = cholesky(R)

: Z = rnormal(3, 10000, 0, 1)'

: corr(variance(Z*C'))
[symmetric]
                 1             2             3
    +-------------------------------------------+
  1 |            1                              |
  2 |  .7079454626             1                |
  3 |  .5026927404   .3982160631             1  |
    +-------------------------------------------+

: end
--------------------------------------------------------------------------------

I think all is well now.

Last edited by German Rodriguez; 26 Oct 2017, 17:46.

Comment

Tiago Pereira

Join Date: Jan 2016

Posts: 389
#7

26 Oct 2017, 19:44

Dear German,

Thank you so much for your tip. That will be extremely useful in my current work.

All the best,

Tiago
Comment
German Rodriguez

Join Date: Feb 2017

Posts: 169
#8

26 Oct 2017, 20:09

Glad you find it useful, Tiago. I went the Cholesky way because it is the easiest, but it is also possible to work with conditional distributions y1, y2|y1 and y3|y1,y2 (and so on). I put a note on this alternative approach on my website at http://data.princeton.edu/stata/conditionals.
Comment

Announcement