Why doesn’t the logical operator != (not) work in if-expression?

Minna Tuominen

Join Date: Sep 2023

Posts: 5
#1

Why doesn’t the logical operator != (not) work in if-expression?

09 Sep 2023, 02:55

I am using European Social Survey data (wave 9) and working on the variable ‘wkhtot’ (with 41,630 valid observations) measuring average weekly working hours. The variable includes 419 observations with weekly working hours greater than 80.

I want to create a copy of the original variable replacing all extreme values with 80, excluding those with missing values. To build a new variable, I ran the following commands:

gen wkhtot_mod = wkhtot
replace wkhtot_mod = 80 if (wkhtot > 80 & wkhtot != . )

As a result, Stata replaced all extreme values AND the missing values with 80. Why is that?

I did realise that I can reach the intended result with the following:

replace wkhtot_mod = 80 if (wkhtot > 80 & wkhtot < . )

I would still like to understand, why didn't the != (not) operator work in my first attempt.

I would appreciate any insight on this.

Last edited by Minna Tuominen; 09 Sep 2023, 03:30.
Tags: None
Andrew Musau

Join Date: Oct 2014

Posts: 10178
#2

09 Sep 2023, 03:25

gen wkhtot_mod = wkhtot
replace wkhtot_mod = 80 if (wkhtot > 80 & wkhtot != . )

As a result, Stata replaces all extreme values AND the missing values with 80. Why is that?

Extreme values if meaning values greater than 80 are replaced because you asked for that:

Code:

... if (wkhtot > 80 ...

But you have to present a data example that shows that missing values are replaced. Unless you are referring to extended missing values (.a, .b, ..., .z) which are different from system missing values. See

Code:

help missing
Comment
Minna Tuominen

Join Date: Sep 2023

Posts: 5
#3

09 Sep 2023, 03:49

Dear Andrew,

Many thanks for the quick response. Indeed, I have both system missing (.) and extended missing values (.a, .b, .c etc.) in the data. I have not realized before that the extended missing values (.a, .b, .c etc.) would behave differently from the system missings (.). I read the help-file, but still I cannot see why they all do not obey the same commands.

I wonder, if I wanted to apply the operator != on the extended missing case, how should I then write my command?

Last edited by Minna Tuominen; 09 Sep 2023, 03:52.
Comment
Andrew Musau

Join Date: Oct 2014

Posts: 10178
#4

09 Sep 2023, 04:10

The help file is clear as far as I am concerned. Two points:

3. The ordering of missing values is

all nonmissing numbers < . < .a < .b < ... < .z

and

The relational operators (see operators) interpret missing values as large positive numbers (see above). All the following thus evaluate to true

73 < .
. == .
.a == .a
.a != .
.a < .b
.a <= .b

Here, it is explicit that extended missing values are not equal to system missing. These are distinct.

I wonder, if I wanted to apply the operator != on the extended missing case, how should I then write my command?

You will have to enumerate them.

Code:

local list foreach l in `c(alpha)'{ local list `list' & wkhtot !=.`l' } display "`list'"

Res.:

Code:

. display "`list'" & wkhtot !=.a & wkhtot !=.b & wkhtot !=.c & wkhtot !=.d & wkhtot !=.e & wkhtot !=.f & wkhtot !=.g & wkhtot !=.h & wkhtot !=.i & wkht > ot !=.j & wkhtot !=.k & wkhtot !=.l & wkhtot !=.m & wkhtot !=.n & wkhtot !=.o & wkhtot !=.p & wkhtot !=.q & wkhtot !=.r & wkhtot ! > =.s & wkhtot !=.t & wkhtot !=.u & wkhtot !=.v & wkhtot !=.w & wkhtot !=.x & wkhtot !=.y & wkhtot !=.z

So it is apparent that it is just easier to specify

& wkhtot < .

Last edited by Andrew Musau; 09 Sep 2023, 04:24.
1 like
Comment
Minna Tuominen

Join Date: Sep 2023

Posts: 5
#5

09 Sep 2023, 04:33

Thank you for the clarification. It's really the first time I run into this kind of a problem. I appreciate your patience!
Comment
Andrew Musau

Join Date: Oct 2014

Posts: 10178
#6

09 Sep 2023, 05:19

Another way to capture all missing values (system missing and extended missing) is to use the -missing()- function.

Code:

help missing()

So I often write

Code:

if !missing(var1)

instead of

Code:

if var1!=.

In your case:

Code:

gen wkhtot_mod = wkhtot replace wkhtot_mod = 80 if (wkhtot > 80 & !missing(wkhtot))
1 like
Comment

Announcement

Why doesn’t the logical operator != (not) work in if-expression?

Comment

Comment

Comment

Comment

Comment