Hi everyone,
I am trying to find a way to detect 'strange' patterns in my data. I have a dataset of cellphone numbers and I want to drop the observations where the number looks odd. For instance, I would like to identify and flag observations in a variable called 'cellphone' that look like this:
1. 5555252525
2. 5555555555
3. 5515151515
4. 0123456789
In these four cases, the numbers are either too repetitive or follow a series, so it is highly unlikely that they are real phone numbers. All phone numbers in my dataset are 10 digits, and a regular phone number could look something like this: 5559912630.
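To make the two criteria concrete, here is a rough sketch in Python of the kind of check I have in mind. It is only an illustration, not tied to any particular statistics package. The function name `looks_suspicious`, the `max_distinct` threshold, and the three rules are assumptions chosen to match the examples above, and the number is assumed to be stored as a string so that a leading zero (as in 0123456789) is not lost.

```python
import re


def looks_suspicious(number: str, max_distinct: int = 2) -> bool:
    """Flag a 10-digit string that is too repetitive or follows a series.

    Hypothetical helper: the rules and the threshold are illustrative
    assumptions, not a validated definition of a fake phone number.
    """
    if not re.fullmatch(r"\d{10}", number):
        raise ValueError("expected a string of exactly 10 digits")
    digits = [int(c) for c in number]

    # Rule 1: very few distinct digits (e.g. 5555555555, 5515151515).
    if len(set(digits)) <= max_distinct:
        return True

    # Rule 2: a two-digit block repeated three or more times in a row
    # (e.g. the 252525 inside 5555252525).
    if re.search(r"(\d\d)\1{2,}", number):
        return True

    # Rule 3: every step between neighbouring digits is +1, or every
    # step is -1, counted modulo 10 (e.g. 0123456789, 9876543210).
    steps = {(b - a) % 10 for a, b in zip(digits, digits[1:])}
    return steps == {1} or steps == {9}


if __name__ == "__main__":
    for n in ["5555252525", "5555555555", "5515151515",
              "0123456789", "5559912630"]:
        print(n, looks_suspicious(n))
```

With these rules the four odd examples are flagged and 5559912630 is not. The rules are deliberately blunt, so a real number with only two distinct digits would also be flagged; flagged cases would probably need a manual look before dropping.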
Is there an easy way to do this? Is this even possible?
Thank you so much for your help on this!