Hi,
The dataset is based on workers. There are 4 variables.
year:2010-2020
nacio: nationalities
NPC_FIC: firms ID. (Repeating each year). We do not have a unique ID each year. There are around 500,000 first which repeats each year.
re_wage: Wages
There are different nationalities. I am going to create a variable (without dropping) that only shows the wages of specific nationality (PT). I need to know min/max/ mean and std.dev of this variable as well.
Any ideas apprecited.
Cheers,
Paris
The dataset is based on workers. There are 4 variables.
year:2010-2020
nacio: nationalities
NPC_FIC: firms ID. (Repeating each year). We do not have a unique ID each year. There are around 500,000 first which repeats each year.
re_wage: Wages
There are different nationalities. I am going to create a variable (without dropping) that only shows the wages of specific nationality (PT). I need to know min/max/ mean and std.dev of this variable as well.
Code:
input int year str2 nacio long NPC_FIC float re_wage 2010 "PT" 501195373 237.79385 2010 "PT" 500996349 745.1175 2010 "PT" 501112968 234.783 2010 "UK" 501087261 784.1953 2011 "PT" 501101640 578.58044 2011 "UK" 501052779 456.3653 2011 "PT" 503188955 268.7161 2012 "GW" 501165899 479.83725 2012 "BR" 503249542 720.6148 2012 "PT" 501384409 80.37975 2013 "BR" 503357509 293.8517 2013 "PT" 504103455 628.8788 2014 "PT" 501101765 198.55334 2014 "US" 501052779 440.7233 2014 "PT" 502622516 574.46655 2015 "PT" 501204126 331.4828 2015 "US" 501297955 356.9078 2016 "IR" 502910365 686.1664 2019"PT" 501081112 636.4105 2020 "CN" 503184507 629.1139 2020 "PT" 501139334 105.11755 2020 "PT" 501129929 344.4485 2020 "SP" 501192139 722.0615 2020 "PT" 501130463 726.3924
Any ideas apprecited.
Cheers,
Paris
Comment