  • A question about matching observations when merging data

    In my master dataset, observations have no values for the "dq" variable; in my using dataset they do. I want to copy the "dq" values from observations in the using dataset to the corresponding observations in the master dataset. I cannot do this automatically with the merge command (merge m:1 B0562 B0563 B0564 using "*.dta") because corresponding observations in the two datasets match only partially on at least one of the three variables "B0562", "B0563", and "B0564". For example, in the following data sample, I want the value of "dq" in the 48th and 98th observations to equal the value of "dq" in the 22nd observation.

    Is my question clear enough? Many thanks!

    Code:
    * Example generated by -dataex-. To install: ssc install dataex
    clear
    input str24 B0562 str33 B0563 str45 B0564 str6 dq byte _merge
    "上海市" "县"                            "县"                            "310200" 2
    "上海市" "市辖区"                      "市辖区"                      "310100" 2
    "上海市" "上海市"                      "上海市"                      "310000" 2
    "云南"    "玉溪"                         "元江"                         ""       1
    "云南"    "昆明"                         "官渡"                         ""       1
    "云南"    "昆明市"                      "五华区"                      ""       1
    "云南"    "玉溪"                         "元江"                         ""       1
    "云南"    "玉溪"                         "元江"                         ""       1
    "云南"    "昆明市"                      "五华区"                      ""       1
    "云南"    "玉溪"                         "元江"                         ""       1
    "云南"    "玉溪"                         "元江"                         ""       1
    "云南"    "昆明"                         "官渡"                         ""       1
    "云南"    "玉溪"                         "元江"                         ""       1
    "云南省" "玉溪市"                      "新平县"                      ""       1
    "云南省" "临沧市"                      "临沧市"                      "530900" 2
    "云南省" "红河州"                      "个旧市"                      ""       1
    "云南省" "楚雄州"                      "楚雄市"                      ""       1
    "云南省" "昆明市"                      "禄劝县"                      ""       1
    "云南省" "红河州"                      "开远市"                      ""       1
    "云南省" "红河州"                      "石屏县"                      ""       1
    "云南省" "思茅市"                      "澜沧拉祜族自治县"       "530828" 2
    "云南省" "思茅市"                      "江城县"                      ""       1
    "云南省" "红河州"                      "建水县"                      ""       1
    "云南省" "红河哈尼族彝族自治州" "个旧市"                      "532501" 2
    "云南省" "玉溪市"                      "峨山县"                      ""       1
    "云南省" "红河州"                      "开远市"                      ""       1
    "云南省" "楚雄州"                      "楚雄市"                      ""       1
    "云南省" "西双版纳州"                "景洪市"                      ""       1
    "云南省" "红河州"                      "河口县"                      ""       1
    "云南省" "昆明市"                      "市辖区"                      "530101" 2
    "云南省" "德宏州"                      "潞西市"                      ""       1
    "云南省" "德宏州"                      "盈江县"                      ""       1
    "云南省" "文山州"                      "砚山县"                      ""       1
    "云南省" "红河州"                      "弥勒县"                      ""       1
    "云南省" "红河州"                      "弥勒县"                      ""       1
    "云南省" "文山壮族苗族自治州"    "文山壮族苗族自治州"    "532600" 2
    "云南省" "玉溪市"                      "元江县"                      ""       1
    "云南省" "昆明市"                      "禄劝县"                      ""       1
    "云南省" "楚雄州"                      "牟定县"                      ""       1
    "云南省" "德宏傣族景颇族自治州" "德宏傣族景颇族自治州" "533100" 2
    "云南省" "文山州"                      "马关县"                      ""       1
    "云南省" "红河州"                      "建水县"                      ""       1
    "云南省" "迪庆州"                      "香格里拉县"                ""       1
    "云南省" "楚雄州"                      "禄丰县"                      ""       1
    "云南省" "西双版纳州"                "景洪市"                      ""       1
    "云南省" "西双版纳州"                "景洪市"                      ""       1
    "云南省" "红河州"                      "弥勒县"                      ""       1
    "云南省" "思茅市"                      "澜沧县"                      ""       1
    "云南省" "红河州"                      "个旧市"                      ""       1
    "云南省" "红河州"                      "个旧市"                      ""       1
    "云南省" "文山州"                      "砚山县"                      ""       1
    "云南省" "保山市"                      "保山市"                      "530500" 2
    "云南省" "玉溪市"                      "新平县"                      ""       1
    "云南省" "德宏州"                      "盈江县"                      ""       1
    "云南省" "红河州"                      "河口县"                      ""       1
    "云南省" "文山州"                      "麻栗坡县"                   ""       1
    "云南省" "昆明市"                      "寻甸县"                      ""       1
    "云南省" "楚雄州"                      "楚雄市"                      ""       1
    "云南省" "红河州"                      "开远市"                      ""       1
    "云南省" "文山州"                      "麻栗坡县"                   ""       1
    "云南省" "临沧市"                      "双江县"                      ""       1
    "云南省" "思茅市"                      "西盟县"                      ""       1
    "云南省" "思茅市"                      "景谷县"                      ""       1
    "云南省" "思茅市"                      "西盟县"                      ""       1
    "云南省" "红河州"                      "建水县"                      ""       1
    "云南省" "德宏州"                      "潞西市"                      ""       1
    "云南省" "临沧市"                      "耿马县"                      ""       1
    "云南省" "怒江州"                      "泸水县"                      ""       1
    "云南省" "楚雄州"                      "武定县"                      ""       1
    "云南省" "西双版纳州"                "勐腊县"                      ""       1
    "云南省" "怒江傈僳族自治州"       "怒江傈僳族自治州"       "533300" 2
    "云南省" "临沧市"                      "双江县"                      ""       1
    "云南省" "思茅市"                      "墨江县"                      ""       1
    "云南省" "文山州"                      "砚山县"                      ""       1
    "云南省" "红河州"                      "泸西县"                      ""       1
    "云南省" "昆明市"                      "寻甸回族彝族自治县"    "530129" 2
    "云南省" "红河州"                      "蒙自县"                      ""       1
    "云南省" "德宏州"                      "梁河县"                      ""       1
    "云南省" "昆明市"                      "寻甸县"                      ""       1
    "云南省" "临沧市"                      "耿马县"                      ""       1
    "云南省" "思茅市"                      "景东县"                      ""       1
    "云南省" "楚雄州"                      "双柏县"                      ""       1
    "云南省" "文山州"                      "麻栗坡县"                   ""       1
    "云南省" "楚雄州"                      "禄丰县"                      ""       1
    "云南省" "大理白族自治州"          "巍山县"                      ""       1
    "云南省" "红河州"                      "蒙自县"                      ""       1
    "云南省" "红河州"                      "蒙自县"                      ""       1
    "云南省" "玉溪市"                      "新平县"                      ""       1
    "云南省" "文山州"                      "砚山县"                      ""       1
    "云南省" "昆明市"                      "寻甸县"                      ""       1
    "云南省" "楚雄州"                      "楚雄市"                      ""       1
    "云南省" "楚雄彝族自治州"          "楚雄市"                      "532301" 2
    "云南省" "红河州"                      "金平县"                      ""       1
    "云南省" "文山州"                      "砚山县"                      ""       1
    "云南省" "红河州"                      "弥勒县"                      ""       1
    "云南省" "文山州"                      "马关县"                      ""       1
    "云南省" "德宏州"                      "潞西市"                      ""       1
    "云南省" "思茅市"                      "澜沧县"                      ""       1
    "云南省" "楚雄州"                      "姚安县"                      ""       1
    "云南省" "楚雄彝族自治州"          "大姚县"                      "532326" 2
    end
    label values _merge _merge
    label def _merge 1 "master only (1)", modify
    label def _merge 2 "using only (2)", modify

  • #2
    So it seems that your command, which requires identical matching on B0562, B0563, and B0564, is too strict. But it isn't clear to me what more lenient criterion you want. The example you give is unhelpful in several ways:

    1. You refer to wanting to apply the value of dq in observation 22 to observations 48 and 98, but dq is missing in observation 22.
    Code:
    . list if inlist(_n, 22, 48, 98)
    
         +-------------------------------------------------+
         |  B0562    B0563    B0564   dq            _merge |
         |-------------------------------------------------|
     22. | 云南省   思茅市   江城县        master only (1) |
     48. | 云南省   思茅市   澜沧县        master only (1) |
     98. | 云南省   思茅市   澜沧县        master only (1) |
         +-------------------------------------------------+
    2. You don't say what pattern(s) of agreement among B0562, B0563, and B0564 between the two data sets should be allowed to result in dq being shared. This is absolutely critical to resolving your problem: without a clear statement of what you do want nobody can help you.

    3. You show the results of your -merge- operation, but you don't show the two example data sets separately. I suppose I could reconstruct them based on the value of the _merge variable, but it would be better if you provided them directly.
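
    The reconstruction mentioned in point 3 could be sketched roughly as follows. This assumes the -dataex- example from #1 is in memory and contains no matched observations (_merge == 3), which is true of the sample shown; the file names master_example and using_example are hypothetical.

    ```stata
    * Sketch: split the merged example back into its two source datasets.
    * File names are hypothetical placeholders.
    preserve
    keep if _merge == 1          // master-only observations
    drop dq _merge               // dq is empty for every master-only row
    save master_example, replace
    restore

    keep if _merge == 2          // using-only observations
    drop _merge
    save using_example, replace
    ```

    If the merged data did contain matched observations, those rows would belong in both files, so the two -keep if- conditions would need to include _merge == 3 as well.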



    • #3
      Thank you very much, and sorry for the lack of clarity! In my master dataset, B0562, B0563, and B0564 are the Chinese names of the province, prefecture, and county, respectively. In my using dataset, there is a fourth variable, "dq", a six-digit code whose first two digits denote the province, middle two digits the prefecture, and last two digits the county. I want to create three variables in the master dataset: one corresponding to B0562 (the Chinese province name) that takes the first two digits of "dq" followed by four zeros; one corresponding to B0563 (the Chinese prefecture name) that takes the first four digits of "dq" followed by two zeros; and one corresponding to B0564 (the Chinese county name) that takes the full value of "dq". The difficulty is that the Chinese names in the master dataset are not standardized and cannot be matched automatically with those in the using dataset, where the names are standard.
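
      Once "dq" has been attached to the master observations, the three code variables described above could be built along these lines. This is only a sketch: the names prov_code, pref_code, and county_code are hypothetical, and it assumes "dq" is a six-character string, as in the using sample.

      ```stata
      * Sketch: derive province-, prefecture-, and county-level codes from dq.
      * Variable names are hypothetical; dq is assumed to be str6.
      gen str6 prov_code   = substr(dq, 1, 2) + "0000" if dq != ""
      gen str6 pref_code   = substr(dq, 1, 4) + "00"   if dq != ""
      gen str6 county_code = dq
      ```

      For example, a dq of "530828" would give "530000", "530800", and "530828" respectively. Observations that never received a dq stay empty in all three variables.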

      Data sample from the master dataset as follows:
      Code:
      * Example generated by -dataex-. To install: ssc install dataex
      clear
      input str24 B0562 str33 B0563 str45 B0564
      "天津"       ""             "宁河县"            
      "天津"       ""             "武清区"            
      "河北省"    "邢台市"    "临西县"            
      "河北省"    "保定市"    "阜平县"            
      "河南省"    "平顶山市" "新华区"            
      "河北省"    "沧州市"    "南皮县"            
      "河北省"    "衡水市"    "深州市"            
      "河北省"    "衡水市"    "故城县"            
      "辽宁省"    "鞍山市"    "台安县"            
      "辽宁省"    "鞍山市"    "岫岩县"            
      "辽宁省"    "抚顺市"    "清原满族自治县"
      "辽宁省"    "丹东市"    "振兴区"            
      "辽宁省"    "营口市"    "老边区"            
      "辽宁省"    "营口市"    "盖州市"            
      "辽宁省"    "辽阳市"    "太子河区"         
      "辽宁省"    "辽阳市"    "辽阳县"            
      "辽宁省"    "铁岭市"    "西丰县"            
      "辽宁省"    "盘锦市"    "大洼县"            
      "河南省"    "南阳市"    "社旗县"            
      "辽宁省"    "沈阳市"    "苏家屯区"         
      "辽宁省"    "沈阳市"    "东陵区"            
      "辽宁省"    "沈阳市"    "东陵区"            
      "辽宁省"    "沈阳市"    "东陵区"            
      "辽宁省"    "沈阳市"    "于洪区"            
      "辽宁省"    "沈阳市"    "铁西区"            
      "辽宁省"    "沈阳市"    "新城子区"         
      "辽宁省"    "沈阳市"    "东陵区"            
      "辽宁省"    "沈阳市"    "于洪区"            
      "辽宁省"    "沈阳市"    "于洪区"            
      "辽宁省"    "沈阳市"    "大东区"            
      "辽宁省"    "沈阳市"    "于洪区"            
      "辽宁省"    "沈阳市"    "铁西区"            
      "辽宁省"    "沈阳市"    "于洪区"            
      "辽宁省"    "沈阳市"    "新城子区"         
      "辽宁省"    "沈阳市"    "新城子区"         
      "辽宁省"    "沈阳市"    "东陵区"            
      "辽宁省"    "沈阳市"    "于洪区"            
      "辽宁省"    "沈阳市"    "东陵区"            
      "辽宁省"    "沈阳市"    "辽中县"            
      "辽宁省"    "沈阳市"    "于洪区"            
      "辽宁省"    "沈阳市"    "铁西区"            
      "河南省"    "平顶山市" "新华区"            
      "辽宁省"    "沈阳市"    "东陵区"            
      "辽宁省"    "沈阳市"    "大东区"            
      "辽宁省"    "沈阳市"    "于洪区"            
      "辽宁省"    "沈阳市"    "东陵区"            
      "辽宁省"    "沈阳市"    "于洪区"            
      "辽宁省"    "沈阳市"    "于洪区"            
      "辽宁省"    "沈阳市"    "东陵区"            
      "辽宁省"    "沈阳市"    "东陵区"            
      "辽宁省"    "沈阳市"    "新城子区"         
      "辽宁省"    "沈阳市"    "于洪区"            
      "辽宁省"    "沈阳市"    "东陵区"            
      "辽宁省"    "沈阳市"    "东陵区"            
      "辽宁省"    "沈阳市"    "和平区"            
      "辽宁省"    "沈阳市"    "皇姑区"            
      "辽宁省"    "沈阳市"    "于洪区"            
      "辽宁省"    "沈阳市"    "和平区"            
      "辽宁省"    "沈阳市"    "东陵区"            
      "辽宁省"    "沈阳市"    "于洪区"            
      "辽宁省"    "沈阳市"    "于洪区"            
      "辽宁省"    "沈阳市"    "东陵区"            
      "辽宁省"    "沈阳市"    "苏家屯区"         
      "辽宁省"    "沈阳市"    "于洪区"            
      "辽宁省"    "沈阳市"    "东陵区"            
      "辽宁省"    "沈阳市"    "东陵区"            
      "河北省"    "衡水市"    "深州市"            
      "黑龙江省" "大庆市"    "肇源县"            
      "黑龙江省" "大庆市"    "肇州县"            
      "黑龙江省" "牡丹江市" "海林市"            
      "黑龙江省" "哈尔滨市" "五常市"            
      "黑龙江省" "哈尔滨市" "延寿县"            
      "黑龙江省" "黑河市"    "逊克县"            
      "黑龙江省" "哈尔滨市" "双城市"            
      "黑龙江省" "绥化市"    "明水县"            
      "黑龙江省" "鸡西市"    "密山市"            
      "河南省"    "商丘市"    "梁园区"            
      "河南省"    "洛阳市"    "廛河回族区"      
      "黑龙江省" "哈尔滨市" "南岗区"            
      "浙江省"    "温州市"    "平阳县"            
      "浙江省"    "衢州市"    "江山市"            
      "安徽省"    "安庆市"    "宿松县"            
      "安徽省"    "六安市"    "舒城县"            
      "安徽省"    "宣城市"    "郎溪县"            
      "安徽省"    "宣城市"    "广德县"            
      "福建"       "漳州市"    "南靖县"            
      "山东省"    "烟台市"    "莱阳市"            
      "辽宁省"    "沈阳市"    "于洪区"            
      "河南省"    "郑州市"    "荥阳市"            
      "河南省"    "郑州市"    "新密市"            
      "河南省"    "郑州市"    "新密市"            
      "河南省"    "新乡市"    "原阳县"            
      "河南省"    "新乡市"    "长垣县"            
      "河南省"    "焦作市"    "博爱县"            
      "河南省"    "焦作市"    "武陟县"            
      "河南省"    "漯河市"    "临颖县"            
      "河南省"    "商丘市"    "民权县"            
      "河南省"    "驻马店市" "泌阳县"            
      "河南省"    "驻马店市" "泌阳县"            
      "河南省"    "信阳市"    "罗山县"            
      end
      Data sample from the using dataset as follows:
      Code:
      * Example generated by -dataex-. To install: ssc install dataex
      clear
      input str6 dq str24 B0562 str33 B0563 str45 B0564
      "310000" "上海市" "上海市"                      "上海市"                                    
      "310200" "上海市" "县"                            "县"                                          
      "310230" "上海市" "县"                            "崇明县"                                    
      "310119" "上海市" "市辖区"                      "南汇区"                                    
      "310103" "上海市" "市辖区"                      "卢湾区"                                    
      "310114" "上海市" "市辖区"                      "嘉定区"                                    
      "310120" "上海市" "市辖区"                      "奉贤区"                                    
      "310113" "上海市" "市辖区"                      "宝山区"                                    
      "310100" "上海市" "市辖区"                      "市辖区"                                    
      "310104" "上海市" "市辖区"                      "徐汇区"                                    
      "310107" "上海市" "市辖区"                      "普陀区"                                    
      "310110" "上海市" "市辖区"                      "杨浦区"                                    
      "310117" "上海市" "市辖区"                      "松江区"                                    
      "310115" "上海市" "市辖区"                      "浦东新区"                                 
      "310109" "上海市" "市辖区"                      "虹口区"                                    
      "310116" "上海市" "市辖区"                      "金山区"                                    
      "310105" "上海市" "市辖区"                      "长宁区"                                    
      "310112" "上海市" "市辖区"                      "闵行区"                                    
      "310108" "上海市" "市辖区"                      "闸北区"                                    
      "310118" "上海市" "市辖区"                      "青浦区"                                    
      "310106" "上海市" "市辖区"                      "静安区"                                    
      "310101" "上海市" "市辖区"                      "黄浦区"                                    
      "530900" "云南省" "临沧市"                      "临沧市"                                    
      "530902" "云南省" "临沧市"                      "临翔区"                                    
      "530922" "云南省" "临沧市"                      "云县"                                       
      "530921" "云南省" "临沧市"                      "凤庆县"                                    
      "530925" "云南省" "临沧市"                      "双江拉祜族佤族布朗族傣族自治县"
      "530901" "云南省" "临沧市"                      "市辖区"                                    
      "530923" "云南省" "临沧市"                      "永德县"                                    
      "530927" "云南省" "临沧市"                      "沧源佤族自治县"                        
      "530926" "云南省" "临沧市"                      "耿马傣族佤族自治县"                  
      "530924" "云南省" "临沧市"                      "镇康县"                                    
      "530700" "云南省" "丽江市"                      "丽江市"                                    
      "530723" "云南省" "丽江市"                      "华坪县"                                    
      "530702" "云南省" "丽江市"                      "古城区"                                    
      "530724" "云南省" "丽江市"                      "宁蒗彝族自治县"                        
      "530701" "云南省" "丽江市"                      "市辖区"                                    
      "530722" "云南省" "丽江市"                      "永胜县"                                    
      "530721" "云南省" "丽江市"                      "玉龙纳西族自治县"                     
      "530000" "云南省" "云南省"                      "云南省"                                    
      "530500" "云南省" "保山市"                      "保山市"                                    
      "530501" "云南省" "保山市"                      "市辖区"                                    
      "530521" "云南省" "保山市"                      "施甸县"                                    
      "530524" "云南省" "保山市"                      "昌宁县"                                    
      "530522" "云南省" "保山市"                      "腾冲县"                                    
      "530502" "云南省" "保山市"                      "隆阳区"                                    
      "530523" "云南省" "保山市"                      "龙陵县"                                    
      "532929" "云南省" "大理白族自治州"          "云龙县"                                    
      "532931" "云南省" "大理白族自治州"          "剑川县"                                    
      "532926" "云南省" "大理白族自治州"          "南涧彝族自治县"                        
      "532901" "云南省" "大理白族自治州"          "大理市"                                    
      "532900" "云南省" "大理白族自治州"          "大理白族自治州"                        
      "532924" "云南省" "大理白族自治州"          "宾川县"                                    
      "532927" "云南省" "大理白族自治州"          "巍山彝族回族自治县"                  
      "532925" "云南省" "大理白族自治州"          "弥渡县"                                    
      "532928" "云南省" "大理白族自治州"          "永平县"                                    
      "532930" "云南省" "大理白族自治州"          "洱源县"                                    
      "532922" "云南省" "大理白族自治州"          "漾濞彝族自治县"                        
      "532923" "云南省" "大理白族自治州"          "祥云县"                                    
      "532932" "云南省" "大理白族自治州"          "鹤庆县"                                    
      "533100" "云南省" "德宏傣族景颇族自治州" "德宏傣族景颇族自治州"               
      "533122" "云南省" "德宏傣族景颇族自治州" "梁河县"                                    
      "533103" "云南省" "德宏傣族景颇族自治州" "潞西市"                                    
      "533102" "云南省" "德宏傣族景颇族自治州" "瑞丽市"                                    
      "533123" "云南省" "德宏傣族景颇族自治州" "盈江县"                                    
      "533124" "云南省" "德宏傣族景颇族自治州" "陇川县"                                    
      "533325" "云南省" "怒江傈僳族自治州"       "兰坪白族普米族自治县"               
      "533300" "云南省" "怒江傈僳族自治州"       "怒江傈僳族自治州"                     
      "533321" "云南省" "怒江傈僳族自治州"       "泸水县"                                    
      "533323" "云南省" "怒江傈僳族自治州"       "福贡县"                                    
      "533324" "云南省" "怒江傈僳族自治州"       "贡山独龙族怒族自治县"               
      "530822" "云南省" "思茅市"                      "墨江哈尼族自治县"                     
      "530827" "云南省" "思茅市"                      "孟连傣族拉祜族佤族自治县"         
      "530801" "云南省" "思茅市"                      "市辖区"                                    
      "530800" "云南省" "思茅市"                      "思茅市"                                    
      "530821" "云南省" "思茅市"                      "普洱哈尼族彝族自治县"               
      "530823" "云南省" "思茅市"                      "景东彝族自治县"                        
      "530824" "云南省" "思茅市"                      "景谷傣族彝族自治县"                  
      "530826" "云南省" "思茅市"                      "江城哈尼族彝族自治县"               
      "530828" "云南省" "思茅市"                      "澜沧拉祜族自治县"                     
      "530802" "云南省" "思茅市"                      "翠云区"                                    
      "530829" "云南省" "思茅市"                      "西盟佤族自治县"                        
      "530825" "云南省" "思茅市"                      "镇沅彝族哈尼族拉祜族自治县"      
      "532626" "云南省" "文山壮族苗族自治州"    "丘北县"                                    
      "532628" "云南省" "文山壮族苗族自治州"    "富宁县"                                    
      "532627" "云南省" "文山壮族苗族自治州"    "广南县"                                    
      "532621" "云南省" "文山壮族苗族自治州"    "文山县"                                    
      "532600" "云南省" "文山壮族苗族自治州"    "文山壮族苗族自治州"                  
      "532622" "云南省" "文山壮族苗族自治州"    "砚山县"                                    
      "532623" "云南省" "文山壮族苗族自治州"    "西畴县"                                    
      "532625" "云南省" "文山壮族苗族自治州"    "马关县"                                    
      "532624" "云南省" "文山壮族苗族自治州"    "麻栗坡县"                                 
      "530113" "云南省" "昆明市"                      "东川区"                                    
      "530102" "云南省" "昆明市"                      "五华区"                                    
      "530121" "云南省" "昆明市"                      "呈贡县"                                    
      "530181" "云南省" "昆明市"                      "安宁市"                                    
      "530111" "云南省" "昆明市"                      "官渡区"                                    
      "530125" "云南省" "昆明市"                      "宜良县"                                    
      "530124" "云南省" "昆明市"                      "富民县"                                    
      "530129" "云南省" "昆明市"                      "寻甸回族彝族自治县"                  
      end
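
      One possible lenient criterion, offered only as a sketch and not as the rule anyone in this thread settled on, is to match on the first two characters of each name, so that "云南" pairs with "云南省" and "澜沧县" with "澜沧拉祜族自治县". The file names using_data and master_data and the k_ key variables are hypothetical, and usubstr() requires Stata 14 or later.

      ```stata
      * Sketch: match on two-character prefixes of the three name variables.
      * File names and the k_* variables are hypothetical.
      use using_data, clear
      drop if substr(dq, 5, 2) == "00"        // keep county-level rows only
      foreach v of varlist B0562 B0563 B0564 {
          gen k_`v' = usubstr(`v', 1, 2)      // first two Unicode characters
      }
      isid k_B0562 k_B0563 k_B0564            // stops with an error if the short keys are not unique
      keep k_B0562 k_B0563 k_B0564 dq
      tempfile codes
      save `codes'

      use master_data, clear
      capture drop dq                         // remove an empty dq, if the master has one
      foreach v of varlist B0562 B0563 B0564 {
          gen k_`v' = usubstr(`v', 1, 2)
      }
      merge m:1 k_B0562 k_B0563 k_B0564 using `codes', keep(master match)
      list B0562 B0563 B0564 if _merge == 1   // names still needing manual attention
      ```

      The rows ending in "00" are dropped first because, in the using sample, a prefecture-level entry such as "大理白族自治州" would otherwise share a short key with the county-level "大理市". Master rows with an empty prefecture, such as the "天津" observations, are unlikely to match under this rule and would show up in the final listing for manual review.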



      • #4
        More data samples from the master dataset:
        Code:
        * Example generated by -dataex-. To install: ssc install dataex
        clear
        input str24 B0562 str33 B0563 str45 B0564
        "上海市" "市辖区" "青浦区"   
        "上海市" "市辖区" "奉贤区"   
        "上海市" "市辖区" "普陀区"   
        "上海市" "市辖区" "奉贤区"   
        "上海市" "市辖区" "奉贤区"   
        "上海市" "市辖区" "奉贤区"   
        "上海市" "市辖区" "奉贤区"   
        "上海市" "市辖区" "奉贤区"   
        "上海市" "市辖区" "奉贤区"   
        "上海市" "市辖区" "奉贤区"   
        "上海市" "市辖区" "奉贤区"   
        "上海市" "市辖区" "南汇区"   
        "上海市" "市辖区" "闵行区"   
        "上海市" "市辖区" "浦东新区"
        "上海市" "市辖区" "浦东新区"
        "上海市" "市辖区" "浦东新区"
        "上海市" "市辖区" "浦东新区"
        "上海市" "市辖区" "松江区"   
        "上海市" "市辖区" "松江区"   
        "上海市" "市辖区" "奉贤区"   
        "上海市" "市辖区" "松江区"   
        "上海市" "市辖区" "松江区"   
        "上海市" "市辖区" "松江区"   
        "上海市" "市辖区" "松江区"   
        "上海市" "市辖区" "闵行区"   
        "上海市" "市辖区" "闵行区"   
        "上海市" "市辖区" "闵行区"   
        "上海市" "市辖区" "闵行区"   
        "上海市" "市辖区" "松江区"   
        "上海市" "市辖区" "闵行区"   
        "上海市" "市辖区" "闵行区"   
        "上海市" "市辖区" "闵行区"   
        "上海市" "市辖区" "闵行区"   
        "上海市" "市辖区" "闵行区"   
        "上海市" "市辖区" "闵行区"   
        "上海市" "市辖区" "松江区"   
        "上海市" "市辖区" "闵行区"   
        "上海市" "市辖区" "松江区"   
        "上海市" "市辖区" "虹口区"   
        "上海市" "市辖区" "青浦区"   
        "上海市" "市辖区" "徐汇区"   
        "上海市" "市辖区" "普陀区"   
        "上海市" "市辖区" "宝山区"   
        "上海市" "市辖区" "浦东新区"
        "上海市" "市辖区" "长宁区"   
        "上海市" "市辖区" "闵行区"   
        "上海市" "市辖区" "金山区"   
        "上海市" "市辖区" "金山区"   
        "上海市" "市辖区" "金山区"   
        "上海市" "市辖区" "金山区"   
        "上海市" "市辖区" "金山区"   
        "上海市" "市辖区" "金山区"   
        "上海市" "市辖区" "金山区"   
        "上海市" "市辖区" "金山区"   
        "上海市" "市辖区" "金山区"   
        "上海市" "市辖区" "金山区"   
        "上海市" "市辖区" "金山区"   
        "上海市" "市辖区" "宝山区"   
        "上海市" "市辖区" "宝山区"   
        "上海市" "市辖区" "宝山区"   
        "上海市" "市辖区" "宝山区"   
        "上海市" "市辖区" "宝山区"   
        "上海市" "市辖区" "宝山区"   
        "上海市" "市辖区" "宝山区"   
        "上海市" "市辖区" "浦东新区"
        "上海市" "市辖区" "宝山区"   
        "上海市" "市辖区" "浦东新区"
        "上海市" "市辖区" "青浦区"   
        "上海市" "市辖区" "金山区"   
        "上海市" "市辖区" "普陀区"   
        "上海市" "市辖区" "青浦区"   
        "上海市" "市辖区" "徐汇区"   
        "上海市" "市辖区" "闵行区"   
        "上海市" "市辖区" "松江区"   
        "上海市" "市辖区" "浦东新区"
        "上海市" "市辖区" "松江区"   
        "上海市" "市辖区" "松江区"   
        "上海市" "市辖区" "松江区"   
        "上海市" "市辖区" "松江区"   
        "上海市" "市辖区" "松江区"   
        "上海市" "市辖区" "松江区"   
        "上海市" "市辖区" "松江区"   
        "上海市" "市辖区" "松江区"   
        "上海市" "市辖区" "松江区"   
        "上海市" "市辖区" "杨浦区"   
        "上海市" "市辖区" "杨浦区"   
        "上海市" "市辖区" "杨浦区"   
        "上海市" "市辖区" "杨浦区"   
        "上海市" "市辖区" "杨浦区"   
        "上海市" "市辖区" "南汇区"   
        "上海市" "市辖区" "杨浦区"   
        "上海市" "市辖区" "杨浦区"   
        "上海市" "市辖区" "杨浦区"   
        "上海市" "市辖区" "杨浦区"   
        "上海市" "市辖区" "杨浦区"   
        "上海市" "市辖区" "杨浦区"   
        "上海市" "市辖区" "杨浦区"   
        "上海市" "市辖区" "杨浦区"   
        "上海市" "市辖区" "杨浦区"   
        "上海市" "市辖区" "杨浦区"   
        end

        Code:
        * Example generated by -dataex-. To install: ssc install dataex
        clear
        input str24 B0562 str33 B0563 str45 B0564
        "云南省" "曲靖市" "宣威市"
        "云南省" "曲靖市" "宣威市"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "盘龙区"
        "云南省" "昆明市" "官渡区"
        "云南省" "昆明市" "盘龙区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "盘龙区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "西山区"
        "云南省" "昆明市" "官渡区"
        "云南省" "昆明市" "西山区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "呈贡县"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "盘龙区"
        "云南省" "昆明市" "晋宁县"
        "云南省" "昆明市" "西山区"
        "云南省" "昆明市" "官渡区"
        "云南省" "昆明市" "西山区"
        "云南省" "昆明市" "西山区"
        "云南省" "昆明市" "官渡区"
        "云南省" "昆明市" "盘龙区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "盘龙区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "西山区"
        "云南省" "昆明市" "官渡区"
        "云南省" "昆明市" "西山区"
        "云南省" "昆明市" "西山区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "盘龙区"
        "云南省" "昆明市" "西山区"
        "云南"    "昆明"    "官渡"   
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "盘龙区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "官渡区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "晋宁县"
        "云南省" "昆明市" "西山区"
        "云南省" "昆明市" "官渡区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "西山区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "官渡区"
        "云南省" "红河州" "蒙自县"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "官渡区"
        "云南省" "昆明市" "官渡区"
        "云南省" "昆明市" "官渡区"
        "云南省" "昆明市" "官渡区"
        "云南省" "昆明市" "盘龙区"
        "云南省" "昆明市" "西山区"
        "云南省" "昆明市" "西山区"
        "云南省" "昆明市" "官渡区"
        "云南省" "昆明市" "盘龙区"
        "云南省" "昆明市" "西山区"
        "云南省" "昆明市" "盘龙区"
        "云南省" "昆明市" "官渡区"
        "云南省" "昆明市" "西山区"
        "云南省" "昆明市" "西山区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "盘龙区"
        "云南省" "昆明市" "盘龙区"
        "云南省" "昆明市" "盘龙区"
        "云南省" "楚雄州" "楚雄市"
        "云南省" "昆明市" "官渡区"
        "云南省" "昆明市" "盘龙区"
        "云南省" "昆明市" "西山区"
        "云南省" "昆明市" "盘龙区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "盘龙区"
        "云南省" "昆明市" "盘龙区"
        "云南省" "昆明市" "盘龙区"
        "云南省" "昆明市" "五华区"
        "云南"    "昆明市" "五华区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "官渡区"
        "云南省" "昆明市" "盘龙区"
        "云南"    "昆明市" "五华区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "官渡区"
        "云南省" "昆明市" "西山区"
        "云南省" "昆明市" "官渡区"
        "云南省" "昆明市" "五华区"
        "云南省" "昆明市" "五华区"
        end



        • #5
          Code:
          * Example generated by -dataex-. To install: ssc install dataex
          clear
          input str24 B0562 str33 B0563 str45 B0564
          "天津"       ""             "宁河县"            
          "天津"       ""             "武清区"            
          "河北省"    "邢台市"    "临西县"            
          "河北省"    "保定市"    "阜平县"            
          "河南省"    "平顶山市" "新华区"            
          "河北省"    "沧州市"    "南皮县"            
          "河北省"    "衡水市"    "深州市"            
          "河北省"    "衡水市"    "故城县"            
          "辽宁省"    "鞍山市"    "台安县"            
          "辽宁省"    "鞍山市"    "岫岩县"            
          "辽宁省"    "抚顺市"    "清原满族自治县"
          "辽宁省"    "丹东市"    "振兴区"            
          "辽宁省"    "营口市"    "老边区"            
          "辽宁省"    "营口市"    "盖州市"            
          "辽宁省"    "辽阳市"    "太子河区"         
          "辽宁省"    "辽阳市"    "辽阳县"            
          "辽宁省"    "铁岭市"    "西丰县"            
          "辽宁省"    "盘锦市"    "大洼县"            
          "河南省"    "南阳市"    "社旗县"            
          "辽宁省"    "沈阳市"    "苏家屯区"         
          "辽宁省"    "沈阳市"    "东陵区"            
          "辽宁省"    "沈阳市"    "东陵区"            
          "辽宁省"    "沈阳市"    "东陵区"            
          "辽宁省"    "沈阳市"    "于洪区"            
          "辽宁省"    "沈阳市"    "铁西区"            
          "辽宁省"    "沈阳市"    "新城子区"         
          "辽宁省"    "沈阳市"    "东陵区"            
          "辽宁省"    "沈阳市"    "于洪区"            
          "辽宁省"    "沈阳市"    "于洪区"            
          "辽宁省"    "沈阳市"    "大东区"            
          "辽宁省"    "沈阳市"    "于洪区"            
          "辽宁省"    "沈阳市"    "铁西区"            
          "辽宁省"    "沈阳市"    "于洪区"            
          "辽宁省"    "沈阳市"    "新城子区"         
          "辽宁省"    "沈阳市"    "新城子区"         
          "辽宁省"    "沈阳市"    "东陵区"            
          "辽宁省"    "沈阳市"    "于洪区"            
          "辽宁省"    "沈阳市"    "东陵区"            
          "辽宁省"    "沈阳市"    "辽中县"            
          "辽宁省"    "沈阳市"    "于洪区"            
          "辽宁省"    "沈阳市"    "铁西区"            
          "河南省"    "平顶山市" "新华区"            
          "辽宁省"    "沈阳市"    "东陵区"            
          "辽宁省"    "沈阳市"    "大东区"            
          "辽宁省"    "沈阳市"    "于洪区"            
          "辽宁省"    "沈阳市"    "东陵区"            
          "辽宁省"    "沈阳市"    "于洪区"            
          "辽宁省"    "沈阳市"    "于洪区"            
          "辽宁省"    "沈阳市"    "东陵区"            
          "辽宁省"    "沈阳市"    "东陵区"            
          "辽宁省"    "沈阳市"    "新城子区"         
          "辽宁省"    "沈阳市"    "于洪区"            
          "辽宁省"    "沈阳市"    "东陵区"            
          "辽宁省"    "沈阳市"    "东陵区"            
          "辽宁省"    "沈阳市"    "和平区"            
          "辽宁省"    "沈阳市"    "皇姑区"            
          "辽宁省"    "沈阳市"    "于洪区"            
          "辽宁省"    "沈阳市"    "和平区"            
          "辽宁省"    "沈阳市"    "东陵区"            
          "辽宁省"    "沈阳市"    "于洪区"            
          "辽宁省"    "沈阳市"    "于洪区"            
          "辽宁省"    "沈阳市"    "东陵区"            
          "辽宁省"    "沈阳市"    "苏家屯区"         
          "辽宁省"    "沈阳市"    "于洪区"            
          "辽宁省"    "沈阳市"    "东陵区"            
          "辽宁省"    "沈阳市"    "东陵区"            
          "河北省"    "衡水市"    "深州市"            
          "黑龙江省" "大庆市"    "肇源县"            
          "黑龙江省" "大庆市"    "肇州县"            
          "黑龙江省" "牡丹江市" "海林市"            
          "黑龙江省" "哈尔滨市" "五常市"            
          "黑龙江省" "哈尔滨市" "延寿县"            
          "黑龙江省" "黑河市"    "逊克县"            
          "黑龙江省" "哈尔滨市" "双城市"            
          "黑龙江省" "绥化市"    "明水县"            
          "黑龙江省" "鸡西市"    "密山市"            
          "河南省"    "商丘市"    "梁园区"            
          "河南省"    "洛阳市"    "廛河回族区"      
          "黑龙江省" "哈尔滨市" "南岗区"            
          "浙江省"    "温州市"    "平阳县"            
          "浙江省"    "衢州市"    "江山市"            
          "安徽省"    "安庆市"    "宿松县"            
          "安徽省"    "六安市"    "舒城县"            
          "安徽省"    "宣城市"    "郎溪县"            
          "安徽省"    "宣城市"    "广德县"            
          "福建"       "漳州市"    "南靖县"            
          "山东省"    "烟台市"    "莱阳市"            
          "辽宁省"    "沈阳市"    "于洪区"            
          "河南省"    "郑州市"    "荥阳市"            
          "河南省"    "郑州市"    "新密市"            
          "河南省"    "郑州市"    "新密市"            
          "河南省"    "新乡市"    "原阳县"            
          "河南省"    "新乡市"    "长垣县"            
          "河南省"    "焦作市"    "博爱县"            
          "河南省"    "焦作市"    "武陟县"            
          "河南省"    "漯河市"    "临颖县"            
          "河南省"    "商丘市"    "民权县"            
          "河南省"    "驻马店市" "泌阳县"            
          "河南省"    "驻马店市" "泌阳县"            
          "河南省"    "信阳市"    "罗山县"            
          end
          tempfile master
          save `master'
          
          * Example generated by -dataex-. To install: ssc install dataex
          clear
          input str6 dq str24 B0562 str33 B0563 str45 B0564
          "310000" "上海市" "上海市"                      "上海市"                                    
          "310200" "上海市" "县"                            "县"                                          
          "310230" "上海市" "县"                            "崇明县"                                    
          "310119" "上海市" "市辖区"                      "南汇区"                                    
          "310103" "上海市" "市辖区"                      "卢湾区"                                    
          "310114" "上海市" "市辖区"                      "嘉定区"                                    
          "310120" "上海市" "市辖区"                      "奉贤区"                                    
          "310113" "上海市" "市辖区"                      "宝山区"                                    
          "310100" "上海市" "市辖区"                      "市辖区"                                    
          "310104" "上海市" "市辖区"                      "徐汇区"                                    
          "310107" "上海市" "市辖区"                      "普陀区"                                    
          "310110" "上海市" "市辖区"                      "杨浦区"                                    
          "310117" "上海市" "市辖区"                      "松江区"                                    
          "310115" "上海市" "市辖区"                      "浦东新区"                                 
          "310109" "上海市" "市辖区"                      "虹口区"                                    
          "310116" "上海市" "市辖区"                      "金山区"                                    
          "310105" "上海市" "市辖区"                      "长宁区"                                    
          "310112" "上海市" "市辖区"                      "闵行区"                                    
          "310108" "上海市" "市辖区"                      "闸北区"                                    
          "310118" "上海市" "市辖区"                      "青浦区"                                    
          "310106" "上海市" "市辖区"                      "静安区"                                    
          "310101" "上海市" "市辖区"                      "黄浦区"                                    
          "530900" "云南省" "临沧市"                      "临沧市"                                    
          "530902" "云南省" "临沧市"                      "临翔区"                                    
          "530922" "云南省" "临沧市"                      "云县"                                       
          "530921" "云南省" "临沧市"                      "凤庆县"                                    
          "530925" "云南省" "临沧市"                      "双江拉祜族佤族布朗族傣族自治县"
          "530901" "云南省" "临沧市"                      "市辖区"                                    
          "530923" "云南省" "临沧市"                      "永德县"                                    
          "530927" "云南省" "临沧市"                      "沧源佤族自治县"                        
          "530926" "云南省" "临沧市"                      "耿马傣族佤族自治县"                  
          "530924" "云南省" "临沧市"                      "镇康县"                                    
          "530700" "云南省" "丽江市"                      "丽江市"                                    
          "530723" "云南省" "丽江市"                      "华坪县"                                    
          "530702" "云南省" "丽江市"                      "古城区"                                    
          "530724" "云南省" "丽江市"                      "宁蒗彝族自治县"                        
          "530701" "云南省" "丽江市"                      "市辖区"                                    
          "530722" "云南省" "丽江市"                      "永胜县"                                    
          "530721" "云南省" "丽江市"                      "玉龙纳西族自治县"                     
          "530000" "云南省" "云南省"                      "云南省"                                    
          "530500" "云南省" "保山市"                      "保山市"                                    
          "530501" "云南省" "保山市"                      "市辖区"                                    
          "530521" "云南省" "保山市"                      "施甸县"                                    
          "530524" "云南省" "保山市"                      "昌宁县"                                    
          "530522" "云南省" "保山市"                      "腾冲县"                                    
          "530502" "云南省" "保山市"                      "隆阳区"                                    
          "530523" "云南省" "保山市"                      "龙陵县"                                    
          "532929" "云南省" "大理白族自治州"          "云龙县"                                    
          "532931" "云南省" "大理白族自治州"          "剑川县"                                    
          "532926" "云南省" "大理白族自治州"          "南涧彝族自治县"                        
          "532901" "云南省" "大理白族自治州"          "大理市"                                    
          "532900" "云南省" "大理白族自治州"          "大理白族自治州"                        
          "532924" "云南省" "大理白族自治州"          "宾川县"                                    
          "532927" "云南省" "大理白族自治州"          "巍山彝族回族自治县"                  
          "532925" "云南省" "大理白族自治州"          "弥渡县"                                    
          "532928" "云南省" "大理白族自治州"          "永平县"                                    
          "532930" "云南省" "大理白族自治州"          "洱源县"                                    
          "532922" "云南省" "大理白族自治州"          "漾濞彝族自治县"                        
          "532923" "云南省" "大理白族自治州"          "祥云县"                                    
          "532932" "云南省" "大理白族自治州"          "鹤庆县"                                    
          "533100" "云南省" "德宏傣族景颇族自治州" "德宏傣族景颇族自治州"               
          "533122" "云南省" "德宏傣族景颇族自治州" "梁河县"                                    
          "533103" "云南省" "德宏傣族景颇族自治州" "潞西市"                                    
          "533102" "云南省" "德宏傣族景颇族自治州" "瑞丽市"                                    
          "533123" "云南省" "德宏傣族景颇族自治州" "盈江县"                                    
          "533124" "云南省" "德宏傣族景颇族自治州" "陇川县"                                    
          "533325" "云南省" "怒江傈僳族自治州"       "兰坪白族普米族自治县"               
          "533300" "云南省" "怒江傈僳族自治州"       "怒江傈僳族自治州"                     
          "533321" "云南省" "怒江傈僳族自治州"       "泸水县"                                    
          "533323" "云南省" "怒江傈僳族自治州"       "福贡县"                                    
          "533324" "云南省" "怒江傈僳族自治州"       "贡山独龙族怒族自治县"               
          "530822" "云南省" "思茅市"                      "墨江哈尼族自治县"                     
          "530827" "云南省" "思茅市"                      "孟连傣族拉祜族佤族自治县"         
          "530801" "云南省" "思茅市"                      "市辖区"                                    
          "530800" "云南省" "思茅市"                      "思茅市"                                    
          "530821" "云南省" "思茅市"                      "普洱哈尼族彝族自治县"               
          "530823" "云南省" "思茅市"                      "景东彝族自治县"                        
          "530824" "云南省" "思茅市"                      "景谷傣族彝族自治县"                  
          "530826" "云南省" "思茅市"                      "江城哈尼族彝族自治县"               
          "530828" "云南省" "思茅市"                      "澜沧拉祜族自治县"                     
          "530802" "云南省" "思茅市"                      "翠云区"                                    
          "530829" "云南省" "思茅市"                      "西盟佤族自治县"                        
          "530825" "云南省" "思茅市"                      "镇沅彝族哈尼族拉祜族自治县"      
          "532626" "云南省" "文山壮族苗族自治州"    "丘北县"                                    
          "532628" "云南省" "文山壮族苗族自治州"    "富宁县"                                    
          "532627" "云南省" "文山壮族苗族自治州"    "广南县"                                    
          "532621" "云南省" "文山壮族苗族自治州"    "文山县"                                    
          "532600" "云南省" "文山壮族苗族自治州"    "文山壮族苗族自治州"                  
          "532622" "云南省" "文山壮族苗族自治州"    "砚山县"                                    
          "532623" "云南省" "文山壮族苗族自治州"    "西畴县"                                    
          "532625" "云南省" "文山壮族苗族自治州"    "马关县"                                    
          "532624" "云南省" "文山壮族苗族自治州"    "麻栗坡县"                                 
          "530113" "云南省" "昆明市"                      "东川区"                                    
          "530102" "云南省" "昆明市"                      "五华区"                                    
          "530121" "云南省" "昆明市"                      "呈贡县"                                    
          "530181" "云南省" "昆明市"                      "安宁市"                                    
          "530111" "云南省" "昆明市"                      "官渡区"                                    
          "530125" "云南省" "昆明市"                      "宜良县"                                    
          "530124" "云南省" "昆明市"                      "富民县"                                    
          "530129" "云南省" "昆明市"                      "寻甸回族彝族自治县"                  
          end
          tempfile dq_data_set
          save `dq_data_set'
          
          * Example generated by -dataex-. To install: ssc install dataex
          clear
          input str24 B0562 str33 B0563 str45 B0564
          "上海市" "市辖区" "青浦区"   
          "上海市" "市辖区" "奉贤区"   
          "上海市" "市辖区" "普陀区"   
          "上海市" "市辖区" "奉贤区"   
          "上海市" "市辖区" "奉贤区"   
          "上海市" "市辖区" "奉贤区"   
          "上海市" "市辖区" "奉贤区"   
          "上海市" "市辖区" "奉贤区"   
          "上海市" "市辖区" "奉贤区"   
          "上海市" "市辖区" "奉贤区"   
          "上海市" "市辖区" "奉贤区"   
          "上海市" "市辖区" "南汇区"   
          "上海市" "市辖区" "闵行区"   
          "上海市" "市辖区" "浦东新区"
          "上海市" "市辖区" "浦东新区"
          "上海市" "市辖区" "浦东新区"
          "上海市" "市辖区" "浦东新区"
          "上海市" "市辖区" "松江区"   
          "上海市" "市辖区" "松江区"   
          "上海市" "市辖区" "奉贤区"   
          "上海市" "市辖区" "松江区"   
          "上海市" "市辖区" "松江区"   
          "上海市" "市辖区" "松江区"   
          "上海市" "市辖区" "松江区"   
          "上海市" "市辖区" "闵行区"   
          "上海市" "市辖区" "闵行区"   
          "上海市" "市辖区" "闵行区"   
          "上海市" "市辖区" "闵行区"   
          "上海市" "市辖区" "松江区"   
          "上海市" "市辖区" "闵行区"   
          "上海市" "市辖区" "闵行区"   
          "上海市" "市辖区" "闵行区"   
          "上海市" "市辖区" "闵行区"   
          "上海市" "市辖区" "闵行区"   
          "上海市" "市辖区" "闵行区"   
          "上海市" "市辖区" "松江区"   
          "上海市" "市辖区" "闵行区"   
          "上海市" "市辖区" "松江区"   
          "上海市" "市辖区" "虹口区"   
          "上海市" "市辖区" "青浦区"   
          "上海市" "市辖区" "徐汇区"   
          "上海市" "市辖区" "普陀区"   
          "上海市" "市辖区" "宝山区"   
          "上海市" "市辖区" "浦东新区"
          "上海市" "市辖区" "长宁区"   
          "上海市" "市辖区" "闵行区"   
          "上海市" "市辖区" "金山区"   
          "上海市" "市辖区" "金山区"   
          "上海市" "市辖区" "金山区"   
          "上海市" "市辖区" "金山区"   
          "上海市" "市辖区" "金山区"   
          "上海市" "市辖区" "金山区"   
          "上海市" "市辖区" "金山区"   
          "上海市" "市辖区" "金山区"   
          "上海市" "市辖区" "金山区"   
          "上海市" "市辖区" "金山区"   
          "上海市" "市辖区" "金山区"   
          "上海市" "市辖区" "宝山区"   
          "上海市" "市辖区" "宝山区"   
          "上海市" "市辖区" "宝山区"   
          "上海市" "市辖区" "宝山区"   
          "上海市" "市辖区" "宝山区"   
          "上海市" "市辖区" "宝山区"   
          "上海市" "市辖区" "宝山区"   
          "上海市" "市辖区" "浦东新区"
          "上海市" "市辖区" "宝山区"   
          "上海市" "市辖区" "浦东新区"
          "上海市" "市辖区" "青浦区"   
          "上海市" "市辖区" "金山区"   
          "上海市" "市辖区" "普陀区"   
          "上海市" "市辖区" "青浦区"   
          "上海市" "市辖区" "徐汇区"   
          "上海市" "市辖区" "闵行区"   
          "上海市" "市辖区" "松江区"   
          "上海市" "市辖区" "浦东新区"
          "上海市" "市辖区" "松江区"   
          "上海市" "市辖区" "松江区"   
          "上海市" "市辖区" "松江区"   
          "上海市" "市辖区" "松江区"   
          "上海市" "市辖区" "松江区"   
          "上海市" "市辖区" "松江区"   
          "上海市" "市辖区" "松江区"   
          "上海市" "市辖区" "松江区"   
          "上海市" "市辖区" "松江区"   
          "上海市" "市辖区" "杨浦区"   
          "上海市" "市辖区" "杨浦区"   
          "上海市" "市辖区" "杨浦区"   
          "上海市" "市辖区" "杨浦区"   
          "上海市" "市辖区" "杨浦区"   
          "上海市" "市辖区" "南汇区"   
          "上海市" "市辖区" "杨浦区"   
          "上海市" "市辖区" "杨浦区"   
          "上海市" "市辖区" "杨浦区"   
          "上海市" "市辖区" "杨浦区"   
          "上海市" "市辖区" "杨浦区"   
          "上海市" "市辖区" "杨浦区"   
          "上海市" "市辖区" "杨浦区"   
          "上海市" "市辖区" "杨浦区"   
          "上海市" "市辖区" "杨浦区"   
          "上海市" "市辖区" "杨浦区"   
          end
          append using `master'
          save `"`master'"', replace
          
          * Example generated by -dataex-. To install: ssc install dataex
          clear
          input str24 B0562 str33 B0563 str45 B0564
          "云南省" "曲靖市" "宣威市"
          "云南省" "曲靖市" "宣威市"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "盘龙区"
          "云南省" "昆明市" "官渡区"
          "云南省" "昆明市" "盘龙区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "盘龙区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "西山区"
          "云南省" "昆明市" "官渡区"
          "云南省" "昆明市" "西山区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "呈贡县"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "盘龙区"
          "云南省" "昆明市" "晋宁县"
          "云南省" "昆明市" "西山区"
          "云南省" "昆明市" "官渡区"
          "云南省" "昆明市" "西山区"
          "云南省" "昆明市" "西山区"
          "云南省" "昆明市" "官渡区"
          "云南省" "昆明市" "盘龙区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "盘龙区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "西山区"
          "云南省" "昆明市" "官渡区"
          "云南省" "昆明市" "西山区"
          "云南省" "昆明市" "西山区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "盘龙区"
          "云南省" "昆明市" "西山区"
          "云南"    "昆明"    "官渡"   
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "盘龙区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "官渡区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "晋宁县"
          "云南省" "昆明市" "西山区"
          "云南省" "昆明市" "官渡区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "西山区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "官渡区"
          "云南省" "红河州" "蒙自县"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "官渡区"
          "云南省" "昆明市" "官渡区"
          "云南省" "昆明市" "官渡区"
          "云南省" "昆明市" "官渡区"
          "云南省" "昆明市" "盘龙区"
          "云南省" "昆明市" "西山区"
          "云南省" "昆明市" "西山区"
          "云南省" "昆明市" "官渡区"
          "云南省" "昆明市" "盘龙区"
          "云南省" "昆明市" "西山区"
          "云南省" "昆明市" "盘龙区"
          "云南省" "昆明市" "官渡区"
          "云南省" "昆明市" "西山区"
          "云南省" "昆明市" "西山区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "盘龙区"
          "云南省" "昆明市" "盘龙区"
          "云南省" "昆明市" "盘龙区"
          "云南省" "楚雄州" "楚雄市"
          "云南省" "昆明市" "官渡区"
          "云南省" "昆明市" "盘龙区"
          "云南省" "昆明市" "西山区"
          "云南省" "昆明市" "盘龙区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "盘龙区"
          "云南省" "昆明市" "盘龙区"
          "云南省" "昆明市" "盘龙区"
          "云南省" "昆明市" "五华区"
          "云南"    "昆明市" "五华区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "官渡区"
          "云南省" "昆明市" "盘龙区"
          "云南"    "昆明市" "五华区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "官渡区"
          "云南省" "昆明市" "西山区"
          "云南省" "昆明市" "官渡区"
          "云南省" "昆明市" "五华区"
          "云南省" "昆明市" "五华区"
          end
          append using `master'
          save `"`master'"', replace
          
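          * Attach dq to every master observation whose three names match exactly;
          * unmatched master observations are kept with dq missing (_merge == 1).
          merge m:1 B0562 B0563 B0564 using `dq_data_set', keep(master match)
          
          * dq is a six-character code: characters 1-2 give the province, 1-4 the prefecture.
          gen var1 = substr(dq, 1, 2) + "0000" if !missing(dq)   // province-level code
          gen var2 = substr(dq, 1, 4) + "00" if !missing(dq)     // prefecture-level code
          rename dq var3                                         // full six-digit code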
          The last few lines of the code above carry out the merge and create the three variables you want. I don't know how to help with the observations in the master data set that remain unmatched. For names written in an alphabetic script there are programs that do "fuzzy" matching, but as far as I know they cannot be applied to ideographs. Is there perhaps a reference compendium that lists the official names of all the provinces, prefectures, and counties alongside the known variations on those names? Such a crosswalk could be imported as a Stata data set and merged with your master data; you could then merge in the dq data set on the official names instead of the varying names you have now.
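
To make the crosswalk idea concrete, here is a minimal sketch, assuming such a crosswalk exists or has been typed in by hand. The variable names off62, off63, off64 and raw0562, raw0563, raw0564 are hypothetical, the two crosswalk rows are only illustrations built from variants that appear in this thread, and the small master and dq data sets are stand-ins for the real ones.

```stata
* Stand-in for the dq data set (two rows copied from the example above)
clear
input str6 dq str24 B0562 str33 B0563 str45 B0564
"530102" "云南省" "昆明市" "五华区"
"530111" "云南省" "昆明市" "官渡区"
end
tempfile dq_data_set
save `dq_data_set'

* Hypothetical crosswalk: variant names (B05*) -> official names (off*)
clear
input str24 B0562 str33 B0563 str45 B0564 str24 off62 str33 off63 str45 off64
"云南" "昆明"   "官渡"   "云南省" "昆明市" "官渡区"
"云南" "昆明市" "五华区" "云南省" "昆明市" "五华区"
end
tempfile crosswalk
save `crosswalk'

* Small stand-in for the master data: two variants and one official spelling
clear
input str24 B0562 str33 B0563 str45 B0564
"云南"   "昆明"   "官渡"
"云南"   "昆明市" "五华区"
"云南省" "昆明市" "官渡区"
end

* Step 1: look up official names for every variant listed in the crosswalk
merge m:1 B0562 B0563 B0564 using `crosswalk', keep(master match)

* Step 2: names not found in the crosswalk are assumed to be official already;
* keep the original spellings as raw05* and put the official names in B05*
foreach v in 62 63 64 {
    replace off`v' = B05`v' if _merge == 1
    rename B05`v' raw05`v'
    rename off`v' B05`v'
}
drop _merge

* Step 3: merge in dq on the official names
merge m:1 B0562 B0563 B0564 using `dq_data_set', keep(master match)
```

Anything still showing _merge == 1 after the last step is a name combination that is neither official nor covered by the crosswalk, so those are the rows that would need a new crosswalk entry.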


          • #6
            Thank you very much! Probably I have to do it manually for those unmatched observations.
