
  • string variables: How to use correctly substr in my case?

    Hi everyone,

    I have a question related to the below dataset:

    Code:
    * Example generated by -dataex-. For more info, type help dataex
    clear
    input double(Gis_X Gis_Y) str23 Distrito str26 Barrio str50 Calle
    439078.78 4473665.67 "01  CENTRO" "01-01 PALACIO" "SEGOVIA, RONDA, DE"                      
    439076.91 4473616.39 "01  CENTRO" "01-01 PALACIO" "SEGOVIA, RONDA, DE"                      
    439511.85 4473945.25 "01  CENTRO" "01-01 PALACIO" "SEGOVIA, CALLE, DE"                      
    439129.02 4473505.46 "01  CENTRO" "01-01 PALACIO" "SEGOVIA, RONDA, DE"                      
    439144.26 4473472.94 "01  CENTRO" "01-01 PALACIO" "SEGOVIA, RONDA, DE"                      
    439741.25 4473700.08 "01  CENTRO" "01-01 PALACIO" "CAVA ALTA, CALLE, DE LA"                  
    439580.77 4473945.07 "01  CENTRO" "01-01 PALACIO" "SEGOVIA, CALLE, DE"                      
    439547.53 4473935.68 "01  CENTRO" "01-01 PALACIO" "SEGOVIA, CALLE, DE"                      
    439611.29 4474144.48 "01  CENTRO" "01-01 PALACIO" "SAN NICOLAS, PLAZA, DE"                  
    439093.98 4473580.23 "01  CENTRO" "01-01 PALACIO" "SEGOVIA, RONDA, DE"                      
    439724.66 4473414.35 "01  CENTRO" "01-01 PALACIO" "CALATRAVA, CALLE, DE"                    
    439724.28 4474326.66 "01  CENTRO" "01-01 PALACIO" "UNION, CALLE, DE LA"                      
    439309.97 4473913.98 "01  CENTRO" "01-01 PALACIO" "VALLICIERGO, CALLE, DE"                  
    439206.07 4474078.34 "01  CENTRO" "01-01 PALACIO" "VEGA, CUESTA, DE LA"                      
    439888.78 4474676.45 "01  CENTRO" "01-01 PALACIO" "SANTO DOMINGO, PLAZA, DE"                
    439394.36 4473795.92 "01  CENTRO" "01-01 PALACIO" "YESEROS, CALLE, DE LOS"                  
    439702.45 4474356.78 "01  CENTRO" "01-01 PALACIO" "VERGARA, CALLE, DE"                      
    439835.96 4474773.99 "01  CENTRO" "01-01 PALACIO" "ISABEL LA CATOLICA, CALLE, DE"            
    439386.17  4473452.9 "01  CENTRO" "01-01 PALACIO" "ROSARIO, CALLE, DEL"                      
    439525.28 4473608.81 "01  CENTRO" "01-01 PALACIO" "SAN FRANCISCO, CARRERA, DE"              
    439413.46 4473467.54 "01  CENTRO" "01-01 PALACIO" "ROSARIO, CALLE, DEL"                      
    439338.37  4473687.7 "01  CENTRO" "01-01 PALACIO" "SAN BUENAVENTURA, CALLE, DE"              
    439726.99 4473514.22 "01  CENTRO" "01-01 PALACIO" "HUMILLADERO, CALLE, DEL"                  
    439897.36 4473871.42 "01  CENTRO" "01-01 PALACIO" "CAVA BAJA, CALLE, DE LA"                  
    439393.06 4473397.86 "01  CENTRO" "01-01 PALACIO" "SAN BERNABE, CALLE, DE"                  
    439386.09 4474080.95 "01  CENTRO" "01-01 PALACIO" "MAYOR, CALLE"                            
    439277.93 4473395.72 "01  CENTRO" "01-01 PALACIO" "ROSARIO, CALLE, DEL"                      
    439470.97 4473703.13 "01  CENTRO" "01-01 PALACIO" "DON PEDRO, CALLE, DE"                    
    439548.61 4474156.88 "01  CENTRO" "01-01 PALACIO" "FACTOR, CALLE, DEL"                      
    439831.15 4474082.63 "01  CENTRO" "01-01 PALACIO" "SAN MIGUEL, PLAZA, DE"                    
    439371.54 4473301.89 "01  CENTRO" "01-01 PALACIO" "DESCARGAS, CUESTA, DE LAS"                
     439410.8 4473823.15 "01  CENTRO" "01-01 PALACIO" "MORERIA, CALLE, DE LA"                    
    439471.84  4473847.8 "01  CENTRO" "01-01 PALACIO" "MORERIA, CALLE, DE LA"                    
     439728.7 4473528.18 "01  CENTRO" "01-01 PALACIO" "HUMILLADERO, CALLE, DEL"                  
    439765.86 4474366.74 "01  CENTRO" "01-01 PALACIO" "INDEPENDENCIA, CALLE, DE LA"              
    439622.35 4474669.55 "01  CENTRO" "01-01 PALACIO" "ENCARNACION, CALLE, DE LA"                
    439667.95 4473557.69 "01  CENTRO" "01-01 PALACIO" "LUCIENTE, CALLE, DE"                      
    439575.49 4473495.34 "01  CENTRO" "01-01 PALACIO" "AGUILA, CALLE, DEL"                      
    439728.16  4474865.1 "01  CENTRO" "01-01 PALACIO" "LEGANITOS, CALLE, DE"                    
    439768.45 4474524.75 "01  CENTRO" "01-01 PALACIO" "SANTO DOMINGO, CUESTA, DE"                
    439820.89 4474707.15 "01  CENTRO" "01-01 PALACIO" "LEGANITOS, CALLE, DE"                    
    439644.86 4474147.05 "01  CENTRO" "01-01 PALACIO" "JUAN DE HERRERA, CALLE, DE"              
    439820.86 4474303.54 "01  CENTRO" "01-01 PALACIO" "ESCALINATA, CALLE, DE LA"                
     439833.1 4474161.33 "01  CENTRO" "01-01 PALACIO" "MAYOR, CALLE"                            
    439767.95 4473590.17 "01  CENTRO" "01-01 PALACIO" "CEBADA, CALLE, DE LA"                    
    439432.01 4473497.06 "01  CENTRO" "01-01 PALACIO" "SAN FRANCISCO, PLAZA, DE"                
    439558.35 4473470.48 "01  CENTRO" "01-01 PALACIO" "AGUILA, CALLE, DEL"                      
    439460.97  4473584.1 "01  CENTRO" "01-01 PALACIO" "SAN FRANCISCO, CARRERA, DE"              
    439447.18 4473455.79 "01  CENTRO" "01-01 PALACIO" "GRAN VIA DE SAN FRANCISCO, AVENIDA, DE LA"
    439487.66 4473392.83 "01  CENTRO" "01-01 PALACIO" "GRAN VIA DE SAN FRANCISCO, AVENIDA, DE LA"
     439330.4 4473376.99 "01  CENTRO" "01-01 PALACIO" "VENTOSA, CALLE, DE LA"                    
    439532.28 4473308.17 "01  CENTRO" "01-01 PALACIO" "GRAN VIA DE SAN FRANCISCO, AVENIDA, DE LA"
    439764.67 4473658.84 "01  CENTRO" "01-01 PALACIO" "CEBADA, PLAZA, DE LA"                    
    439740.07 4474887.09 "01  CENTRO" "01-01 PALACIO" "FLOR BAJA, CALLE, DE LA"                  
    439615.69    4473938 "01  CENTRO" "01-01 PALACIO" "SEGOVIA, CALLE, DE"                      
    439088.22 4473591.83 "01  CENTRO" "01-01 PALACIO" "SEGOVIA, RONDA, DE"                      
    439635.92 4473565.19 "01  CENTRO" "01-01 PALACIO" "LUCIENTE, CALLE, DE"                      
    439592.91 4473566.23 "01  CENTRO" "01-01 PALACIO" "AGUAS, CALLE, DE LAS"                    
    439946.64 4474684.79 "01  CENTRO" "01-01 PALACIO" "SILVA, CALLE, DE"                        
    439226.88 4474003.64 "01  CENTRO" "01-01 PALACIO" "VEGA, CUESTA, DE LA"                      
    439422.03 4473413.42 "01  CENTRO" "01-01 PALACIO" "SAN BERNABE, CALLE, DE"                  
     439457.9 4473717.38 "01  CENTRO" "01-01 PALACIO" "REDONDILLA, CALLE, DE LA"                
    439577.21 4473680.76 "01  CENTRO" "01-01 PALACIO" "DON PEDRO, CALLE, DE"                    
     439871.7 4474112.65 "01  CENTRO" "01-01 PALACIO" "CAVA DE SAN MIGUEL, CALLE, DE LA"        
    439737.51 4474275.39 "01  CENTRO" "01-01 PALACIO" "UNION, CALLE, DE LA"                      
    439721.75 4474632.73 "01  CENTRO" "01-01 PALACIO" "GUILLERMO ROLLAND, CALLE, DE"            
    439667.65 4474607.52 "01  CENTRO" "01-01 PALACIO" "ENCARNACION, CALLE, DE LA"                
    439773.76 4473804.41 "01  CENTRO" "01-01 PALACIO" "ALMENDRO, CALLE, DEL"                    
    439411.28 4473715.83 "01  CENTRO" "01-01 PALACIO" "DON PEDRO, CALLE, DE"                    
    439505.84 4473693.05 "01  CENTRO" "01-01 PALACIO" "DON PEDRO, CALLE, DE"                    
     439667.2 4474297.81 "01  CENTRO" "01-01 PALACIO" "AMNISTIA, CALLE, DE LA"                  
    439601.59 4473833.74 "01  CENTRO" "01-01 PALACIO" "SAN ANDRES, COSTANILLA, DE"              
       439510 4473865.84 "01  CENTRO" "01-01 PALACIO" "MORERIA, PLAZA, DE LA"                    
    439485.11 4473590.64 "01  CENTRO" "01-01 PALACIO" "SAN FRANCISCO, CARRERA, DE"              
    439318.19 4473344.84 "01  CENTRO" "01-01 PALACIO" "DESCARGAS, CUESTA, DE LAS"                
    439617.37 4473873.91 "01  CENTRO" "01-01 PALACIO" "SAN ANDRES, COSTANILLA, DE"              
    439209.69  4473940.3 "01  CENTRO" "01-01 PALACIO" "SEGOVIA, CALLE, DE"                      
    439868.52 4474194.68 "01  CENTRO" "01-01 PALACIO" "COMANDANTE LAS MORENAS, PLAZA, DEL"      
    439556.07 4473736.71 "01  CENTRO" "01-01 PALACIO" "MANCEBOS, CALLE, DE LOS"                  
     439762.9 4473942.82 "01  CENTRO" "01-01 PALACIO" "DOCTOR LETAMENDI, CALLE, DEL"            
    439463.95 4473717.98 "01  CENTRO" "01-01 PALACIO" "REDONDILLA, CALLE, DE LA"                
    439365.69 4473989.84 "01  CENTRO" "01-01 PALACIO" "RAMON, CUESTA, DE"                        
    439320.14 4474023.75 "01  CENTRO" "01-01 PALACIO" "MAYOR, CALLE"                            
    439338.36 4473729.73 "01  CENTRO" "01-01 PALACIO" "GABRIEL MIRO, PLAZA, DE"                  
    439297.05 4473939.05 "01  CENTRO" "01-01 PALACIO" "SEGOVIA, CALLE, DE"                      
    439499.78 4473582.81 "01  CENTRO" "01-01 PALACIO" "SAN ISIDRO LABRADOR, CALLE, DE"          
    439595.58 4473427.73 "01  CENTRO" "01-01 PALACIO" "CALATRAVA, CALLE, DE"                    
    439681.59 4474534.24 "01  CENTRO" "01-01 PALACIO" "ARRIETA, CALLE, DE"                      
    439647.16 4474638.99 "01  CENTRO" "01-01 PALACIO" "ENCARNACION, CALLE, DE LA"                
    439343.09    4473437 "01  CENTRO" "01-01 PALACIO" "ROSARIO, CALLE, DEL"                      
    439920.17 4474499.33 "01  CENTRO" "01-01 PALACIO" "ANGELES, COSTANILLA, DE LOS"              
    439716.41 4473619.66 "01  CENTRO" "01-01 PALACIO" "HUMILLADERO, CALLE, DEL"                  
    439475.24 4473936.63 "01  CENTRO" "01-01 PALACIO" "SEGOVIA, CALLE, DE"                      
    439370.71 4473444.66 "01  CENTRO" "01-01 PALACIO" "ROSARIO, CALLE, DEL"                      
       439743 4473670.75 "01  CENTRO" "01-01 PALACIO" "CEBADA, PLAZA, DE LA"                    
    439495.36 4473211.65 "01  CENTRO" "01-01 PALACIO" "SEGOVIA, RONDA, DE"                      
    439536.49 4473994.74 "01  CENTRO" "01-01 PALACIO" "VILLA, CALLE, DE LA"                      
    439121.85  4473777.3 "01  CENTRO" "01-01 PALACIO" "SEGOVIA, RONDA, DE"                      
     439882.6 4474486.32 "01  CENTRO" "01-01 PALACIO" "CAÑOS DEL PERAL, CALLE, DE LOS"          
    439836.63  4474692.4 "01  CENTRO" "01-01 PALACIO" "SANTO DOMINGO, PLAZA, DE"                
    end

    Another example from the same dataset:


    Code:
    * Example generated by -dataex-. For more info, type help dataex
    clear
    input double(Gis_X Gis_Y) str23 Distrito str26 Barrio
    437610.25 4475808.36 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438469.95 4474962.66 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438046.07 4475540.74 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438646.34 4474876.06 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437759.83 4475517.25 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437923.37 4475663.23 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438252.11 4475091.63 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437533.79 4475931.73 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438112.57 4475216.87 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438502.03 4475057.88 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438081.32 4475385.27 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438084.93 4475192.84 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437966.32 4475422.38 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438297.96 4475262.55 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438963.97 4474908.39 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437633.82 4475810.85 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438676.86 4474851.24 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438756.66 4474848.94 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437796.63 4475566.13 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438597.53 4475311.84 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437609.57 4475653.19 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437743.79 4475693.95 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437606.67 4475732.98 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438146.29  4475186.6 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437848.16 4475493.22 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437616.11 4475720.08 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438706.95  4474813.1 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437970.31 4475622.34 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438324.73 4475310.62 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437682.32 4475722.34 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    439006.59 4474861.97 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437621.75 4475734.16 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437855.37 4475436.41 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437851.57 4475343.73 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438691.09 4475466.44 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437889.24 4475435.12 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438020.74 4475559.21 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437827.45 4475882.03 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
     438986.5 4475058.67 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438445.91  4475113.4 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438745.62 4474960.22 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437795.43  4475417.6 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    439284.13 4475076.48 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437861.82 4475966.58 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438203.29 4475469.95 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
       437640 4475651.58 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437766.82 4475607.39 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438326.91  4475111.7 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438539.82 4475083.06 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437735.28 4475880.58 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437832.83 4475640.59 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437989.99 4475751.12 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437825.63 4476016.17 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437896.57 4475423.31 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438582.22  4475012.9 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437592.14  4475684.7 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437703.01 4475707.61 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
       437748 4475492.09 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437892.09 4475316.97 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437948.28 4475517.33 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438239.54 4475108.68 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
     437722.7 4475889.57 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438509.26 4475979.15 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438090.42 4475484.43 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438188.28 4475074.66 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437773.96 4475822.68 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437648.95 4475726.07 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438094.66 4475299.96 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437838.32 4475349.46 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438581.11 4475366.91 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438477.73 4474964.45 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438890.91  4476245.2 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438345.73 4475989.65 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438215.24 4475124.78 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438339.93 4475293.58 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438509.97 4475781.48 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437756.89 4475841.53 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437823.52 4476101.15 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438752.12 4474908.23 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437939.03 4475379.44 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437812.91 4475389.56 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438498.37 4475128.28 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437587.55 4475693.65 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437946.41 4475362.77 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438509.53 4475038.23 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437974.01 4475450.66 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438632.93 4475155.25 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437785.49 4475533.75 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438267.08 4475042.87 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438523.22 4476047.17 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438077.61 4475379.03 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
     438070.3 4475541.75 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437606.15 4475758.97 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437793.21    4475860 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437922.22 4475392.76 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
     437677.1 4475701.65 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437849.58 4475406.23 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437870.48 4475910.16 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    438694.53  4474830.4 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    437673.88 4476030.67 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"
    end

    For the "Barrio" variable, I want to keep only the name and drop the leading numeric code such as "01-01". The highest value this code takes is "09-03".
    I also have a second requirement: I would like to merge this dataset with another one.

    However, the names are not in capital letters in my other dataset. For example, "PALACIO" is written "Palacio", "EL PILAR" is written "El Pilar", "CASCO HISTÓRICO DE VALLECAS" is written "Casco Histórico de Vallecas", and so on.

    Could anyone suggest a solution, please?

    Thank you very much in advance.
    Best,

    Michael

  • #2
    I tried this:

    Code:
    gen Barrio_without_numbers = regexs(1) if regexm(Barrio, "([A-Z][A-ZÁÉÍÓÚÜÑ]*[ ]?[A-ZÁÉÍÓÚÜÑ]*[ ]?[A-ZÁÉÍÓÚÜÑ]*)([A-ZÁÉÍÓÚÜÑ]+)*([A-ZÁÉÍÓÚÜÑ]+)*")
    Is that correct?

    Thank you.
    Michael



    • #3
      I am sure that there is a regular expression solution here, but yours is more complicated than either of two simpler approaches:

      either looking for the first space and selecting what follows,

      or removing the first "word" and trimming (words are separated by spaces or bound by double quotes).

      Code:
      * Example generated by -dataex-. For more info, type help dataex
      clear
      input double(Gis_X Gis_Y) str23 Distrito str26 Barrio str50 Calle
      439078.78 4473665.67 "01  CENTRO" "01-01 PALACIO" "SEGOVIA, RONDA, DE"                      
      437610.25 4475808.36 "09  MONCLOA-ARAVACA" "09-01 CASA DE CAMPO"           
      end
      
      gen Barrio2 = substr(Barrio, 1 + strpos(Barrio, " "), .)
      gen Barrio3 = trim(subinstr(Barrio, word(Barrio, 1), "", 1))
      
      list Barrio* 
      
           +-----------------------------------------------------+
           |              Barrio         Barrio2         Barrio3 |
           |-----------------------------------------------------|
        1. |       01-01 PALACIO         PALACIO         PALACIO |
        2. | 09-01 CASA DE CAMPO   CASA DE CAMPO   CASA DE CAMPO |
           +-----------------------------------------------------+
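      For comparison, a regular-expression version is also possible. This is only a sketch: the variable name Barrio4 is made up for illustration, and the pattern assumes every value starts with digits, a hyphen, more digits, and at least one space. It uses ustrregexrf(), which needs Stata 14 or later.

      ```stata
      * Sketch: strip a leading "NN-NN " code with a Unicode-aware regex.
      * Barrio4 is a hypothetical name; the pattern is an assumption about
      * how every Barrio value begins.
      clear
      input str26 Barrio
      "01-01 PALACIO"
      "09-01 CASA DE CAMPO"
      end

      * Replace the first match of the prefix pattern with an empty string
      gen Barrio4 = ustrregexrf(Barrio, "^[0-9]+-[0-9]+ +", "")

      list Barrio Barrio4
      ```

      The substr() and word() versions stay easier to read; the regex only pays off if the prefix format is less regular than in the examples shown.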
      Otherwise, see:

      Code:
      help string functions 
      
      help strproper()
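
      On the merge itself, one cautious option is to avoid matching capitalisation exactly and build a lower-case key in both datasets instead. The sketch below assumes Stata 14 or later; barrio_key and other_dataset are hypothetical names, and the other dataset would need the same key built from its own name variable. strproper() is documented as acting on ASCII letters only, and title-casing would probably give "De" rather than "de", so names such as "Casco Histórico de Vallecas" may not match after either.

      ```stata
      * Sketch: build a case-insensitive merge key from the cleaned name.
      * barrio_key and other_dataset are hypothetical names.
      clear
      input str40 Barrio
      "01-01 PALACIO"
      "09-01 CASA DE CAMPO"
      end

      * Drop the numeric prefix, as in the first approach above
      gen Barrio2 = substr(Barrio, 1 + strpos(Barrio, " "), .)

      * Unicode-aware trim and lower-casing, so accented letters are handled
      gen barrio_key = ustrlower(ustrtrim(Barrio2))

      list Barrio2 barrio_key

      * After creating barrio_key the same way in the other dataset:
      * merge m:1 barrio_key using other_dataset
      ```

      Checking the unmatched observations after the merge (tabulate _merge) should show whether spelling differences other than case remain.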



      • #4
        Good morning Nick Cox:

        Thanks for your feedback! That is clearer than the code I presented in #2.
        I didn't even know that "01-01" counted as a word.

        But it is logical, of course, as it is a string. Thanks!
        Have a nice day.

        Best,

        Michael
        Last edited by Michael Duarte Goncalves; 20 Oct 2023, 02:44.
