I have a dataset that uses extended ascii to render characters in Spanish, Portuguese and Creole (in variable labels, value labels, and notes). I attempt to convert the file to unicode characters and receive an error. I have not worked extensively with the unicode commands and don't know how to identify issues with the dataset. Any help would be appreciated:
Code:
. unicode encoding set Latin1
(default encoding now Latin1)
. unicode translate myfile.dta
(using Latin1 encoding)
File summary (before starting):
1 file(s) specified
1 file(s) to be examined ...
File myfile.dta (Stata dataset)
all variable names okay, ASCII
all data labels translated
1281 variable labels okay, ASCII
0 variable labels okay, already UTF-8
605 variable labels translated
all value-label names okay, ASCII
examine_dta_vallabs_content(): 3301 subscript invalid
examine_dta_file(): - function returned error
examine_file(): - function returned error
do_examine_files(): - function returned error
unicode_do(): - function returned error
unicode_translate(): - function returned error
<istmt>: - function returned error
Comment