Announcement

Collapse
No announcement yet.
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • How forum software deals with Unicode whitespace

    The following dataex output contains several uchar(160):

    Code:
    * Example generated by -dataex-. To install: ssc install dataex
    clear
    input str16 A
    "1     1  "
    "1     2  "
    "1     3  "
    "1     4  "
    "1     5  "
    "1     6  "
    "1     7  "
    "1     8  "
    "1     9  "
    end
    Do these Unicode whitespace characters survive?

  • #2
    Apparently not!

    Code:
    . chartab A
    
       decimal  hexadecimal   character |     frequency    unique name
    ------------------------------------+-----------------------------
            32       \u0020             |            63    SPACE
            49       \u0031       1     |            10    DIGIT ONE
            50       \u0032       2     |             1    DIGIT TWO
            51       \u0033       3     |             1    DIGIT THREE
            52       \u0034       4     |             1    DIGIT FOUR
            53       \u0035       5     |             1    DIGIT FIVE
            54       \u0036       6     |             1    DIGIT SIX
            55       \u0037       7     |             1    DIGIT SEVEN
            56       \u0038       8     |             1    DIGIT EIGHT
            57       \u0039       9     |             1    DIGIT NINE
    ------------------------------------+-----------------------------
    
                                        freq. count   distinct
    ASCII characters              =              81         10
    Multibyte UTF-8 characters    =               0          0
    Unicode replacement character =               0          0
    Total Unicode characters      =              81         10

    Comment

    Working...
    X