[Cuis-dev] https://github.com/KenDickey/Cuis-Smalltalk-UniCodePoint-Properties

ken.dickey at whidbey.com ken.dickey at whidbey.com
Sun May 15 15:32:52 PDT 2022


Now that we have Unicode string handling and display in Cuis, there may 
be a need for Unicode codepoint properties to support text processing.

World script systems have many kinds of spaces and punctuation, some 
characters are upper or lower case, and so forth.  Beyond ASCII, methods 
like #isSpace require extension.

Unicode codepoint property description tables are fairly large and may 
be unused, so we don't want them around all the time, hence the 
'System-Text-UnicodeSupport' package.

At present this package supplies a utility class named UniCode which 
supplies various utility methods. These methods typically take a 
Character or a UnicodeCodePoint.

E.g.

   UniCode isSpace: (UnicodeCodePoint codePoint: 32).
        "--> true"
   UniCode rangeInfo: (UnicodeCodePoint codePoint: 16r2E80).
        "-->  #('CJK' 'CJK Radicals Supplement') "

To try it out:
vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv
Feature require: #'System-Text-UnicodeSupport'.
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

I am sure there are bugs to fix and useful improvements to add.

Please let me know if you try this out and find it useful.

We live in hope,
-KenD



More information about the Cuis-dev mailing list