Character.st
author sr
Thu, 14 Sep 2017 12:35:46 +0200
changeset 22251 a1ebd9c9af37
parent 22018 b9f13bb90e55
child 22409 5a6cfd90e913
permissions -rw-r--r--
#BUGFIX by sr class: Character changed: #utf8BytesPerCharacter
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
21673
8e83b97e0909 #TUNING by stefan
Stefan Vogel <sv@exept.de>
parents: 21535
diff changeset
     1
"{ Encoding: utf8 }"
8e83b97e0909 #TUNING by stefan
Stefan Vogel <sv@exept.de>
parents: 21535
diff changeset
     2
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
     3
"
5
67342904af11 *** empty log message ***
claus
parents: 3
diff changeset
     4
 COPYRIGHT (c) 1988 by Claus Gittinger
154
d4236ec280a6 *** empty log message ***
claus
parents: 92
diff changeset
     5
	      All Rights Reserved
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
     6
a27a279701f8 Initial revision
claus
parents:
diff changeset
     7
 This software is furnished under a license and may be used
a27a279701f8 Initial revision
claus
parents:
diff changeset
     8
 only in accordance with the terms of that license and with the
a27a279701f8 Initial revision
claus
parents:
diff changeset
     9
 inclusion of the above copyright notice.   This software may not
a27a279701f8 Initial revision
claus
parents:
diff changeset
    10
 be provided or otherwise made available to, or used by, any
a27a279701f8 Initial revision
claus
parents:
diff changeset
    11
 other person.  No title to or ownership of the software is
a27a279701f8 Initial revision
claus
parents:
diff changeset
    12
 hereby transferred.
a27a279701f8 Initial revision
claus
parents:
diff changeset
    13
"
5407
d6729266a95b *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 5315
diff changeset
    14
"{ Package: 'stx:libbasic' }"
d6729266a95b *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 5315
diff changeset
    15
17346
a21eaa06b20b class: Character
Claus Gittinger <cg@exept.de>
parents: 17249
diff changeset
    16
"{ NameSpace: Smalltalk }"
a21eaa06b20b class: Character
Claus Gittinger <cg@exept.de>
parents: 17249
diff changeset
    17
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
    18
Magnitude subclass:#Character
995
b018368b3a94 asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents: 819
diff changeset
    19
	instanceVariableNames:'asciivalue'
22000
78991867cfeb #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 21864
diff changeset
    20
	classVariableNames:'CharacterTable Separators BOMCharacter'
995
b018368b3a94 asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents: 819
diff changeset
    21
	poolDictionaries:''
b018368b3a94 asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents: 819
diff changeset
    22
	category:'Magnitude-General'
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
    23
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
    24
2124
6238280f6120 new infoMessage scheme
Claus Gittinger <cg@exept.de>
parents: 1491
diff changeset
    25
!Character class methodsFor:'documentation'!
54
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
    26
88
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    27
copyright
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    28
"
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    29
 COPYRIGHT (c) 1988 by Claus Gittinger
154
d4236ec280a6 *** empty log message ***
claus
parents: 92
diff changeset
    30
	      All Rights Reserved
88
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    31
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    32
 This software is furnished under a license and may be used
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    33
 only in accordance with the terms of that license and with the
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    34
 inclusion of the above copyright notice.   This software may not
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    35
 be provided or otherwise made available to, or used by, any
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    36
 other person.  No title to or ownership of the software is
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    37
 hereby transferred.
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    38
"
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    39
!
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    40
54
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
    41
documentation
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
    42
"
1491
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
    43
    This class represents characters.
7897
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
    44
1491
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
    45
    Notice, that actual character objects are not used when characters
17249
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
    46
    are stored in strings, symbols etc.
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
    47
    These only store a character's asciiValue/codePoint for a more compact representation.
1491
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
    48
    The word 'asciiValue' is a historic leftover - actually, any integer
8028
4fecc8242a64 comment
Claus Gittinger <cg@exept.de>
parents: 8026
diff changeset
    49
    code is allowed and actually used (i.e. characters are not limited to 8bit).
17249
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
    50
    Also, the encoding is actually Unicode, of which ascii is a subset and the same encoding value
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
    51
    for the first 128 characters (codePoint 0 to 127 are the same in ascii).
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
    52
18215
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
    53
    Some heavily used Characters are kept as singletons; i.e. for every asciiValue (0..N),
17249
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
    54
    there exists exactly one instance of Character, which is shared.
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
    55
    Character value:xxx checks for this, and returns a reference to an existing instance.
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
    56
    For N<=255, this is guaranteed; i.e. in all Smalltalks, the single byte characters are always
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
    57
    handled like this, and you can therefore safely compare them using == (identity compare).
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
    58
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
    59
    Other characters (i.e. codepoint > N) are not guaranteed to be shared;
18215
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
    60
    i.e. these my or may not be created as required.
17249
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
    61
    Actually, do NOT depend on which characters are and which are not shared.
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
    62
    Always compare using #= if there is any chance of a non-ascii character being involved.
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
    63
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
    64
    Once again (because beginners sometimes make this mistake):
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
    65
	This means: you may compare characters using #== ONLY IFF you are certain,
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
    66
	that the characters ranges is 0..255.
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
    67
	Otherwise, you HAVE TO compare using #=. (if in doubt, always compare using #=).
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
    68
	Sorry for this inconvenience, but it is (practically) impossible to keep
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
    69
	the possible maximum of 2^32 characters (Unicode) around, for that convenience alone.
17249
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
    70
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
    71
    In ST/X, N is (currently) 1024. This means that all the latin characters and some others are
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
    72
    kept as singleton in the CharacterTable class variable (which is also used by the VM when characters
20088
9d16b5a4a6ac #OTHER by mawalch
mawalch
parents: 20016
diff changeset
    73
    are instantiated).
357
claus
parents: 345
diff changeset
    74
68
59faa75185ba *** empty log message ***
claus
parents: 54
diff changeset
    75
    Methods marked as (JS) come from the manchester Character goody
59faa75185ba *** empty log message ***
claus
parents: 54
diff changeset
    76
    (CharacterComparing) by Jan Steinman, which allow Characters to be used as
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
    77
    Interval elements (i.e. ($a to:$z) do:[...] );
1229
ace799c5765e commentary
Claus Gittinger <cg@exept.de>
parents: 1138
diff changeset
    78
    They are not a big deal, but convenient add-ons.
ace799c5765e commentary
Claus Gittinger <cg@exept.de>
parents: 1138
diff changeset
    79
    Some of these have been modified a bit.
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
    80
68
59faa75185ba *** empty log message ***
claus
parents: 54
diff changeset
    81
    WARNING: characters are known by compiler and runtime system -
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
    82
	     do not change the instance layout.
357
claus
parents: 345
diff changeset
    83
claus
parents: 345
diff changeset
    84
    Also, although you can create subclasses of Character, the compiler always
claus
parents: 345
diff changeset
    85
    creates instances of Character for literals ...
814
d4d28ca7afcd made the global CharacterTable a classVar of Character
Claus Gittinger <cg@exept.de>
parents: 760
diff changeset
    86
    ... and other classes are hard-wired to always return instances of characters
357
claus
parents: 345
diff changeset
    87
    in some cases (i.e. String>>at:, Symbol>>at: etc.).
claus
parents: 345
diff changeset
    88
    Therefore, it may not make sense to create a character-subclass.
1295
83f594f05c52 documentation
Claus Gittinger <cg@exept.de>
parents: 1271
diff changeset
    89
8028
4fecc8242a64 comment
Claus Gittinger <cg@exept.de>
parents: 8026
diff changeset
    90
    Case Mapping in Unicode:
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
    91
	There are a number of complications to case mappings that occur once the repertoire
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
    92
	of characters is expanded beyond ASCII.
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
    93
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
    94
	* Because of the inclusion of certain composite characters for compatibility,
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
    95
	  such as U+01F1 'DZ' capital dz, there is a third case, called titlecase,
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
    96
	  which is used where the first letter of a word is to be capitalized
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
    97
	  (e.g. Titlecase, vs. UPPERCASE, or lowercase).
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
    98
	  For example, the title case of the example character is U+01F2 'Dz' capital d with small z.
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
    99
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   100
	* Case mappings may produce strings of different length than the original.
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   101
	  For example, the German character U+00DF small letter sharp s expands when uppercased to
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   102
	  the sequence of two characters 'SS'.
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   103
	  This also occurs where there is no precomposed character corresponding to a case mapping.
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   104
	  *** This is not yet implemented (in 5.2) ***
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   105
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   106
	* Characters may also have different case mappings, depending on the context.
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   107
	  For example, U+03A3 capital sigma lowercases to U+03C3 small sigma if it is not followed
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   108
	  by another letter, but lowercases to 03C2 small final sigma if it is.
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   109
	  *** This is not yet implemented (in 5.2) ***
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   110
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   111
	* Characters may have case mappings that depend on the locale.
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   112
	  For example, in Turkish the letter 0049 'I' capital letter i lowercases to 0131 small dotless i.
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   113
	  *** This is not yet implemented (in 5.2) ***
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   114
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   115
	* Case mappings are not, in general, reversible.
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   116
	  For example, once the string 'McGowan' has been uppercased, lowercased or titlecased,
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   117
	  the original cannot be recovered by applying another uppercase, lowercase, or titlecase operation.
8028
4fecc8242a64 comment
Claus Gittinger <cg@exept.de>
parents: 8026
diff changeset
   118
4fecc8242a64 comment
Claus Gittinger <cg@exept.de>
parents: 8026
diff changeset
   119
    Collation Sequence:
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   120
	*** This is not yet implemented (in 5.2) ***
8028
4fecc8242a64 comment
Claus Gittinger <cg@exept.de>
parents: 8026
diff changeset
   121
1295
83f594f05c52 documentation
Claus Gittinger <cg@exept.de>
parents: 1271
diff changeset
   122
    [author:]
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   123
	Claus Gittinger
1295
83f594f05c52 documentation
Claus Gittinger <cg@exept.de>
parents: 1271
diff changeset
   124
83f594f05c52 documentation
Claus Gittinger <cg@exept.de>
parents: 1271
diff changeset
   125
    [see also:]
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   126
	String TwoByteString Unicode16String Unicode32String
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   127
	StringCollection Text
54
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
   128
"
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
   129
! !
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   130
2124
6238280f6120 new infoMessage scheme
Claus Gittinger <cg@exept.de>
parents: 1491
diff changeset
   131
!Character class methodsFor:'instance creation'!
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   132
a27a279701f8 Initial revision
claus
parents:
diff changeset
   133
basicNew
a27a279701f8 Initial revision
claus
parents:
diff changeset
   134
    "catch new - Characters cannot be created with new"
a27a279701f8 Initial revision
claus
parents:
diff changeset
   135
a27a279701f8 Initial revision
claus
parents:
diff changeset
   136
    ^ self error:'Characters cannot be created with new'
a27a279701f8 Initial revision
claus
parents:
diff changeset
   137
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
   138
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   139
codePoint:anInteger
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   140
    "return a character with codePoint anInteger"
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   141
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   142
%{  /* NOCONTEXT */
18240
28af09029a8b ifdef for SCHTEAM engine changed (not relevant for ST/X)
Claus Gittinger <cg@exept.de>
parents: 18215
diff changeset
   143
#ifdef __SCHTEAM__
18215
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
   144
    {
22002
a56e43b3b432 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 22000
diff changeset
   145
        char ch = (char)(context.stArg(0).intValue("[codePoint:]"));
a56e43b3b432 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 22000
diff changeset
   146
a56e43b3b432 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 22000
diff changeset
   147
        return context._RETURN(STCharacter._new(ch));
18215
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
   148
    }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
   149
    /* NOTREACHED */
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
   150
#else
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   151
    INT __codePoint;
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   152
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   153
    if (__isSmallInteger(anInteger)) {
22002
a56e43b3b432 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 22000
diff changeset
   154
        __codePoint = ( __smallIntegerVal(anInteger) );
a56e43b3b432 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 22000
diff changeset
   155
        if ((unsigned INT)(__codePoint) <= MAX_IMMEDIATE_CHARACTER /* (__codePoint >= 0) && (__codePoint <= 255) */) {
a56e43b3b432 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 22000
diff changeset
   156
            RETURN ( __MKCHARACTER(__codePoint) );
a56e43b3b432 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 22000
diff changeset
   157
        }
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   158
    }
18215
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
   159
#endif
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   160
%}.
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   161
    (anInteger between:0 and:(CharacterTable size - 1)) ifTrue:[
22002
a56e43b3b432 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 22000
diff changeset
   162
        ^ CharacterTable at:(anInteger + 1)
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   163
    ].
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   164
    (anInteger between:16r100 and:16r3FFFFFFF) ifTrue:[
22002
a56e43b3b432 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 22000
diff changeset
   165
        (anInteger = 16rFEFF) ifTrue:[^ self byteOrderMark]. 
a56e43b3b432 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 22000
diff changeset
   166
        ^ super basicNew setCodePoint:anInteger
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   167
    ].
22002
a56e43b3b432 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 22000
diff changeset
   168
    
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   169
    "
22002
a56e43b3b432 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 22000
diff changeset
   170
     a character's codePoint must be 0..16r3FFFFFFF.
a56e43b3b432 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 22000
diff changeset
   171
     (i.e. only characters with up-to 30 bits are allowed, 
a56e43b3b432 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 22000
diff changeset
   172
      which is way more than needed...)
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   173
    "
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   174
    RangeError raiseWith:anInteger errorString:'invalid codePoint for character'
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   175
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   176
    "
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   177
      self codePoint:16r34.
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   178
      self codePoint:16r3455.
22002
a56e43b3b432 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 22000
diff changeset
   179
      (self codePoint:16rFEFF) == (self codePoint:16rFEFF).
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   180
      self codePoint:16rFFFFFFFFFFFFFFFFFFF.
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   181
    "
22002
a56e43b3b432 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 22000
diff changeset
   182
a56e43b3b432 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 22000
diff changeset
   183
    "Modified (comment): / 13-07-2017 / 12:18:14 / cg"
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   184
!
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   185
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   186
digitValue:anInteger
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   187
    "return a character that corresponds to anInteger.
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   188
     0-9 map to $0-$9, 10-35 map to $A-$Z"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   189
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   190
    |val "{ Class: SmallInteger }" |
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   191
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   192
    val := anInteger.
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   193
    (val between:0 and:9) ifTrue:[
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   194
	^ Character codePoint:(val + ($0 codePoint))
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   195
    ].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   196
    (val between:10 and:35) ifTrue:[
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   197
	^ Character codePoint:(val + ($A codePoint - 10))
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   198
    ].
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   199
    ^ self error:'value not in range 0 to 35'
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   200
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   201
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   202
utf8DecodeFrom:aStream
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   203
    "read and return a single unicode character from an UTF8 encoded stream"
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   204
18605
4f0a114fed00 class: Character
Stefan Vogel <sv@exept.de>
parents: 18595
diff changeset
   205
    |fetchNext c1 c2 codePoint|
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   206
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   207
    c1 := aStream next.
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   208
    codePoint := c1 codePoint.
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   209
    codePoint <= 16r7F ifTrue:[
18656
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   210
	"/ 0xxxxxxx - 7 bits
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   211
	^ c1 asCharacter.
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   212
    ].
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   213
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   214
    (codePoint bitAnd:2r11000000) == 2r10000000 ifTrue:[
18656
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   215
	"/ out of sync (got an intermediate character)
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   216
	InvalidEncodingError raiseRequestWith:codePoint errorString:' - out of sync'.
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   217
	^ c1 asCharacter.
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   218
    ].
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   219
18605
4f0a114fed00 class: Character
Stefan Vogel <sv@exept.de>
parents: 18595
diff changeset
   220
    fetchNext := [  |code|
18656
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   221
		    code := aStream next codePoint.
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   222
		    (code bitAnd:2r11000000) == 2r10000000 ifFalse:[
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   223
			"/ followup chars must have 2r10 in high bits
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   224
			InvalidEncodingError raiseRequestWith:code.
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   225
			^ c1 asCharacter.
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   226
		    ].
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   227
		    code bitAnd:16r3F
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   228
		 ].
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   229
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   230
    (codePoint bitAnd:2r11100000) == 2r11000000 ifTrue:[
18656
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   231
	"/ 110xxxxx 10xxxxxx - 11 bits
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   232
	codePoint := codePoint bitAnd:16r1F.
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   233
	codePoint := (codePoint bitShift:6) bitOr:(fetchNext value).
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   234
	codePoint <= 16r7F ifTrue:[
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   235
	    InvalidEncodingError raiseRequestWith:codePoint.
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   236
	].
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   237
	^ Character codePoint:codePoint
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   238
    ].
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   239
    (codePoint bitAnd:2r11110000) == 2r11100000 ifTrue:[
18656
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   240
	"/ 1110xxxx 10xxxxxx 10xxxxxx - 16 bits
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   241
	codePoint := codePoint bitAnd:16r0F.
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   242
	codePoint := (codePoint bitShift:6) bitOr:(fetchNext value).
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   243
	codePoint := (codePoint bitShift:6) bitOr:(fetchNext value).
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   244
	codePoint <= 16r7FF ifTrue:[
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   245
	    InvalidEncodingError raiseRequestWith:codePoint.
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   246
	].
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   247
	^ Character codePoint:codePoint
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   248
    ].
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   249
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   250
    (codePoint bitAnd:2r11111000) == 2r11110000 ifTrue:[
18656
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   251
	"/ 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx - 21 bits
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   252
	codePoint := codePoint bitAnd:16r07.
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   253
	codePoint := (codePoint bitShift:6) bitOr:(fetchNext value).
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   254
	codePoint := (codePoint bitShift:6) bitOr:(fetchNext value).
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   255
	codePoint := (codePoint bitShift:6) bitOr:(fetchNext value).
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   256
	codePoint <= 16rFFFF ifTrue:[
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   257
	    InvalidEncodingError raiseRequestWith:codePoint.
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   258
	].
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   259
	^ Character codePoint:codePoint
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   260
    ].
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   261
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   262
    (codePoint bitAnd:2r11111100) == 2r11111000 ifTrue:[
18656
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   263
	"/ 111110xx 10xxxxxx 10xxxxxx 10xxxxxx 10xxxxxx - 26 bits
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   264
	codePoint := codePoint bitAnd:16r03.
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   265
	codePoint := (codePoint bitShift:6) bitOr:(fetchNext value).
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   266
	codePoint := (codePoint bitShift:6) bitOr:(fetchNext value).
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   267
	codePoint := (codePoint bitShift:6) bitOr:(fetchNext value).
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   268
	codePoint := (codePoint bitShift:6) bitOr:(fetchNext value).
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   269
	codePoint <= 16r1FFFFF ifTrue:[
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   270
	    InvalidEncodingError raiseRequestWith:codePoint.
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   271
	].
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   272
	^ Character codePoint:codePoint
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   273
    ].
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   274
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   275
    (codePoint bitAnd:2r11111110) == 2r11111100 ifTrue:[
18656
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   276
	"/ 1111110x ... 10xxxxxx - any number of bits
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   277
	codePoint := codePoint bitAnd:16r01.
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   278
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   279
	c2 := aStream peek.
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   280
	[c2 notNil and:[(c2 codePoint bitAnd:2r11000000) == 2r10000000]] whileTrue:[
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   281
	    codePoint := (codePoint bitShift:6) bitOr:(c2 codePoint bitAnd:16r3F).
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   282
	    aStream next.
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   283
	    c2 := aStream peek.
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   284
	].
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   285
	codePoint <= 16r3FFFFFF ifTrue:[
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   286
	    InvalidEncodingError raiseRequestWith:codePoint.
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   287
	].
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   288
	^ Character codePoint:codePoint
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   289
    ].
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   290
11325
b2ba4174deb8 Raise InvalidEncodingError when utf8 decode fails
Stefan Vogel <sv@exept.de>
parents: 11321
diff changeset
   291
    InvalidEncodingError raiseRequestWith:codePoint.
18605
4f0a114fed00 class: Character
Stefan Vogel <sv@exept.de>
parents: 18595
diff changeset
   292
    ^ c1 asCharacter.
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   293
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   294
    "
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   295
      Character utf8DecodeFrom:'a' readStream
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   296
      Character utf8DecodeFrom:#[195 188] asString readStream
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   297
    "
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   298
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   299
    "test:
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   300
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   301
      |utf8Encoding original readBack|
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   302
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   303
      1 to:16rFFFF do:[:codePoint |
18656
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   304
	original := Character value:codePoint.
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   305
	utf8Encoding := original utf8Encoded.
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   306
	readBack := Character utf8DecodeFrom:(utf8Encoding readStream).
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   307
	readBack codePoint = codePoint ifFalse:[
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   308
	    self halt
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
   309
	]
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   310
      ]
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   311
    "
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   312
!
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   313
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   314
value:anInteger
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   315
    "return a character with codePoint anInteger - backward compatibility"
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   316
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   317
    ^ self codePoint:anInteger
5945
Claus Gittinger <cg@exept.de>
parents: 5847
diff changeset
   318
! !
Claus Gittinger <cg@exept.de>
parents: 5847
diff changeset
   319
2214
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   320
!Character class methodsFor:'accessing untypeable characters'!
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   321
15689
76245e4ca77b class: Character
Claus Gittinger <cg@exept.de>
parents: 15262
diff changeset
   322
controlCharacter:char
76245e4ca77b class: Character
Claus Gittinger <cg@exept.de>
parents: 15262
diff changeset
   323
    "Answer the Character representing ctrl-char.
76245e4ca77b class: Character
Claus Gittinger <cg@exept.de>
parents: 15262
diff changeset
   324
     ctrl-a -> 1; ctrl-@ -> 0"
76245e4ca77b class: Character
Claus Gittinger <cg@exept.de>
parents: 15262
diff changeset
   325
21864
d2ad44002ad2 #BUGFIX by cg
Claus Gittinger <cg@exept.de>
parents: 21673
diff changeset
   326
    |lcChar|
d2ad44002ad2 #BUGFIX by cg
Claus Gittinger <cg@exept.de>
parents: 21673
diff changeset
   327
d2ad44002ad2 #BUGFIX by cg
Claus Gittinger <cg@exept.de>
parents: 21673
diff changeset
   328
    char == $@ ifTrue:[^ Character value:0 ].
d2ad44002ad2 #BUGFIX by cg
Claus Gittinger <cg@exept.de>
parents: 21673
diff changeset
   329
    
d2ad44002ad2 #BUGFIX by cg
Claus Gittinger <cg@exept.de>
parents: 21673
diff changeset
   330
    lcChar := char asUppercase.
d2ad44002ad2 #BUGFIX by cg
Claus Gittinger <cg@exept.de>
parents: 21673
diff changeset
   331
    self assert:(lcChar between:$A and:$Z+5).
d2ad44002ad2 #BUGFIX by cg
Claus Gittinger <cg@exept.de>
parents: 21673
diff changeset
   332
    ^ self codePoint:(lcChar - $A + 1)
15689
76245e4ca77b class: Character
Claus Gittinger <cg@exept.de>
parents: 15262
diff changeset
   333
76245e4ca77b class: Character
Claus Gittinger <cg@exept.de>
parents: 15262
diff changeset
   334
    "
21864
d2ad44002ad2 #BUGFIX by cg
Claus Gittinger <cg@exept.de>
parents: 21673
diff changeset
   335
     self controlCharacter:$@ -> 0
d2ad44002ad2 #BUGFIX by cg
Claus Gittinger <cg@exept.de>
parents: 21673
diff changeset
   336
     self controlCharacter:$a -> 1
d2ad44002ad2 #BUGFIX by cg
Claus Gittinger <cg@exept.de>
parents: 21673
diff changeset
   337
     self controlCharacter:$d -> 4
d2ad44002ad2 #BUGFIX by cg
Claus Gittinger <cg@exept.de>
parents: 21673
diff changeset
   338
     self controlCharacter:$z -> 26
d2ad44002ad2 #BUGFIX by cg
Claus Gittinger <cg@exept.de>
parents: 21673
diff changeset
   339
     self controlCharacter:$[ -> 27    
d2ad44002ad2 #BUGFIX by cg
Claus Gittinger <cg@exept.de>
parents: 21673
diff changeset
   340
     self controlCharacter:$\ -> 28     
d2ad44002ad2 #BUGFIX by cg
Claus Gittinger <cg@exept.de>
parents: 21673
diff changeset
   341
     self controlCharacter:$] -> 29
d2ad44002ad2 #BUGFIX by cg
Claus Gittinger <cg@exept.de>
parents: 21673
diff changeset
   342
     self controlCharacter:$_ -> 31     
15689
76245e4ca77b class: Character
Claus Gittinger <cg@exept.de>
parents: 15262
diff changeset
   343
    "
21864
d2ad44002ad2 #BUGFIX by cg
Claus Gittinger <cg@exept.de>
parents: 21673
diff changeset
   344
d2ad44002ad2 #BUGFIX by cg
Claus Gittinger <cg@exept.de>
parents: 21673
diff changeset
   345
    "Modified (comment): / 20-06-2017 / 17:20:21 / cg"
15689
76245e4ca77b class: Character
Claus Gittinger <cg@exept.de>
parents: 15262
diff changeset
   346
!
76245e4ca77b class: Character
Claus Gittinger <cg@exept.de>
parents: 15262
diff changeset
   347
2214
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   348
endOfInput
11144
ba697d3f940c category change
Claus Gittinger <cg@exept.de>
parents: 10948
diff changeset
   349
    "Answer the Character representing ctrl-d (Unix-EOF)."
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   350
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   351
    ^ self codePoint:4
2214
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   352
!
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   353
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   354
leftParenthesis
8097
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
   355
    "Answer the Character representing a left parenthesis."
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   356
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   357
    ^ self codePoint:40
2214
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   358
!
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   359
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   360
period
8097
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
   361
    "Answer the Character representing a carriage period."
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   362
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   363
    ^ self codePoint:46
2214
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   364
!
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   365
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   366
poundSign
11144
ba697d3f940c category change
Claus Gittinger <cg@exept.de>
parents: 10948
diff changeset
   367
    "Answer the Character representing a pound sign (hash)."
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   368
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   369
    ^ self codePoint:35
2214
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   370
!
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   371
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   372
rightParenthesis
8097
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
   373
    "Answer the Character representing a right parenthesis."
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   374
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   375
    ^ self codePoint:41
2214
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   376
! !
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   377
2124
6238280f6120 new infoMessage scheme
Claus Gittinger <cg@exept.de>
parents: 1491
diff changeset
   378
!Character class methodsFor:'constants'!
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   379
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   380
backspace
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   381
    "return the backspace character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   382
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   383
    ^ Character codePoint:8
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   384
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   385
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   386
bell
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   387
    "return the bell character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   388
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   389
    ^ Character codePoint:7
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   390
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   391
22000
78991867cfeb #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 21864
diff changeset
   392
byteOrderMark
22002
a56e43b3b432 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 22000
diff changeset
   393
    "the unicode BOM character as a singleton"
22000
78991867cfeb #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 21864
diff changeset
   394
78991867cfeb #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 21864
diff changeset
   395
    BOMCharacter isNil ifTrue:[
22018
b9f13bb90e55 #BUGFIX by cg
Claus Gittinger <cg@exept.de>
parents: 22002
diff changeset
   396
        BOMCharacter := super basicNew setCodePoint:16rFEFF
22000
78991867cfeb #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 21864
diff changeset
   397
    ].
78991867cfeb #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 21864
diff changeset
   398
    ^ BOMCharacter
78991867cfeb #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 21864
diff changeset
   399
22002
a56e43b3b432 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 22000
diff changeset
   400
    "
a56e43b3b432 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 22000
diff changeset
   401
     self byteOrderMark
22018
b9f13bb90e55 #BUGFIX by cg
Claus Gittinger <cg@exept.de>
parents: 22002
diff changeset
   402
     self codePoint:16rFEFF
22002
a56e43b3b432 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 22000
diff changeset
   403
    "
a56e43b3b432 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 22000
diff changeset
   404
22000
78991867cfeb #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 21864
diff changeset
   405
    "Created: / 13-07-2017 / 12:04:34 / cg"
22018
b9f13bb90e55 #BUGFIX by cg
Claus Gittinger <cg@exept.de>
parents: 22002
diff changeset
   406
    "Modified (comment): / 14-07-2017 / 15:22:14 / cg"
22000
78991867cfeb #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 21864
diff changeset
   407
!
78991867cfeb #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 21864
diff changeset
   408
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   409
cr
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   410
    "return the lineEnd character
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   411
     - actually (in unix) this is a newline character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   412
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   413
    ^ Character codePoint:10
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   414
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   415
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   416
del
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   417
    "return the delete character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   418
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   419
    ^ Character codePoint:16r7F
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   420
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   421
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   422
doubleQuote
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   423
    "return the double-quote character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   424
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   425
    ^ Character codePoint:34
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   426
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   427
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   428
esc
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   429
    "return the escape character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   430
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   431
    ^ Character codePoint:27
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   432
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   433
10428
fm
parents: 9654
diff changeset
   434
etx
fm
parents: 9654
diff changeset
   435
    "return the end-of-text character"
fm
parents: 9654
diff changeset
   436
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   437
    ^ Character codePoint:3
10428
fm
parents: 9654
diff changeset
   438
!
fm
parents: 9654
diff changeset
   439
7688
8494a4155037 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7354
diff changeset
   440
euro
7689
3f4870a7f12e *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7688
diff changeset
   441
    "The Euro currency sign (notice: not all fonts support it).
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   442
     The Unicode encoding is U+20AC"
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   443
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   444
    ^ Character codePoint:16r20AC
7689
3f4870a7f12e *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7688
diff changeset
   445
3f4870a7f12e *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7688
diff changeset
   446
    "
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   447
     Transcript font:(Font family:'courier' size:12 encoding:'iso10646-1').
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   448
     Transcript showCR:Character euro
7976
1ce9243a8601 euro encoding
Claus Gittinger <cg@exept.de>
parents: 7951
diff changeset
   449
    "
1ce9243a8601 euro encoding
Claus Gittinger <cg@exept.de>
parents: 7951
diff changeset
   450
    "
7689
3f4870a7f12e *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7688
diff changeset
   451
     0 to:255 do:[:i |
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
   452
	Transcript
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
   453
	    show:'| '; show:((i printStringRadix:16) leftPaddedTo:2);
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
   454
	    show:' | '; show:(i printStringPaddedTo:3);
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
   455
	    show:' | '; show:(Character value:i);
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
   456
	    cr.
7689
3f4870a7f12e *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7688
diff changeset
   457
     ]
3f4870a7f12e *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7688
diff changeset
   458
    "
7688
8494a4155037 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7354
diff changeset
   459
!
8494a4155037 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7354
diff changeset
   460
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   461
excla
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   462
    "return the exclamation-mark character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   463
    ^ $!!
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   464
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
   465
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   466
ff
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   467
    "return the form-feed character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   468
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   469
    ^ Character codePoint:12
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   470
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   471
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   472
lf
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   473
    "return the newline/linefeed character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   474
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   475
    ^ Character codePoint:10
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   476
!
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   477
4340
523ef8410fad added #linefeed - squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4337
diff changeset
   478
linefeed
523ef8410fad added #linefeed - squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4337
diff changeset
   479
    "squeak compatibility: return the newline/linefeed character"
523ef8410fad added #linefeed - squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4337
diff changeset
   480
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   481
    ^ Character codePoint:10
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   482
!
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   483
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   484
maxImmediateCodePoint
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   485
    "return the maximum codePoint until which the characters are shared"
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   486
%{ /* NOCONTEXT */
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   487
    RETURN(__mkSmallInteger(MAX_IMMEDIATE_CHARACTER));
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   488
%}.
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   489
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   490
    "
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   491
      self maxImmediateCodePoint
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   492
    "
4340
523ef8410fad added #linefeed - squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4337
diff changeset
   493
!
523ef8410fad added #linefeed - squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4337
diff changeset
   494
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   495
maxValue
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   496
    "return the maximum codePoint a character may have"
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   497
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   498
    ^ 16r3FFFFFFF
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   499
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   500
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   501
newPage
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   502
    "return the form-feed character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   503
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   504
    ^ Character codePoint:12
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   505
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   506
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   507
nl
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   508
    "return the newline character"
68
59faa75185ba *** empty log message ***
claus
parents: 54
diff changeset
   509
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   510
    ^ Character codePoint:10
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   511
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   512
6324
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   513
null
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   514
    ^ Character codePoint:0
6324
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   515
!
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   516
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   517
quote
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   518
    "return the single-quote character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   519
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   520
    ^ Character codePoint:39
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   521
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   522
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   523
return
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   524
    "return the return character.
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   525
     In ST/X, this is different from cr - for Unix reasons."
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   526
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   527
    ^ Character codePoint:13
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   528
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   529
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   530
space
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   531
    "return the blank character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   532
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   533
    ^ Character codePoint:32
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   534
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   535
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   536
tab
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   537
    "return the tabulator character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   538
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   539
    ^ Character codePoint:9
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   540
! !
a27a279701f8 Initial revision
claus
parents:
diff changeset
   541
2124
6238280f6120 new infoMessage scheme
Claus Gittinger <cg@exept.de>
parents: 1491
diff changeset
   542
!Character class methodsFor:'primitive input'!
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   543
a27a279701f8 Initial revision
claus
parents:
diff changeset
   544
fromUser
357
claus
parents: 345
diff changeset
   545
    "return a character from the keyboard (C's standard input stream)
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   546
     - this should only be used for emergency evaluators and the like."
a27a279701f8 Initial revision
claus
parents:
diff changeset
   547
a27a279701f8 Initial revision
claus
parents:
diff changeset
   548
%{  /* NOCONTEXT */
18691
0b9e9bc60d61 class: Character
Claus Gittinger <cg@exept.de>
parents: 18658
diff changeset
   549
#   include <errno.h>
0b9e9bc60d61 class: Character
Claus Gittinger <cg@exept.de>
parents: 18658
diff changeset
   550
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   551
    int c;
a27a279701f8 Initial revision
claus
parents:
diff changeset
   552
18691
0b9e9bc60d61 class: Character
Claus Gittinger <cg@exept.de>
parents: 18658
diff changeset
   553
    for (;;) {
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   554
	c = getchar();
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   555
	if (c >= 0) break;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   556
	if (errno != EINTR) {
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   557
	    RETURN (nil);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   558
	}
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   559
    }
1133
961f2b095c22 underline cleanup
Claus Gittinger <cg@exept.de>
parents: 1007
diff changeset
   560
    RETURN ( __MKCHARACTER(c & 0xFF) );
5433
aa2365ee12cf migration support
Claus Gittinger <cg@exept.de>
parents: 5423
diff changeset
   561
%}.
19378
93bf71d3fc81 #UI_ENHANCEMENT
Claus Gittinger <cg@exept.de>
parents: 19195
diff changeset
   562
    ^ Processor activeProcess stdin next
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   563
! !
a27a279701f8 Initial revision
claus
parents:
diff changeset
   564
2124
6238280f6120 new infoMessage scheme
Claus Gittinger <cg@exept.de>
parents: 1491
diff changeset
   565
!Character class methodsFor:'queries'!
3
24d81bf47225 *** empty log message ***
claus
parents: 2
diff changeset
   566
4337
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   567
allCharacters
17249
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
   568
    "added for squeak compatibility: return a collection of all singleton chars.
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
   569
     Notice, for memory efficiency reasons, only some of the low-codepoint characters
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
   570
     are actually kept as singletons. less frequently used character instances are created on the fly,
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
   571
     as wide string elements are accessed (and hopefully garbage collected sooner or later)"
4337
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   572
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   573
    ^ CharacterTable
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   574
4337
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   575
    "
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   576
     Character allCharacters
4337
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   577
    "
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   578
!
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   579
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
   580
hasSharedInstances
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
   581
    "return true if this class has shared instances, that is, instances
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
   582
     with the same value are identical.
9148
b33cbf0648d6 comment
Claus Gittinger <cg@exept.de>
parents: 8885
diff changeset
   583
     Although not always shared (TwoByte CodePoint-Characters), these should be treated
b33cbf0648d6 comment
Claus Gittinger <cg@exept.de>
parents: 8885
diff changeset
   584
     so, to be independent of the number of the underlying implementation"
2672
dc3662188b2c added #hasImmediateInstances for VW compatibility
Claus Gittinger <cg@exept.de>
parents: 2561
diff changeset
   585
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
   586
    ^ true
2672
dc3662188b2c added #hasImmediateInstances for VW compatibility
Claus Gittinger <cg@exept.de>
parents: 2561
diff changeset
   587
!
dc3662188b2c added #hasImmediateInstances for VW compatibility
Claus Gittinger <cg@exept.de>
parents: 2561
diff changeset
   588
3
24d81bf47225 *** empty log message ***
claus
parents: 2
diff changeset
   589
isBuiltInClass
1271
a11ef7b9c780 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1229
diff changeset
   590
    "return true if this class is known by the run-time-system.
a11ef7b9c780 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1229
diff changeset
   591
     Here, true is returned for myself, false for subclasses."
3
24d81bf47225 *** empty log message ***
claus
parents: 2
diff changeset
   592
24d81bf47225 *** empty log message ***
claus
parents: 2
diff changeset
   593
    ^ self == Character
1271
a11ef7b9c780 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1229
diff changeset
   594
a11ef7b9c780 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1229
diff changeset
   595
    "Modified: 23.4.1996 / 15:56:39 / cg"
4337
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   596
!
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   597
8206
485f3b8d4c81 #isLegalUnicodeCodePoint:
Stefan Vogel <sv@exept.de>
parents: 8144
diff changeset
   598
isLegalUnicodeCodePoint:anInteger
485f3b8d4c81 #isLegalUnicodeCodePoint:
Stefan Vogel <sv@exept.de>
parents: 8144
diff changeset
   599
    "answer true, if anInteger is a valid unicode code point"
485f3b8d4c81 #isLegalUnicodeCodePoint:
Stefan Vogel <sv@exept.de>
parents: 8144
diff changeset
   600
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   601
    "Range 16rD800 - 16rDFFF is reserved for the
8207
12131fc77a99 #isLegalUnicodeCodePoint: - Fix comment
Stefan Vogel <sv@exept.de>
parents: 8206
diff changeset
   602
     lower and upper substitution page for UCS-16"
8206
485f3b8d4c81 #isLegalUnicodeCodePoint:
Stefan Vogel <sv@exept.de>
parents: 8144
diff changeset
   603
    (anInteger >= 16rD800) ifTrue:[
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   604
	(anInteger <= 16rDFFF) ifTrue:[
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   605
	    ^ false.
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   606
	].
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   607
	(anInteger == 16rFFFE) ifTrue:[
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   608
	    ^ false.
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   609
	].
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   610
	(anInteger == 16rFFFF) ifTrue:[
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   611
	    ^ false.
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   612
	].
8206
485f3b8d4c81 #isLegalUnicodeCodePoint:
Stefan Vogel <sv@exept.de>
parents: 8144
diff changeset
   613
    ].
485f3b8d4c81 #isLegalUnicodeCodePoint:
Stefan Vogel <sv@exept.de>
parents: 8144
diff changeset
   614
    ^ true
485f3b8d4c81 #isLegalUnicodeCodePoint:
Stefan Vogel <sv@exept.de>
parents: 8144
diff changeset
   615
!
485f3b8d4c81 #isLegalUnicodeCodePoint:
Stefan Vogel <sv@exept.de>
parents: 8144
diff changeset
   616
4337
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   617
separators
17249
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
   618
    "return a collection of separator chars.
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
   619
     Added for squeak compatibility"
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
   620
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
   621
    Separators isNil ifTrue:[
18215
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
   622
	Separators := Array
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
   623
	    with:Character space
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
   624
	    with:Character return
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
   625
	    "/ with:Character cr
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
   626
	    with:Character tab
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
   627
	    with:Character lf
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
   628
	    with:Character ff
17249
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
   629
    ].
5e3ec80d0d0a class: Character
Claus Gittinger <cg@exept.de>
parents: 17188
diff changeset
   630
    ^ Separators
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   631
4337
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   632
    "
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   633
     Character separators
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   634
    "
3
24d81bf47225 *** empty log message ***
claus
parents: 2
diff changeset
   635
! !
24d81bf47225 *** empty log message ***
claus
parents: 2
diff changeset
   636
7261
f35fc9cee675 method category rename
Claus Gittinger <cg@exept.de>
parents: 7257
diff changeset
   637
!Character methodsFor:'Compatibility-Dolphin'!
6324
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   638
7351
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   639
isAlphaNumeric
18805
7e090beaf16a #DOCUMENTATION
Claus Gittinger <cg@exept.de>
parents: 18691
diff changeset
   640
    "Compatibility method for dolphin and VSE - do not use in new code.
11956
ca9071118979 comments
Claus Gittinger <cg@exept.de>
parents: 11933
diff changeset
   641
     Return true, if I am a letter or a digit
7351
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   642
     Please use isLetterOrDigit for compatibility reasons (which is ANSI)."
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   643
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   644
    ^ self isLetterOrDigit
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   645
!
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   646
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   647
isAlphabetic
11956
ca9071118979 comments
Claus Gittinger <cg@exept.de>
parents: 11933
diff changeset
   648
    "Compatibility method - do not use in new code.
ca9071118979 comments
Claus Gittinger <cg@exept.de>
parents: 11933
diff changeset
   649
     Return true, if I am a letter.
7351
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   650
     Please use isLetter for compatibility reasons (which is ANSI)."
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   651
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   652
    ^ self isLetter
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   653
!
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   654
6324
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   655
isControl
11956
ca9071118979 comments
Claus Gittinger <cg@exept.de>
parents: 11933
diff changeset
   656
    "Compatibility method - do not use in new code.
ca9071118979 comments
Claus Gittinger <cg@exept.de>
parents: 11933
diff changeset
   657
     Return true if I am a control character (i.e. ascii value < 32)"
7353
a5227413a23a comment
Claus Gittinger <cg@exept.de>
parents: 7351
diff changeset
   658
a5227413a23a comment
Claus Gittinger <cg@exept.de>
parents: 7351
diff changeset
   659
    ^ self isControlCharacter
6324
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   660
!
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   661
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   662
isHexDigit
7354
b22823ab2232 comment
Claus Gittinger <cg@exept.de>
parents: 7353
diff changeset
   663
    "return true if I am a valid hexadecimal digit"
b22823ab2232 comment
Claus Gittinger <cg@exept.de>
parents: 7353
diff changeset
   664
6324
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   665
    ^ '0123456789abcdefABCDEF' includes:self
7354
b22823ab2232 comment
Claus Gittinger <cg@exept.de>
parents: 7353
diff changeset
   666
b22823ab2232 comment
Claus Gittinger <cg@exept.de>
parents: 7353
diff changeset
   667
    "
b22823ab2232 comment
Claus Gittinger <cg@exept.de>
parents: 7353
diff changeset
   668
     $a isHexDigit
b22823ab2232 comment
Claus Gittinger <cg@exept.de>
parents: 7353
diff changeset
   669
    "
6324
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   670
!
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   671
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   672
isPunctuation
11956
ca9071118979 comments
Claus Gittinger <cg@exept.de>
parents: 11933
diff changeset
   673
    "Compatibility method - do not use in new code.
ca9071118979 comments
Claus Gittinger <cg@exept.de>
parents: 11933
diff changeset
   674
     The code below is not unicode aware"
7897
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
   675
6327
6f90155dd142 #isPunctuation
Claus Gittinger <cg@exept.de>
parents: 6324
diff changeset
   676
    ^ (asciivalue between:16r21 and:16r40)
6f90155dd142 #isPunctuation
Claus Gittinger <cg@exept.de>
parents: 6324
diff changeset
   677
      or:[ (asciivalue between:16r5B and:16r60)
6f90155dd142 #isPunctuation
Claus Gittinger <cg@exept.de>
parents: 6324
diff changeset
   678
      or:[ (asciivalue between:123 and:126)
6f90155dd142 #isPunctuation
Claus Gittinger <cg@exept.de>
parents: 6324
diff changeset
   679
      or:[ (asciivalue between:161 and:191)
6f90155dd142 #isPunctuation
Claus Gittinger <cg@exept.de>
parents: 6324
diff changeset
   680
      or:[ (asciivalue == 215 )
6f90155dd142 #isPunctuation
Claus Gittinger <cg@exept.de>
parents: 6324
diff changeset
   681
      or:[ (asciivalue == 247 ) ]]]]]
6324
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   682
! !
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   683
21374
f7820456972c #FEATURE by stefan
Stefan Vogel <sv@exept.de>
parents: 20706
diff changeset
   684
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   685
!Character methodsFor:'accessing'!
a27a279701f8 Initial revision
claus
parents:
diff changeset
   686
8097
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
   687
codePoint
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
   688
    "return the codePoint of myself.
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   689
     Traditionally, this was named 'asciiValue';
8097
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
   690
     however, characters are not limited to 8bit characters."
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
   691
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
   692
    ^ asciivalue
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
   693
!
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
   694
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   695
instVarAt:index put:anObject
54
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
   696
    "catch instvar access - asciivalue may not be changed"
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   697
a27a279701f8 Initial revision
claus
parents:
diff changeset
   698
    self error:'Characters may not be modified'
a27a279701f8 Initial revision
claus
parents:
diff changeset
   699
! !
a27a279701f8 Initial revision
claus
parents:
diff changeset
   700
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   701
!Character methodsFor:'arithmetic'!
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   702
a27a279701f8 Initial revision
claus
parents:
diff changeset
   703
+ aMagnitude
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   704
    "Return the Character that is <aMagnitude> higher than the receiver.
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   705
     Wrap if the resulting value is not a legal Character value. (JS)"
a27a279701f8 Initial revision
claus
parents:
diff changeset
   706
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   707
    ^ Character codePoint:((asciivalue + aMagnitude asInteger) \\ 16r3FFFFFFF)
1491
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   708
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   709
    "
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   710
     $A + 5
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   711
    "
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   712
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   713
    "Modified: 27.6.1996 / 12:34:51 / cg"
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   714
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
   715
a27a279701f8 Initial revision
claus
parents:
diff changeset
   716
- aMagnitude
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   717
    "Return the Character that is <aMagnitude> lower than the receiver.
68
59faa75185ba *** empty log message ***
claus
parents: 54
diff changeset
   718
     Wrap if the resulting value is not a legal Character value. (JS)
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   719
     claus:
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   720
	modified to return the difference as integer, if the argument
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   721
	is another character. If the argument is a number, a character is
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   722
	returned."
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   723
68
59faa75185ba *** empty log message ***
claus
parents: 54
diff changeset
   724
    aMagnitude isCharacter ifTrue:[
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   725
	^ asciivalue - aMagnitude asInteger
68
59faa75185ba *** empty log message ***
claus
parents: 54
diff changeset
   726
    ].
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   727
    ^ Character codePoint:((asciivalue - aMagnitude asInteger) \\ 16r3FFFFFFF)
68
59faa75185ba *** empty log message ***
claus
parents: 54
diff changeset
   728
59faa75185ba *** empty log message ***
claus
parents: 54
diff changeset
   729
    "
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   730
     $z - $a
68
59faa75185ba *** empty log message ***
claus
parents: 54
diff changeset
   731
     $d - 3
59faa75185ba *** empty log message ***
claus
parents: 54
diff changeset
   732
    "
1491
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   733
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   734
    "Modified: 27.6.1996 / 12:35:34 / cg"
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   735
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
   736
a27a279701f8 Initial revision
claus
parents:
diff changeset
   737
// aMagnitude
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   738
    "Return the Character who's value is the receiver divided by <aMagnitude>.
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   739
     Wrap if the resulting value is not a legal Character value. (JS)"
a27a279701f8 Initial revision
claus
parents:
diff changeset
   740
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   741
    ^ Character codePoint:(asciivalue // aMagnitude asInteger \\ 16r3FFFFFFF)
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   742
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
   743
a27a279701f8 Initial revision
claus
parents:
diff changeset
   744
\\ aMagnitude
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   745
    "Return the Character who's value is the receiver modulo <aMagnitude>.
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   746
     Wrap if the resulting value is not a legal Character value. (JS)"
a27a279701f8 Initial revision
claus
parents:
diff changeset
   747
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
   748
    ^ Character codePoint:(asciivalue \\ aMagnitude asInteger \\ 16r3FFFFFFF)
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   749
! !
a27a279701f8 Initial revision
claus
parents:
diff changeset
   750
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   751
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   752
!Character methodsFor:'comparing'!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   753
8143
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
   754
< aMagnitude
7799
99753997e677 comments
Claus Gittinger <cg@exept.de>
parents: 7715
diff changeset
   755
    "return true, if the arguments asciiValue is greater than the receiver's"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   756
8143
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
   757
    ^ (asciivalue < aMagnitude asInteger)
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   758
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   759
8143
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
   760
<= aMagnitude
7799
99753997e677 comments
Claus Gittinger <cg@exept.de>
parents: 7715
diff changeset
   761
    "return true, if the arguments asciiValue is greater or equal to the receiver's"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   762
8143
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
   763
    ^ (asciivalue <= aMagnitude asInteger)
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   764
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   765
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   766
= aCharacter
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   767
    "return true, if the argument, aCharacter is the same character
8143
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
   768
     Redefined to take care of character sizes > 8bit."
18298
541b114e644a schteam
Claus Gittinger <cg@exept.de>
parents: 18268
diff changeset
   769
%{
541b114e644a schteam
Claus Gittinger <cg@exept.de>
parents: 18268
diff changeset
   770
#ifdef __SCHTEAM__
541b114e644a schteam
Claus Gittinger <cg@exept.de>
parents: 18268
diff changeset
   771
    if (self == aCharacter) {
541b114e644a schteam
Claus Gittinger <cg@exept.de>
parents: 18268
diff changeset
   772
	return context._RETURN_true();
541b114e644a schteam
Claus Gittinger <cg@exept.de>
parents: 18268
diff changeset
   773
    }
541b114e644a schteam
Claus Gittinger <cg@exept.de>
parents: 18268
diff changeset
   774
    if (aCharacter.characterP() == STObject.True) {
541b114e644a schteam
Claus Gittinger <cg@exept.de>
parents: 18268
diff changeset
   775
	if (self.charValue() == aCharacter.charValue()) {
541b114e644a schteam
Claus Gittinger <cg@exept.de>
parents: 18268
diff changeset
   776
	    return context._RETURN_true();
541b114e644a schteam
Claus Gittinger <cg@exept.de>
parents: 18268
diff changeset
   777
	} else {
541b114e644a schteam
Claus Gittinger <cg@exept.de>
parents: 18268
diff changeset
   778
	    return context._RETURN_false();
541b114e644a schteam
Claus Gittinger <cg@exept.de>
parents: 18268
diff changeset
   779
	}
541b114e644a schteam
Claus Gittinger <cg@exept.de>
parents: 18268
diff changeset
   780
    }
541b114e644a schteam
Claus Gittinger <cg@exept.de>
parents: 18268
diff changeset
   781
#endif /* not SCHTEAM */
541b114e644a schteam
Claus Gittinger <cg@exept.de>
parents: 18268
diff changeset
   782
%}.
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   783
    self == aCharacter ifTrue:[^ true].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   784
    aCharacter isCharacter ifFalse:[^ false].
8143
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
   785
    ^ asciivalue = aCharacter codePoint
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
   786
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
   787
    "
18215
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
   788
	$A = (Character value:65)
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
   789
	$A = (Character codePoint:65)
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
   790
	$A = ($B-1)
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
   791
	$A = 65
8143
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
   792
    "
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   793
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   794
8143
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
   795
> aMagnitude
7799
99753997e677 comments
Claus Gittinger <cg@exept.de>
parents: 7715
diff changeset
   796
    "return true, if the arguments asciiValue is less than the receiver's"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   797
8143
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
   798
    ^ (asciivalue > aMagnitude asInteger)
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   799
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   800
8143
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
   801
>= aMagnitude
7799
99753997e677 comments
Claus Gittinger <cg@exept.de>
parents: 7715
diff changeset
   802
    "return true, if the arguments asciiValue is less or equal to the receiver's"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   803
8143
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
   804
    ^ (asciivalue >= aMagnitude asInteger)
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   805
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   806
5540
7e0811b2ded9 added #hash
Claus Gittinger <cg@exept.de>
parents: 5473
diff changeset
   807
hash
7e0811b2ded9 added #hash
Claus Gittinger <cg@exept.de>
parents: 5473
diff changeset
   808
    "return an integer useful for hashing"
7e0811b2ded9 added #hash
Claus Gittinger <cg@exept.de>
parents: 5473
diff changeset
   809
7e0811b2ded9 added #hash
Claus Gittinger <cg@exept.de>
parents: 5473
diff changeset
   810
    ^ asciivalue
7e0811b2ded9 added #hash
Claus Gittinger <cg@exept.de>
parents: 5473
diff changeset
   811
!
7e0811b2ded9 added #hash
Claus Gittinger <cg@exept.de>
parents: 5473
diff changeset
   812
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   813
identityHash
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   814
    "return an integer useful for hashing on identity"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   815
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   816
%{
8100
1bcc9d4ee4bc Fix #identityHash
Stefan Vogel <sv@exept.de>
parents: 8097
diff changeset
   817
    INT __codePoint;
1bcc9d4ee4bc Fix #identityHash
Stefan Vogel <sv@exept.de>
parents: 8097
diff changeset
   818
1bcc9d4ee4bc Fix #identityHash
Stefan Vogel <sv@exept.de>
parents: 8097
diff changeset
   819
    __codePoint = __smallIntegerVal(__INST(asciivalue));
1bcc9d4ee4bc Fix #identityHash
Stefan Vogel <sv@exept.de>
parents: 8097
diff changeset
   820
1bcc9d4ee4bc Fix #identityHash
Stefan Vogel <sv@exept.de>
parents: 8097
diff changeset
   821
    if (__codePoint <= MAX_IMMEDIATE_CHARACTER) {
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   822
	RETURN ( __mkSmallInteger(__codePoint + 4096) );
8100
1bcc9d4ee4bc Fix #identityHash
Stefan Vogel <sv@exept.de>
parents: 8097
diff changeset
   823
    }
1bcc9d4ee4bc Fix #identityHash
Stefan Vogel <sv@exept.de>
parents: 8097
diff changeset
   824
%}.
1bcc9d4ee4bc Fix #identityHash
Stefan Vogel <sv@exept.de>
parents: 8097
diff changeset
   825
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   826
    ^ super identityHash
8100
1bcc9d4ee4bc Fix #identityHash
Stefan Vogel <sv@exept.de>
parents: 8097
diff changeset
   827
1bcc9d4ee4bc Fix #identityHash
Stefan Vogel <sv@exept.de>
parents: 8097
diff changeset
   828
    "
1bcc9d4ee4bc Fix #identityHash
Stefan Vogel <sv@exept.de>
parents: 8097
diff changeset
   829
      $a identityHash.
1bcc9d4ee4bc Fix #identityHash
Stefan Vogel <sv@exept.de>
parents: 8097
diff changeset
   830
      (Character value:1234) identityHash
1bcc9d4ee4bc Fix #identityHash
Stefan Vogel <sv@exept.de>
parents: 8097
diff changeset
   831
    "
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   832
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   833
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   834
sameAs:aCharacter
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   835
    "return true, if the argument, aCharacter is the same character,
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   836
     ignoring case differences."
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   837
21673
8e83b97e0909 #TUNING by stefan
Stefan Vogel <sv@exept.de>
parents: 21535
diff changeset
   838
    "Note: stc inlines codePoint, so this is faster than compare with #=
8e83b97e0909 #TUNING by stefan
Stefan Vogel <sv@exept.de>
parents: 21535
diff changeset
   839
     and works also for chars > 8bit"
8e83b97e0909 #TUNING by stefan
Stefan Vogel <sv@exept.de>
parents: 21535
diff changeset
   840
    self codePoint == aCharacter codePoint ifTrue:[^ true].
8e83b97e0909 #TUNING by stefan
Stefan Vogel <sv@exept.de>
parents: 21535
diff changeset
   841
    ^ self asLowercase codePoint == aCharacter asLowercase codePoint.
14663
a33ac1c6c69f class: Character
Stefan Vogel <sv@exept.de>
parents: 14120
diff changeset
   842
a33ac1c6c69f class: Character
Stefan Vogel <sv@exept.de>
parents: 14120
diff changeset
   843
    "
a33ac1c6c69f class: Character
Stefan Vogel <sv@exept.de>
parents: 14120
diff changeset
   844
      (Character value:345) sameAs:(Character value:345)
21673
8e83b97e0909 #TUNING by stefan
Stefan Vogel <sv@exept.de>
parents: 21535
diff changeset
   845
      $Ж sameAs:$ж 
8e83b97e0909 #TUNING by stefan
Stefan Vogel <sv@exept.de>
parents: 21535
diff changeset
   846
      $ж sameAs:$Ж 
14663
a33ac1c6c69f class: Character
Stefan Vogel <sv@exept.de>
parents: 14120
diff changeset
   847
    "
21673
8e83b97e0909 #TUNING by stefan
Stefan Vogel <sv@exept.de>
parents: 21535
diff changeset
   848
8e83b97e0909 #TUNING by stefan
Stefan Vogel <sv@exept.de>
parents: 21535
diff changeset
   849
    "Modified (comment): / 28-03-2017 / 16:19:48 / stefan"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   850
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   851
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   852
~= aCharacter
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   853
    "return true, if the argument, aCharacter is not the same character
8143
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
   854
     Redefined to take care of character sizes > 8bit."
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   855
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   856
    self == aCharacter ifTrue:[^ false].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   857
    aCharacter isCharacter ifFalse:[^ true].
8097
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
   858
    ^ (asciivalue ~~ aCharacter codePoint)
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   859
! !
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   860
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   861
!Character methodsFor:'converting'!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   862
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   863
asCharacter
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   864
    "usually sent to integers, but redefined here to allow integers
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   865
     and characters to be used commonly without a need for a test."
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   866
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   867
    ^ self
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   868
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   869
    "
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   870
     32 asCharacter
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   871
    "
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   872
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   873
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   874
asInteger
8143
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
   875
    "the same as #codePoint.
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
   876
     Use #asInteger, if you need protocol compatibility with Numbers etc..
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
   877
     Use #codePoint in any other case for better stc optimization"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   878
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   879
    ^ asciivalue
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   880
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   881
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   882
asLowercase
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   883
    "return a character with same letter as the receiver, but in lowercase.
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
   884
     Returns the receiver if it is already lowercase or if there is no lowercase equivalent.
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
   885
     CAVEAT:
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   886
	for now, this method is only correct for unicode characters up to u+1d6ff (Unicode3.1).
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   887
	(which is more than mozilla does, btw. ;-)"
7989
7907420b2fab asUppercase / asLowercase for U0100..U04FF
Claus Gittinger <cg@exept.de>
parents: 7988
diff changeset
   888
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   889
%{
18240
28af09029a8b ifdef for SCHTEAM engine changed (not relevant for ST/X)
Claus Gittinger <cg@exept.de>
parents: 18215
diff changeset
   890
#ifdef __SCHTEAM__
18215
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
   891
    {
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   892
	char ch = self.charValue("[asLowercase]");
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   893
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   894
	ch = java.lang.Character.toLowerCase(ch);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
   895
	return context._RETURN(STCharacter._new(ch));
18215
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
   896
    }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
   897
    /* NOTREACHED */
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
   898
#else
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
   899
    static int __mapping[] = {
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
   900
    /* From    To             Every   Diff   */
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   901
       0x0041, ((0x19 << 8) | 0x01), 0x0020  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   902
       0x00c0, ((0x16 << 8) | 0x01), 0x0020  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   903
       0x00d8, ((0x06 << 8) | 0x01), 0x0020  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   904
       0x0100, ((0x2e << 8) | 0x02), 0x0001  ,
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
   905
       0x0130, ((0x00 << 8) | 0x00), -199   ,
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   906
       0x0132, ((0x04 << 8) | 0x02), 0x0001  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   907
       0x0139, ((0x0e << 8) | 0x02), 0x0001  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   908
       0x014a, ((0x2c << 8) | 0x02), 0x0001  ,
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
   909
       0x0178, ((0x00 << 8) | 0x00), -121   ,
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   910
       0x0179, ((0x04 << 8) | 0x02), 0x0001  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   911
       0x0181, ((0x00 << 8) | 0x00), 0x00d2  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   912
       0x0182, ((0x02 << 8) | 0x02), 0x0001  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   913
       0x0186, ((0x00 << 8) | 0x00), 0x00ce  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   914
       0x0187, ((0x00 << 8) | 0x00), 0x0001  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   915
       0x0189, ((0x01 << 8) | 0x01), 0x00cd  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   916
       0x018b, ((0x00 << 8) | 0x00), 0x0001  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   917
       0x018e, ((0x00 << 8) | 0x00), 0x004f  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   918
       0x018f, ((0x00 << 8) | 0x00), 0x00ca  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   919
       0x0190, ((0x00 << 8) | 0x00), 0x00cb  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   920
       0x0191, ((0x00 << 8) | 0x00), 0x0001  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   921
       0x0193, ((0x00 << 8) | 0x00), 0x00cd  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   922
       0x0194, ((0x00 << 8) | 0x00), 0x00cf  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   923
       0x0196, ((0x00 << 8) | 0x00), 0x00d3  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   924
       0x0197, ((0x00 << 8) | 0x00), 0x00d1  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   925
       0x0198, ((0x00 << 8) | 0x00), 0x0001  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   926
       0x019c, ((0x00 << 8) | 0x00), 0x00d3  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   927
       0x019d, ((0x00 << 8) | 0x00), 0x00d5  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   928
       0x019f, ((0x00 << 8) | 0x00), 0x00d6  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   929
       0x01a0, ((0x04 << 8) | 0x02), 0x0001  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   930
       0x01a6, ((0x00 << 8) | 0x00), 0x00da  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   931
       0x01a7, ((0x00 << 8) | 0x00), 0x0001  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   932
       0x01a9, ((0x00 << 8) | 0x00), 0x00da  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   933
       0x01ac, ((0x00 << 8) | 0x00), 0x0001  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   934
       0x01ae, ((0x00 << 8) | 0x00), 0x00da  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   935
       0x01af, ((0x00 << 8) | 0x00), 0x0001  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   936
       0x01b1, ((0x01 << 8) | 0x01), 0x00d9  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   937
       0x01b3, ((0x02 << 8) | 0x02), 0x0001  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   938
       0x01b7, ((0x00 << 8) | 0x00), 0x00db  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   939
       0x01b8, ((0x04 << 8) | 0x04), 0x0001  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   940
       0x01c4, ((0x00 << 8) | 0x00), 0x0002  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   941
       0x01c5, ((0x00 << 8) | 0x00), 0x0001  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   942
       0x01c7, ((0x00 << 8) | 0x00), 0x0002  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   943
       0x01c8, ((0x00 << 8) | 0x00), 0x0001  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   944
       0x01ca, ((0x00 << 8) | 0x00), 0x0002  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   945
       0x01cb, ((0x10 << 8) | 0x02), 0x0001  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   946
       0x01de, ((0x10 << 8) | 0x02), 0x0001  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   947
       0x01f1, ((0x00 << 8) | 0x00), 0x0002  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   948
       0x01f2, ((0x02 << 8) | 0x02), 0x0001  ,
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
   949
       0x01f6, ((0x00 << 8) | 0x00), -97   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
   950
       0x01f7, ((0x00 << 8) | 0x00), -56   ,
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   951
       0x01f8, ((0x26 << 8) | 0x02), 0x0001  ,
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
   952
#ifndef UNICODE_3_2
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
   953
       0x0220, ((0x00 << 8) | 0x00), -130  ,          /* Unicode4.0 - not in X fonts - sigh */
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
   954
#endif
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   955
       0x0222, ((0x10 << 8) | 0x02), 0x0001  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   956
       0x0386, ((0x00 << 8) | 0x00), 0x0026  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   957
       0x0388, ((0x02 << 8) | 0x01), 0x0025  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   958
       0x038c, ((0x00 << 8) | 0x00), 0x0040  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   959
       0x038e, ((0x01 << 8) | 0x01), 0x003f  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   960
       0x0391, ((0x10 << 8) | 0x01), 0x0020  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   961
       0x03a3, ((0x08 << 8) | 0x01), 0x0020  ,
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
   962
#ifndef UNICODE_3_2
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
   963
       0x03d8, ((0x00 << 8) | 0x00), 1  ,             /* Unicode4.0 - not in X fonts - sigh */
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
   964
#endif
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   965
       0x03da, ((0x14 << 8) | 0x02), 0x0001  ,
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
   966
       0x03f4, ((0x00 << 8) | 0x00), -60   ,
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
   967
#ifndef UNICODE_3_2
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
   968
       0x03f7, ((0x03 << 8) | 0x03), 1  ,             /* Unicode4.0 - not in X fonts - sigh */
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
   969
       0x03f9, ((0x00 << 8) | 0x00), -7 ,             /* Unicode4.0 - not in X fonts - sigh */
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
   970
#endif
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   971
       0x0400, ((0x0f << 8) | 0x01), 0x0050  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   972
       0x0410, ((0x1f << 8) | 0x01), 0x0020  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   973
       0x0460, ((0x20 << 8) | 0x02), 0x0001  ,
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
   974
#ifndef UNICODE_3_2
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
   975
       0x048a, ((0x00 << 8) | 0x00), 1 ,              /* Unicode4.0 - not in X fonts - sigh */
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
   976
#endif
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   977
       0x048c, ((0x32 << 8) | 0x02), 0x0001  ,
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
   978
#ifdef UNICODE_3_2
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   979
       0x04c1, ((0x02 << 8) | 0x02), 0x0001  ,
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
   980
#else
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
   981
       0x04c1, ((0x04 << 8) | 0x02), 0x0001  ,        /* Unicode4.0 - not in X fonts - sigh */
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
   982
#endif
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
   983
#ifdef UNICODE_3_2
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   984
       0x04c7, ((0x04 << 8) | 0x04), 0x0001  ,
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
   985
#else
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
   986
       0x04c7, ((0x04 << 8) | 0x02), 0x0001  ,
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
   987
       0x04cd, ((0x00 << 8) | 0x00), 0x0001  ,
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
   988
#endif
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   989
       0x04d0, ((0x24 << 8) | 0x02), 0x0001  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   990
       0x04f8, ((0x00 << 8) | 0x00), 0x0001  ,
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
   991
#ifndef UNICODE_3_2
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
   992
       0x0500, ((0x0E << 8) | 0x02), 1  ,
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
   993
#endif
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   994
       0x0531, ((0x25 << 8) | 0x01), 0x0030  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   995
       0x1e00, ((0x94 << 8) | 0x02), 0x0001  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
   996
       0x1ea0, ((0x58 << 8) | 0x02), 0x0001  ,
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
   997
       0x1f08, ((0x07 << 8) | 0x01), -8   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
   998
       0x1f18, ((0x05 << 8) | 0x01), -8   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
   999
       0x1f28, ((0x07 << 8) | 0x01), -8   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1000
       0x1f38, ((0x07 << 8) | 0x01), -8   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1001
       0x1f48, ((0x05 << 8) | 0x01), -8   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1002
       0x1f59, ((0x06 << 8) | 0x02), -8   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1003
       0x1f68, ((0x07 << 8) | 0x01), -8   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1004
       0x1f88, ((0x07 << 8) | 0x01), -8   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1005
       0x1f98, ((0x07 << 8) | 0x01), -8   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1006
       0x1fa8, ((0x07 << 8) | 0x01), -8   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1007
       0x1fb8, ((0x01 << 8) | 0x01), -8   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1008
       0x1fba, ((0x01 << 8) | 0x01), -74  ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1009
       0x1fbc, ((0x00 << 8) | 0x00), -9   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1010
       0x1fc8, ((0x03 << 8) | 0x01), -86  ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1011
       0x1fcc, ((0x00 << 8) | 0x00), -9   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1012
       0x1fd8, ((0x01 << 8) | 0x01), -8   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1013
       0x1fda, ((0x01 << 8) | 0x01), -100 ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1014
       0x1fe8, ((0x01 << 8) | 0x01), -8   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1015
       0x1fea, ((0x01 << 8) | 0x01), -112 ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1016
       0x1fec, ((0x00 << 8) | 0x00), -7   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1017
       0x1ff8, ((0x01 << 8) | 0x01), -128 ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1018
       0x1ffa, ((0x01 << 8) | 0x01), -126 ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1019
       0x1ffc, ((0x00 << 8) | 0x00), -9   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1020
       0x2126, ((0x00 << 8) | 0x00), -7517   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1021
       0x212a, ((0x00 << 8) | 0x00), -8383   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1022
       0x212b, ((0x00 << 8) | 0x00), -8262   ,
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1023
       0x2160, ((0x0f << 8) | 0x01), 0x0010  ,
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1024
       0x24b6, ((0x19 << 8) | 0x01), 0x001a  ,
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1025
       0xff21, ((0x19 << 8) | 0x01), 0x0020  ,
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1026
       0x10400, ((0x27 << 8) | 0x01), 0x0028
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1027
    };
1491
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
  1028
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1029
    REGISTER unsigned INT __codePoint;
8308
19a958b21ac0 no const for hpux
Claus Gittinger <cg@exept.de>
parents: 8222
diff changeset
  1030
    REGISTER int *  __p;
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1031
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1032
    __codePoint = __intVal(__INST(asciivalue));
18691
0b9e9bc60d61 class: Character
Claus Gittinger <cg@exept.de>
parents: 18658
diff changeset
  1033
0b9e9bc60d61 class: Character
Claus Gittinger <cg@exept.de>
parents: 18658
diff changeset
  1034
    // comon ascii stuff first
0b9e9bc60d61 class: Character
Claus Gittinger <cg@exept.de>
parents: 18658
diff changeset
  1035
    if (__codePoint < 0x80) {
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1036
	if ((__codePoint >= 'A') && (__codePoint <= 'Z')) {
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1037
	    unsigned int newCodePoint = __codePoint - 'A' + 'a';
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1038
	    RETURN (__MKCHARACTER(newCodePoint)) ;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1039
	}
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1040
	RETURN (self);
18691
0b9e9bc60d61 class: Character
Claus Gittinger <cg@exept.de>
parents: 18658
diff changeset
  1041
    }
0b9e9bc60d61 class: Character
Claus Gittinger <cg@exept.de>
parents: 18658
diff changeset
  1042
8106
ee222c1314e6 cannot make pointer arith with a void *
Claus Gittinger <cg@exept.de>
parents: 8100
diff changeset
  1043
    for (__p = __mapping; (char *)__p < ((char *)__mapping) + sizeof(__mapping); __p += 3) {
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1044
	unsigned rangeStart, rangeSize, rangeEnd, mod;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1045
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1046
	rangeStart = (unsigned)__p[0];
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1047
	if (__codePoint < rangeStart) break;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1048
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1049
	rangeSize = ((unsigned)__p[1]) >> 8;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1050
	rangeEnd = rangeStart + rangeSize;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1051
	if (__codePoint <= rangeEnd) {
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1052
	    mod = __p[1] & 0xFF;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1053
	    if ((mod == 0) || (((__codePoint - rangeStart) % mod) == 0)) {
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1054
		OBJ newChar;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1055
		unsigned newCodePoint;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1056
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1057
		newCodePoint = __codePoint + __p[2];
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1058
		if (newCodePoint <= MAX_IMMEDIATE_CHARACTER) {
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1059
		    RETURN (__MKCHARACTER(newCodePoint)) ;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1060
		}
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1061
		newChar = __MKUCHARACTER(newCodePoint) ;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1062
		if (newChar == nil) goto allocationError;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1063
		RETURN (newChar) ;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1064
	    }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1065
	}
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1066
    }
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1067
    RETURN (self);
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1068
allocationError: ;
18240
28af09029a8b ifdef for SCHTEAM engine changed (not relevant for ST/X)
Claus Gittinger <cg@exept.de>
parents: 18215
diff changeset
  1069
#endif /* ! __SCHTEAM__ */
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1070
%}.
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1071
    ^ AllocationFailure raise.
1491
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
  1072
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
  1073
    "
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1074
     $A asLowercase
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1075
     $a asLowercase
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1076
     (Character value:16r01F5) asUppercase asLowercase
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1077
     (Character value:16r0205) asUppercase asLowercase
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1078
     (Character value:16r03B1) asUppercase asLowercase
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1079
     (Character value:16r1E00) asLowercase
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1080
    "
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1081
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1082
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1083
asString
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1084
    "return a string of len 1 with myself as contents"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1085
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1086
%{  /* NOCONTEXT */
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1087
    char buffer[2];
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1088
    OBJ s;
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1089
    unsigned INT val;
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1090
15262
5047292c9107 all stx macros begin with double underline (eg. __qClass instead of _qClass)
Claus Gittinger <cg@exept.de>
parents: 14684
diff changeset
  1091
    val = __intVal(__characterVal(self));
995
b018368b3a94 asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents: 819
diff changeset
  1092
    if (val <= 0xFF) {
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1093
	buffer[0] = (char) val;
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1094
	buffer[1] = '\0';
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1095
	s = __MKSTRING_L(buffer, 1);
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1096
	if (s != nil) {
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1097
	    RETURN (s);
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1098
	}
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1099
    }
8004
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1100
    if (val <= 0xFFFF) {
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1101
	s = __MKEMPTYU16STRING(1);
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1102
	if (s != nil) {
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1103
	    __Unicode16StringInstPtr(s)->s_element[0] = val;
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1104
	    RETURN (s);
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1105
	}
8004
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1106
    }
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1107
%}.
7951
062ac899852e care for 4-byte characters
Claus Gittinger <cg@exept.de>
parents: 7897
diff changeset
  1108
    asciivalue > 16rFF ifTrue:[
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1109
	asciivalue > 16rFFFF ifTrue:[
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1110
	    ^ (Unicode32String new:1) at:1 put:self; yourself
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1111
	].
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1112
	^ (Unicode16String new:1) at:1 put:self; yourself
995
b018368b3a94 asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents: 819
diff changeset
  1113
    ].
b018368b3a94 asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents: 819
diff changeset
  1114
5407
d6729266a95b *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 5315
diff changeset
  1115
    ^ (String new:1) at:1 put:self; yourself.
8004
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1116
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1117
    "
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1118
     (Character value:16rB5) asString
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1119
     (Character value:16r1B5) asString
8004
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1120
    "
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1121
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1122
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1123
asSymbol
17184
f3ea3eef5857 class: Character
Claus Gittinger <cg@exept.de>
parents: 16744
diff changeset
  1124
    "Return a unique symbol with the name taken from the receiver's characters.
9229
74fe3012c7f6 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9153
diff changeset
  1125
     Here, a single character symbol is returned."
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1126
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1127
    ^ Symbol internCharacter:self
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1128
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1129
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1130
asTitlecase
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1131
    "return a character with same letter as the receiver, but in titlecase.
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1132
     Returns the receiver if it is already titlecase or if there is no titlecase equivalent."
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1133
8026
96cc838078ba comment
Claus Gittinger <cg@exept.de>
parents: 8022
diff changeset
  1134
    "
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1135
     For example, in Unicode, character U+01F3 is LATIN SMALL LETTER DZ.
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1136
     (Let us write this compound character using ASCII as 'dz'.)
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1137
     This character uppercases to character U+01F1, LATIN CAPITAL LETTER DZ.
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1138
     (Which is basically 'DZ'.)
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1139
     But it titlecases to to character U+01F2, LATIN CAPITAL LETTER D WITH SMALL LETTER Z.
8026
96cc838078ba comment
Claus Gittinger <cg@exept.de>
parents: 8022
diff changeset
  1140
     (Which we can write 'Dz'.)
96cc838078ba comment
Claus Gittinger <cg@exept.de>
parents: 8022
diff changeset
  1141
96cc838078ba comment
Claus Gittinger <cg@exept.de>
parents: 8022
diff changeset
  1142
      character uppercase titlecase
96cc838078ba comment
Claus Gittinger <cg@exept.de>
parents: 8022
diff changeset
  1143
      --------- --------- ---------
96cc838078ba comment
Claus Gittinger <cg@exept.de>
parents: 8022
diff changeset
  1144
      dz        DZ        Dz
96cc838078ba comment
Claus Gittinger <cg@exept.de>
parents: 8022
diff changeset
  1145
    "
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1146
    |ch|
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1147
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1148
%{
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1149
    static unsigned short __mapping[] = {
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1150
       0x01C4,    0x01C5,
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1151
       0x01C6,    0x01C5,
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1152
       0x01C7,    0x01C8,
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1153
       0x01C9,    0x01C8,
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1154
       0x01CA,    0x01CB,
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1155
       0x01CC,    0x01CB,
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1156
       0x01F1,    0x01F2,
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1157
       0x01F3,    0x01F2,
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1158
    };
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1159
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1160
    REGISTER unsigned INT __codePoint;
8885
57c672cb3a64 Fix signed/unsigned clash
Stefan Vogel <sv@exept.de>
parents: 8830
diff changeset
  1161
    REGISTER unsigned short *__p;
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1162
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1163
    __codePoint = __intVal(__INST(asciivalue));
18691
0b9e9bc60d61 class: Character
Claus Gittinger <cg@exept.de>
parents: 18658
diff changeset
  1164
    if ((__codePoint > 0x01C0) && (__codePoint < 0x01FF)) {
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1165
	for (__p = __mapping; (char *)__p < ((char *)__mapping) + sizeof(__mapping); __p += 2) {
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1166
	    if ((__codePoint == __p[0]) || (__codePoint == __p[1])) {
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1167
		short newCodePoint;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1168
		OBJ newChar;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1169
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1170
		newCodePoint = __p[1];
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1171
		if (newCodePoint == __codePoint) {
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1172
		    RETURN (self);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1173
		}
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1174
		if (newCodePoint <= MAX_IMMEDIATE_CHARACTER) {
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1175
		    RETURN (__MKCHARACTER(newCodePoint)) ;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1176
		}
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1177
		newChar = __MKUCHARACTER(newCodePoint) ;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1178
		if (newChar == nil) goto getOutOfHere;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1179
		RETURN (newChar) ;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1180
	    }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1181
	}
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1182
    }
18691
0b9e9bc60d61 class: Character
Claus Gittinger <cg@exept.de>
parents: 18658
diff changeset
  1183
    if (__codePoint < 0x80) {
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1184
	// do it here for common ascii characters
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1185
	if ((__codePoint >= 'a') && (__codePoint <= 'z')) {
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1186
	    unsigned char newCodePoint = __codePoint - 'a' + 'A';
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1187
	    RETURN (__MKCHARACTER(newCodePoint)) ;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1188
	}
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1189
	RETURN (self) ;
18691
0b9e9bc60d61 class: Character
Claus Gittinger <cg@exept.de>
parents: 18658
diff changeset
  1190
    }
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1191
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1192
    ch = self;
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1193
getOutOfHere: ;
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1194
%}.
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1195
    ch notNil ifTrue:[
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1196
	^ ch asUppercase.
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1197
    ].
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1198
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1199
    ^ AllocationFailure raise.
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1200
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1201
    "
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1202
     $A asTitlecase
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1203
     $a asTitlecase
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1204
     (Character value:16r01F1) asTitlecase
18691
0b9e9bc60d61 class: Character
Claus Gittinger <cg@exept.de>
parents: 18658
diff changeset
  1205
     (Character value:16r01F2) asTitlecase
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1206
    "
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1207
!
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1208
6029
594664bbbc47 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 5945
diff changeset
  1209
asUnicodeString
8004
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1210
    "return a unicode string of len 1 with myself as contents.
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1211
     This will vanish, as we now (rel5.2.x) use Unicode as default."
6029
594664bbbc47 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 5945
diff changeset
  1212
7951
062ac899852e care for 4-byte characters
Claus Gittinger <cg@exept.de>
parents: 7897
diff changeset
  1213
    asciivalue > 16rFFFF ifTrue:[
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1214
	^ (Unicode32String new:1) at:1 put:self; yourself.
7951
062ac899852e care for 4-byte characters
Claus Gittinger <cg@exept.de>
parents: 7897
diff changeset
  1215
    ].
062ac899852e care for 4-byte characters
Claus Gittinger <cg@exept.de>
parents: 7897
diff changeset
  1216
    ^ (Unicode16String new:1) at:1 put:self; yourself.
6029
594664bbbc47 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 5945
diff changeset
  1217
!
594664bbbc47 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 5945
diff changeset
  1218
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1219
asUppercase
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1220
    "return a character with same letter as the receiver, but in uppercase.
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1221
     Returns the receiver if it is already uppercase or if there is no uppercase equivalent.
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1222
     CAVEAT:
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1223
	for now, this method is only correct for unicode characters up to u+1d6ff (Unicode3.1).
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1224
	(which is more than mozilla does, btw. ;-)"
7990
2f78c1d609c7 asUppercase / asLowercase for UFF00..UFFFF
Claus Gittinger <cg@exept.de>
parents: 7989
diff changeset
  1225
8004
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1226
%{
18240
28af09029a8b ifdef for SCHTEAM engine changed (not relevant for ST/X)
Claus Gittinger <cg@exept.de>
parents: 18215
diff changeset
  1227
#ifdef __SCHTEAM__
18215
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1228
    {
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1229
	char ch = self.charValue("[asUppercase]");
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1230
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1231
	ch = java.lang.Character.toUpperCase(ch);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1232
	return context._RETURN(STCharacter._new(ch));
18215
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1233
    }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1234
    /* NOTREACHED */
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1235
#else
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1236
    static int __mapping[] = {
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1237
    /* From    To             Every   Diff   */
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1238
       0x0061, ((0x19 << 8) | 0x01), -32  ,
8004
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1239
       0x00b5, ((0x00 << 8) | 0x3b), 0x02e7  ,
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1240
       0x00e0, ((0x16 << 8) | 0x01), -32   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1241
       0x00f8, ((0x06 << 8) | 0x01), -32   ,
8004
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1242
       0x00ff, ((0x00 << 8) | 0x01), 0x0079  ,
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1243
       0x0101, ((0x2e << 8) | 0x02), -1   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1244
       0x0131, ((0x00 << 8) | 0x02), -232  ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1245
       0x0133, ((0x04 << 8) | 0x02), -1   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1246
       0x013a, ((0x0e << 8) | 0x02), -1   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1247
       0x014b, ((0x2c << 8) | 0x02), -1   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1248
       0x017a, ((0x04 << 8) | 0x02), -1   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1249
       0x017f, ((0x00 << 8) | 0x01), -300  ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1250
       0x0183, ((0x02 << 8) | 0x02), -1   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1251
       0x0188, ((0x04 << 8) | 0x04), -1   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1252
       0x0192, ((0x00 << 8) | 0x06), -1   ,
8004
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1253
       0x0195, ((0x00 << 8) | 0x03), 0x0061  ,
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1254
#ifndef UNICODE_3_2
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1255
       0x0199, ((0x04 << 8) | 0x08), -1   ,
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1256
       0x019e, ((0x00 << 8) | 0x00), 130  ,          /* Unicode4.0 - not in X fonts - sigh */
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1257
#endif
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1258
       0x0199, ((0x08 << 8) | 0x08), -1   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1259
       0x01a3, ((0x02 << 8) | 0x02), -1   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1260
       0x01a8, ((0x05 << 8) | 0x05), -1   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1261
       0x01b0, ((0x04 << 8) | 0x04), -1   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1262
       0x01b6, ((0x03 << 8) | 0x03), -1   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1263
       0x01bd, ((0x00 << 8) | 0x04), -1   ,
8004
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1264
       0x01bf, ((0x00 << 8) | 0x02), 0x0038  ,
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1265
       0x01c5, ((0x00 << 8) | 0x06), -1   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1266
       0x01c6, ((0x00 << 8) | 0x01), -2   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1267
       0x01c8, ((0x00 << 8) | 0x02), -1   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1268
       0x01c9, ((0x00 << 8) | 0x01), -2   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1269
       0x01cb, ((0x00 << 8) | 0x02), -1   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1270
       0x01cc, ((0x00 << 8) | 0x01), -2   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1271
       0x01ce, ((0x0e << 8) | 0x02), -1   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1272
       0x01dd, ((0x00 << 8) | 0x01), -79   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1273
       0x01df, ((0x10 << 8) | 0x02), -1   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1274
       0x01f2, ((0x00 << 8) | 0x03), -1   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1275
       0x01f3, ((0x00 << 8) | 0x01), -2   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1276
       0x01f5, ((0x04 << 8) | 0x04), -1   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1277
       0x01fb, ((0x24 << 8) | 0x02), -1   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1278
       0x0223, ((0x10 << 8) | 0x02), -1   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1279
       0x0253, ((0x00 << 8) | 0x20), -210   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1280
       0x0254, ((0x00 << 8) | 0x01), -206   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1281
       0x0256, ((0x01 << 8) | 0x01), -205   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1282
       0x0259, ((0x00 << 8) | 0x02), -202   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1283
       0x025b, ((0x00 << 8) | 0x02), -203   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1284
       0x0260, ((0x00 << 8) | 0x05), -205   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1285
       0x0263, ((0x00 << 8) | 0x03), -207   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1286
       0x0268, ((0x00 << 8) | 0x05), -209   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1287
       0x0269, ((0x06 << 8) | 0x06), -211   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1288
       0x0272, ((0x00 << 8) | 0x03), -213   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1289
       0x0275, ((0x00 << 8) | 0x03), -214   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1290
       0x0280, ((0x03 << 8) | 0x03), -218   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1291
       0x0288, ((0x00 << 8) | 0x05), -218   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1292
       0x028a, ((0x01 << 8) | 0x01), -217   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1293
       0x0292, ((0x00 << 8) | 0x07), -219   ,
8004
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1294
       0x0345, ((0x00 << 8) | 0xb3), 0x0054  ,
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1295
       0x03ac, ((0x00 << 8) | 0x67), -38   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1296
       0x03ad, ((0x02 << 8) | 0x01), -37   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1297
       0x03b1, ((0x10 << 8) | 0x01), -32   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1298
       0x03c2, ((0x00 << 8) | 0x01), -31   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1299
       0x03c3, ((0x08 << 8) | 0x01), -32   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1300
       0x03cc, ((0x00 << 8) | 0x01), -64   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1301
       0x03cd, ((0x01 << 8) | 0x01), -63   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1302
       0x03d0, ((0x00 << 8) | 0x02), -62   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1303
       0x03d1, ((0x00 << 8) | 0x01), -57   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1304
       0x03d5, ((0x00 << 8) | 0x04), -47   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1305
       0x03d6, ((0x00 << 8) | 0x01), -54   ,
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1306
#ifndef UNICODE_3_2
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1307
       0x03d9, ((0x00 << 8) | 0x00), -1  ,             /* Unicode4.0 - not in X fonts - sigh */
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1308
#endif
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1309
       0x03db, ((0x14 << 8) | 0x02), -1    ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1310
       0x03f0, ((0x00 << 8) | 0x01), -86   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1311
       0x03f1, ((0x00 << 8) | 0x01), -80   ,
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1312
#ifdef UNICODE_3_2
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1313
       0x03f2, ((0x00 << 8) | 0x01), -79   ,
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1314
#else
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1315
       0x03f2, ((0x00 << 8) | 0x00), 7  ,              /* Unicode4.0 - not in X fonts - sigh */
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1316
#endif
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1317
       0x03f5, ((0x00 << 8) | 0x00), -96   ,
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1318
#ifndef UNICODE_3_2
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1319
       0x03f8, ((0x03 << 8) | 0x03), -1  ,             /* Unicode4.0 - not in X fonts - sigh */
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1320
#endif
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1321
       0x0430, ((0x1f << 8) | 0x01), -32   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1322
       0x0450, ((0x0f << 8) | 0x01), -80   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1323
       0x0461, ((0x20 << 8) | 0x02), -1   ,
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1324
#ifndef UNICODE_3_2
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1325
       0x048b, ((0x00 << 8) | 0x00), -1 ,              /* Unicode4.0 - not in X fonts - sigh */
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1326
#endif
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1327
       0x048d, ((0x32 << 8) | 0x02), -1   ,
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1328
#ifdef UNICODE_3_2
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1329
       0x04c2, ((0x02 << 8) | 0x02), -1   ,
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1330
#else
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1331
       0x04c2, ((0x04 << 8) | 0x02), -1   ,            /* Unicode4.0 - not in X fonts - sigh */
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1332
#endif
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1333
#ifdef UNICODE_3_2
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1334
       0x04c8, ((0x04 << 8) | 0x04), -1   ,
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1335
#else
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1336
       0x04c8, ((0x04 << 8) | 0x02), -1   ,
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1337
       0x04ce, ((0x00 << 8) | 0x00), -1   ,
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1338
#endif
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1339
       0x04d1, ((0x24 << 8) | 0x02), -1   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1340
       0x04f9, ((0x00 << 8) | 0x04), -1   ,
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1341
#ifndef UNICODE_3_2
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1342
       0x0501, ((0x0E << 8) | 0x02), -1  ,
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  1343
#endif
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1344
       0x0561, ((0x25 << 8) | 0x01), -48   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1345
       0x1e01, ((0x94 << 8) | 0x02), -1   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1346
       0x1e9b, ((0x00 << 8) | 0x06), -59   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1347
       0x1ea1, ((0x58 << 8) | 0x02), -1   ,
8004
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1348
       0x1f00, ((0x07 << 8) | 0x01), 0x0008  ,
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1349
       0x1f10, ((0x05 << 8) | 0x01), 0x0008  ,
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1350
       0x1f20, ((0x07 << 8) | 0x01), 0x0008  ,
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1351
       0x1f30, ((0x07 << 8) | 0x01), 0x0008  ,
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1352
       0x1f40, ((0x05 << 8) | 0x01), 0x0008  ,
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1353
       0x1f51, ((0x06 << 8) | 0x02), 0x0008  ,
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1354
       0x1f60, ((0x07 << 8) | 0x01), 0x0008  ,
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1355
       0x1f70, ((0x01 << 8) | 0x01), 0x004a  ,
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1356
       0x1f72, ((0x03 << 8) | 0x01), 0x0056  ,
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1357
       0x1f76, ((0x01 << 8) | 0x01), 0x0064  ,
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1358
       0x1f78, ((0x01 << 8) | 0x01), 0x0080  ,
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1359
       0x1f7a, ((0x01 << 8) | 0x01), 0x0070  ,
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1360
       0x1f7c, ((0x01 << 8) | 0x01), 0x007e  ,
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1361
       0x1f80, ((0x07 << 8) | 0x01), 0x0008  ,
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1362
       0x1f90, ((0x07 << 8) | 0x01), 0x0008  ,
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1363
       0x1fa0, ((0x07 << 8) | 0x01), 0x0008  ,
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1364
       0x1fb0, ((0x01 << 8) | 0x01), 0x0008  ,
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1365
       0x1fb3, ((0x00 << 8) | 0x02), 0x0009  ,
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1366
       0x1fbe, ((0x00 << 8) | 0x0b), -7205   ,
8004
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1367
       0x1fc3, ((0x00 << 8) | 0x05), 0x0009  ,
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1368
       0x1fd0, ((0x01 << 8) | 0x01), 0x0008  ,
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1369
       0x1fe0, ((0x01 << 8) | 0x01), 0x0008  ,
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1370
       0x1fe5, ((0x00 << 8) | 0x04), 0x0007  ,
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1371
       0x1ff3, ((0x00 << 8) | 0x0e), 0x0009  ,
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1372
       0x2170, ((0x0f << 8) | 0x01), -16   ,
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1373
       0x24d0, ((0x19 << 8) | 0x01), -26   ,
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1374
       0xff41, ((0x19 << 8) | 0x01), -32  ,
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1375
       0x10428, ((0x27 << 8) | 0x01), -40
8004
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1376
    };
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1377
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1378
    REGISTER unsigned INT __codePoint;
8308
19a958b21ac0 no const for hpux
Claus Gittinger <cg@exept.de>
parents: 8222
diff changeset
  1379
    REGISTER int *__p;
8004
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1380
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1381
    __codePoint = __intVal(__INST(asciivalue));
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1382
18691
0b9e9bc60d61 class: Character
Claus Gittinger <cg@exept.de>
parents: 18658
diff changeset
  1383
   // comon ascii stuff first
0b9e9bc60d61 class: Character
Claus Gittinger <cg@exept.de>
parents: 18658
diff changeset
  1384
    if (__codePoint < 0x80) {
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1385
	if ((__codePoint >= 'a') && (__codePoint <= 'z')) {
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1386
	    unsigned newCodePoint;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1387
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1388
	    newCodePoint = __codePoint - 'a' + 'A';
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1389
	    RETURN (__MKCHARACTER(newCodePoint)) ;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1390
	}
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1391
	RETURN (self);
18691
0b9e9bc60d61 class: Character
Claus Gittinger <cg@exept.de>
parents: 18658
diff changeset
  1392
    }
0b9e9bc60d61 class: Character
Claus Gittinger <cg@exept.de>
parents: 18658
diff changeset
  1393
8106
ee222c1314e6 cannot make pointer arith with a void *
Claus Gittinger <cg@exept.de>
parents: 8100
diff changeset
  1394
    for (__p = __mapping; (char *)__p < ((char *)__mapping) + sizeof(__mapping); __p += 3) {
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1395
	unsigned rangeStart, rangeSize, rangeEnd, mod;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1396
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1397
	rangeStart = (unsigned)__p[0];
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1398
	if (rangeStart > __codePoint) break;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1399
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1400
	rangeSize = ((unsigned)__p[1]) >> 8;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1401
	rangeEnd = rangeStart + rangeSize;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1402
	if (__codePoint <= rangeEnd) {
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1403
	    mod = __p[1] & 0xFF;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1404
	    if ((mod == 0) || (((__codePoint - rangeStart) % mod) == 0)) {
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1405
		OBJ newChar;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1406
		unsigned newCodePoint;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1407
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1408
		newCodePoint = __codePoint + __p[2];
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1409
		if (newCodePoint <= MAX_IMMEDIATE_CHARACTER) {
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1410
		    RETURN (__MKCHARACTER(newCodePoint)) ;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1411
		}
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1412
		newChar = __MKUCHARACTER(newCodePoint) ;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1413
		if (newChar == nil) goto allocationError;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1414
		RETURN (newChar) ;
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1415
	    }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1416
	}
8004
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1417
    }
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1418
    RETURN (self);
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  1419
allocationError: ;
18240
28af09029a8b ifdef for SCHTEAM engine changed (not relevant for ST/X)
Claus Gittinger <cg@exept.de>
parents: 18215
diff changeset
  1420
#endif /* ! __SCHTEAM__ */
8004
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1421
%}.
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1422
    ^ AllocationFailure raise.
8004
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1423
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1424
    "
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1425
     $A asLowercase
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1426
     $a asUppercase
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1427
     (Character value:16r01F5) asUppercase
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1428
     (Character value:16r0205) asUppercase
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1429
     (Character value:16r03B1) asUppercase
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1430
    "
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1431
!
8004
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1432
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1433
digitValue
11653
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1434
    "return my digitValue for any base (up to 37)"
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1435
8143
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
  1436
    |code "{ Class: SmallInteger }" |
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
  1437
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
  1438
    code := asciivalue.
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
  1439
    (code between:($0 codePoint) and:($9 codePoint)) ifTrue:[
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1440
	^ code - $0 codePoint
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1441
    ].
8143
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
  1442
    (code between:($a codePoint) and:($z codePoint)) ifTrue:[
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1443
	^ code + (10 - $a codePoint)
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1444
    ].
8143
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
  1445
    (code between:($A codePoint) and:($Z codePoint)) ifTrue:[
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1446
	^ code + (10 - $A codePoint)
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1447
    ].
8004
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1448
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1449
"remove error below for X3J20 conformance ... "
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1450
    self error:'bad character'.
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1451
" "
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1452
    ^ -1
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1453
!
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1454
11653
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1455
digitValueRadix:base
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1456
    "return my digitValue for base.
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1457
     Return nil, if it is not a valid character for that base"
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1458
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1459
    |code "{ Class: SmallInteger }" |
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1460
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1461
    code := asciivalue.
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1462
    base < 10 ifTrue:[
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1463
	(code between:($0 codePoint) and:($0 codePoint + base - 1)) ifTrue:[
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1464
	    ^ code - $0 codePoint
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1465
	].
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1466
	^ nil.
11653
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1467
    ].
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1468
    (code between:($0 codePoint) and:($9 codePoint)) ifTrue:[
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1469
	^ code - $0 codePoint
11653
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1470
    ].
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1471
    base <= 10 ifTrue:[
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1472
	^ nil.
11653
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1473
    ].
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1474
    (code between:($a codePoint) and:($a codePoint + base - 1 - 10)) ifTrue:[
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1475
	^ code + (10 - $a codePoint)
11653
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1476
    ].
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1477
    (code between:($A codePoint) and:($A codePoint + base - 1 - 10)) ifTrue:[
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1478
	^ code + (10 - $A codePoint)
11653
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1479
    ].
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1480
    ^ nil
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1481
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1482
    "
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1483
     self assert:($0 digitValueRadix:10) == 0.
11653
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1484
     self assert:($9 digitValueRadix:10) == 9.
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1485
     self assert:($a digitValueRadix:10) == nil.
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1486
     self assert:($a digitValueRadix:11) == 10.
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1487
     self assert:($A digitValueRadix:11) == 10.
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1488
     self assert:($a digitValueRadix:16) == 10.
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1489
     self assert:($A digitValueRadix:16) == 10.
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1490
     self assert:($f digitValueRadix:16) == 15.
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1491
     self assert:($F digitValueRadix:16) == 15.
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1492
     self assert:($g digitValueRadix:16) == nil.
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1493
     self assert:($G digitValueRadix:16) == nil.
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1494
     self assert:($g digitValueRadix:17) == 16.
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1495
     self assert:($G digitValueRadix:17) == 16.
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1496
    "
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1497
!
f48525bf7a6e +digitValueRadix:
Claus Gittinger <cg@exept.de>
parents: 11328
diff changeset
  1498
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1499
literalArrayEncoding
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1500
    "encode myself as an array literal, from which a copy of the receiver
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1501
     can be reconstructed with #decodeAsLiteralArray."
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1502
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1503
    ^ self
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1504
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1505
    "Created: / 27.10.1997 / 14:40:37 / cg"
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1506
!
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1507
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1508
to:aMagnitude
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1509
    "Return an Interval over the characters from the receiver to <aMagnitude>.
21520
808c4175a149 #DOCUMENTATION by cg
Claus Gittinger <cg@exept.de>
parents: 21519
diff changeset
  1510
     Wrap <aMagnitude> if it is not a legal Character value. (JS)
808c4175a149 #DOCUMENTATION by cg
Claus Gittinger <cg@exept.de>
parents: 21519
diff changeset
  1511
     CG: why wrap - is this a good idea?"
8010
6dddd9cb3f84 asXCase: Unicode
Claus Gittinger <cg@exept.de>
parents: 8004
diff changeset
  1512
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
  1513
    ^ Interval from:self to:(aMagnitude \\ 16r3FFFFFFF)
21520
808c4175a149 #DOCUMENTATION by cg
Claus Gittinger <cg@exept.de>
parents: 21519
diff changeset
  1514
808c4175a149 #DOCUMENTATION by cg
Claus Gittinger <cg@exept.de>
parents: 21519
diff changeset
  1515
    "Modified (comment): / 22-02-2017 / 10:56:38 / cg"
8004
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1516
!
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1517
13499
Claus Gittinger <cg@exept.de>
parents: 11956
diff changeset
  1518
to:aMagnitude by:inc
Claus Gittinger <cg@exept.de>
parents: 11956
diff changeset
  1519
    "Return an Interval over the characters from the receiver to <aMagnitude>.
21519
f39808be9f7e #DOCUMENTATION by cg
Claus Gittinger <cg@exept.de>
parents: 21484
diff changeset
  1520
     Wrap <aMagnitude> if it is not a legal Character value. (JS)
f39808be9f7e #DOCUMENTATION by cg
Claus Gittinger <cg@exept.de>
parents: 21484
diff changeset
  1521
     CG: why wrap - is this a good idea?"
13499
Claus Gittinger <cg@exept.de>
parents: 11956
diff changeset
  1522
Claus Gittinger <cg@exept.de>
parents: 11956
diff changeset
  1523
    ^ Interval from:self to:(aMagnitude \\ 16r3FFFFFFF) by:inc
Claus Gittinger <cg@exept.de>
parents: 11956
diff changeset
  1524
Claus Gittinger <cg@exept.de>
parents: 11956
diff changeset
  1525
    "Created: / 04-07-2011 / 19:35:15 / cg"
21519
f39808be9f7e #DOCUMENTATION by cg
Claus Gittinger <cg@exept.de>
parents: 21484
diff changeset
  1526
    "Modified (comment): / 22-02-2017 / 10:48:42 / cg"
13499
Claus Gittinger <cg@exept.de>
parents: 11956
diff changeset
  1527
!
Claus Gittinger <cg@exept.de>
parents: 11956
diff changeset
  1528
5847
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
  1529
utf8Encoded
11321
ceb4c9bf0085 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 11144
diff changeset
  1530
    "convert a character to its UTF-8 encoding.
5847
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
  1531
     this returns a String"
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
  1532
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
  1533
    |s|
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
  1534
7897
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
  1535
    asciivalue <= 16r7F ifTrue:[
21374
f7820456972c #FEATURE by stefan
Stefan Vogel <sv@exept.de>
parents: 20706
diff changeset
  1536
        ^ self asString.
7897
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
  1537
    ].
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
  1538
21374
f7820456972c #FEATURE by stefan
Stefan Vogel <sv@exept.de>
parents: 20706
diff changeset
  1539
    s := WriteStream on:(String new:self utf8BytesPerCharacter).
11321
ceb4c9bf0085 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 11144
diff changeset
  1540
    s nextPutUtf8:self.
5847
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
  1541
    ^ s contents
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1542
5847
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
  1543
    "
21673
8e83b97e0909 #TUNING by stefan
Stefan Vogel <sv@exept.de>
parents: 21535
diff changeset
  1544
     'ä' utf8Encoded
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1545
     'a' utf8Encoded
5847
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
  1546
    "
21374
f7820456972c #FEATURE by stefan
Stefan Vogel <sv@exept.de>
parents: 20706
diff changeset
  1547
f7820456972c #FEATURE by stefan
Stefan Vogel <sv@exept.de>
parents: 20706
diff changeset
  1548
    "Modified: / 07-02-2017 / 14:37:06 / stefan"
21673
8e83b97e0909 #TUNING by stefan
Stefan Vogel <sv@exept.de>
parents: 21535
diff changeset
  1549
!
8e83b97e0909 #TUNING by stefan
Stefan Vogel <sv@exept.de>
parents: 21535
diff changeset
  1550
8e83b97e0909 #TUNING by stefan
Stefan Vogel <sv@exept.de>
parents: 21535
diff changeset
  1551
withoutDiacritics
8e83b97e0909 #TUNING by stefan
Stefan Vogel <sv@exept.de>
parents: 21535
diff changeset
  1552
    "return a character with same letter as the receiver, but in without diacritics modifiers
8e83b97e0909 #TUNING by stefan
Stefan Vogel <sv@exept.de>
parents: 21535
diff changeset
  1553
     (mapping e.g. Ä to A).
8e83b97e0909 #TUNING by stefan
Stefan Vogel <sv@exept.de>
parents: 21535
diff changeset
  1554
     Returns the receiver if it has no diacritics modifiers."
8e83b97e0909 #TUNING by stefan
Stefan Vogel <sv@exept.de>
parents: 21535
diff changeset
  1555
8e83b97e0909 #TUNING by stefan
Stefan Vogel <sv@exept.de>
parents: 21535
diff changeset
  1556
    ^ self shouldImplement
8e83b97e0909 #TUNING by stefan
Stefan Vogel <sv@exept.de>
parents: 21535
diff changeset
  1557
8e83b97e0909 #TUNING by stefan
Stefan Vogel <sv@exept.de>
parents: 21535
diff changeset
  1558
    "Created: / 28-03-2017 / 16:01:45 / stefan"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1559
! !
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1560
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1561
!Character methodsFor:'copying'!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1562
15984
4f74a20b24f7 class: Character
Claus Gittinger <cg@exept.de>
parents: 15782
diff changeset
  1563
, aStringOrCharacter
4f74a20b24f7 class: Character
Claus Gittinger <cg@exept.de>
parents: 15782
diff changeset
  1564
    "return a string containing the concatenation of the receiver character
4f74a20b24f7 class: Character
Claus Gittinger <cg@exept.de>
parents: 15782
diff changeset
  1565
     and the argument, a string or character.
4f74a20b24f7 class: Character
Claus Gittinger <cg@exept.de>
parents: 15782
diff changeset
  1566
     Added for symetry, as we allow string,char also char,string should be allowed"
4f74a20b24f7 class: Character
Claus Gittinger <cg@exept.de>
parents: 15782
diff changeset
  1567
16074
c45e7c47def3 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 15984
diff changeset
  1568
%{
c45e7c47def3 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 15984
diff changeset
  1569
    OBJ s;
c45e7c47def3 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 15984
diff changeset
  1570
    unsigned INT val;
c45e7c47def3 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 15984
diff changeset
  1571
c45e7c47def3 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 15984
diff changeset
  1572
    // fast code for common cases
c45e7c47def3 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 15984
diff changeset
  1573
    val = __intVal(__characterVal(self));
c45e7c47def3 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 15984
diff changeset
  1574
    if (val <= 0xFF) {
18215
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1575
	if (__isCharacter(aStringOrCharacter)) {
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1576
	    unsigned INT val2 = __intVal(__characterVal(aStringOrCharacter));
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1577
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1578
	    if (val2 <= 0xFF) {
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1579
		char buffer[2];
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1580
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1581
		buffer[0] = val;
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1582
		buffer[1] = val2;
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1583
		s = __MKSTRING_L(buffer, 2);
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1584
		if (s != nil) {
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1585
		    RETURN (s);
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1586
		}
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1587
	    }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1588
	} else {
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1589
	    if (__isString(aStringOrCharacter)) {
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1590
		int strSize = __stringSize(aStringOrCharacter);
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1591
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1592
		s = __MKEMPTYSTRING(strSize+1);
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1593
		if (s != nil) {
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1594
		    __StringInstPtr(s)->s_element[0] = val;
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1595
		    memcpy(__StringInstPtr(s)->s_element+1, __stringVal(aStringOrCharacter), strSize+1); // copies 0-byte too
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1596
		    RETURN (s);
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1597
		}
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1598
	    }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  1599
	}
16074
c45e7c47def3 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 15984
diff changeset
  1600
    }
c45e7c47def3 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 15984
diff changeset
  1601
%}.
15984
4f74a20b24f7 class: Character
Claus Gittinger <cg@exept.de>
parents: 15782
diff changeset
  1602
    ^ self asString , aStringOrCharacter
4f74a20b24f7 class: Character
Claus Gittinger <cg@exept.de>
parents: 15782
diff changeset
  1603
4f74a20b24f7 class: Character
Claus Gittinger <cg@exept.de>
parents: 15782
diff changeset
  1604
    "
4f74a20b24f7 class: Character
Claus Gittinger <cg@exept.de>
parents: 15782
diff changeset
  1605
     $. , $:
4f74a20b24f7 class: Character
Claus Gittinger <cg@exept.de>
parents: 15782
diff changeset
  1606
     $. , 'abc' , $.
16074
c45e7c47def3 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 15984
diff changeset
  1607
c45e7c47def3 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 15984
diff changeset
  1608
      Time millisecondsToRun:[ 10000000 timesRepeat:[ $a , $b ]]
c45e7c47def3 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 15984
diff changeset
  1609
      Time millisecondsToRun:[ 10000000 timesRepeat:[ $a , 'b' ]]
c45e7c47def3 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 15984
diff changeset
  1610
      Time millisecondsToRun:[ 10000000 timesRepeat:[ 'a' , 'b' ]]
c45e7c47def3 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 15984
diff changeset
  1611
      Time millisecondsToRun:[ 10000000 timesRepeat:[ 'a' , $b ]]
15984
4f74a20b24f7 class: Character
Claus Gittinger <cg@exept.de>
parents: 15782
diff changeset
  1612
    "
4f74a20b24f7 class: Character
Claus Gittinger <cg@exept.de>
parents: 15782
diff changeset
  1613
!
4f74a20b24f7 class: Character
Claus Gittinger <cg@exept.de>
parents: 15782
diff changeset
  1614
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1615
copy
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1616
    "return a copy of myself
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1617
     reimplemented since characters are unique"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1618
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1619
     ^ self
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1620
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1621
10948
a90f2ab2c6ec deepCopy change
ab
parents: 10936
diff changeset
  1622
deepCopyUsing:aDictionary postCopySelector:postCopySelector
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1623
    "return a deep copy of myself
4728
37eaa8241422 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 4682
diff changeset
  1624
     reimplemented since characters are immutable"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1625
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1626
     ^ self
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1627
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1628
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1629
shallowCopy
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1630
    "return a shallow copy of myself
4728
37eaa8241422 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 4682
diff changeset
  1631
     reimplemented since characters are immutable"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1632
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1633
     ^ self
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1634
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1635
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1636
simpleDeepCopy
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1637
    "return a deep copy of myself
4728
37eaa8241422 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 4682
diff changeset
  1638
     reimplemented since characters are immutable"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1639
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1640
     ^ self
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1641
! !
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1642
5471
a57eeb01c5ab General encoding method (#encodeOn:with:)
Stefan Vogel <sv@exept.de>
parents: 5452
diff changeset
  1643
!Character methodsFor:'encoding'!
a57eeb01c5ab General encoding method (#encodeOn:with:)
Stefan Vogel <sv@exept.de>
parents: 5452
diff changeset
  1644
6508
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1645
rot13
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1646
     "Usenet: from `rotate alphabet 13 places']
6508
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1647
      The simple Caesar-cypher encryption that replaces each English
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1648
      letter with the one 13 places forward or back along the alphabet,
6508
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1649
      so that 'The butler did it!!' becomes 'Gur ohgyre qvq vg!!'
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1650
      Most Usenet news reading and posting programs include a rot13 feature.
6508
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1651
      It is used to enclose the text in a sealed wrapper that the reader must choose
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1652
      to open -- e.g., for posting things that might offend some readers, or spoilers.
6508
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1653
      A major advantage of rot13 over rot(N) for other N is that it
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1654
      is self-inverse, so the same code can be used for encoding and decoding."
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1655
11864
Claus Gittinger <cg@exept.de>
parents: 11653
diff changeset
  1656
    ^ self rot:13
6508
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1657
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1658
    "
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1659
     $h rot13
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1660
     $h rot13 rot13
7715
0e69a830f5d8 use #and: - not #& you lazy bone, you
Claus Gittinger <cg@exept.de>
parents: 7689
diff changeset
  1661
     'The butler did it!!' rot13             -> 'Gur ohgyre qvq vg!!'
0e69a830f5d8 use #and: - not #& you lazy bone, you
Claus Gittinger <cg@exept.de>
parents: 7689
diff changeset
  1662
     'The butler did it!!' rot13 rot13       -> 'The butler did it!!'
6508
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1663
    "
11864
Claus Gittinger <cg@exept.de>
parents: 11653
diff changeset
  1664
!
Claus Gittinger <cg@exept.de>
parents: 11653
diff changeset
  1665
Claus Gittinger <cg@exept.de>
parents: 11653
diff changeset
  1666
rot:n
Claus Gittinger <cg@exept.de>
parents: 11653
diff changeset
  1667
     "Usenet: from `rotate alphabet N places']
Claus Gittinger <cg@exept.de>
parents: 11653
diff changeset
  1668
      The simple Caesar-cypher encryption that replaces each English
Claus Gittinger <cg@exept.de>
parents: 11653
diff changeset
  1669
      letter with the one N places forward or back along the alphabet,
Claus Gittinger <cg@exept.de>
parents: 11653
diff changeset
  1670
      so that 'The butler did it!!' becomes 'Gur ohgyre qvq vg!!' by rot:13
Claus Gittinger <cg@exept.de>
parents: 11653
diff changeset
  1671
      Most Usenet news reading and posting programs include a rot13 feature.
Claus Gittinger <cg@exept.de>
parents: 11653
diff changeset
  1672
      It is used to enclose the text in a sealed wrapper that the reader must choose
Claus Gittinger <cg@exept.de>
parents: 11653
diff changeset
  1673
      to open -- e.g., for posting things that might offend some readers, or spoilers.
Claus Gittinger <cg@exept.de>
parents: 11653
diff changeset
  1674
      A major advantage of rot13 over rot(N) for other N is that it
Claus Gittinger <cg@exept.de>
parents: 11653
diff changeset
  1675
      is self-inverse, so the same code can be used for encoding and decoding."
Claus Gittinger <cg@exept.de>
parents: 11653
diff changeset
  1676
Claus Gittinger <cg@exept.de>
parents: 11653
diff changeset
  1677
    (self isLetter) ifTrue:[
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1678
	self isLowercase ifTrue:[
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1679
	    ^ 'abcdefghijklmnopqrstuvwxyzabcdefghijklmnopqrstuvwxyz' at:(self-$a+1+n)
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1680
	].
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1681
	^ 'ABCDEFGHIJKLMNOPQRSTUVWXYZABCDEFGHIJKLMNOPQRSTUVWXYZ' at:(self-$A+1+n)
11864
Claus Gittinger <cg@exept.de>
parents: 11653
diff changeset
  1682
    ].
Claus Gittinger <cg@exept.de>
parents: 11653
diff changeset
  1683
    ^ self
Claus Gittinger <cg@exept.de>
parents: 11653
diff changeset
  1684
Claus Gittinger <cg@exept.de>
parents: 11653
diff changeset
  1685
    "
Claus Gittinger <cg@exept.de>
parents: 11653
diff changeset
  1686
     'The butler did it!!' rot:13                -> 'Gur ohgyre qvq vg!!'
Claus Gittinger <cg@exept.de>
parents: 11653
diff changeset
  1687
     ('The butler did it!!' rot:13) rot:13       -> 'The butler did it!!'
Claus Gittinger <cg@exept.de>
parents: 11653
diff changeset
  1688
    "
5471
a57eeb01c5ab General encoding method (#encodeOn:with:)
Stefan Vogel <sv@exept.de>
parents: 5452
diff changeset
  1689
! !
a57eeb01c5ab General encoding method (#encodeOn:with:)
Stefan Vogel <sv@exept.de>
parents: 5452
diff changeset
  1690
10428
fm
parents: 9654
diff changeset
  1691
8143
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
  1692
!Character methodsFor:'obsolete'!
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
  1693
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
  1694
asciiValue
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
  1695
    "return the asciivalue of myself.
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
  1696
     The name 'asciiValue' is a historic leftover:
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1697
	characters are not limited to 8bit characters.
8143
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
  1698
     So the actual value returned is a codePoint (i.e. full potential for 31bit encoding).
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
  1699
     PP has removed this method with 4.1 and providing asInteger instead.
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
  1700
     ANSI defines #codePoint, please use this method"
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
  1701
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
  1702
    <resource:#obsolete>
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
  1703
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
  1704
    ^ asciivalue
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
  1705
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
  1706
    "Modified: 27.6.1996 / 12:34:34 / cg"
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
  1707
! !
b57bf0242c55 Fix comments
Stefan Vogel <sv@exept.de>
parents: 8106
diff changeset
  1708
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1709
!Character methodsFor:'printing & storing'!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1710
14117
299ac69e8f9d added: #displayOn:
Stefan Vogel <sv@exept.de>
parents: 13558
diff changeset
  1711
displayOn:aGCOrStream
14120
fdf215af772c added: #displayOn: (instead of #displaySting)
Stefan Vogel <sv@exept.de>
parents: 14117
diff changeset
  1712
    "Compatibility
fdf215af772c added: #displayOn: (instead of #displaySting)
Stefan Vogel <sv@exept.de>
parents: 14117
diff changeset
  1713
     append a printed desription on some stream (Dolphin,  Squeak)
fdf215af772c added: #displayOn: (instead of #displaySting)
Stefan Vogel <sv@exept.de>
parents: 14117
diff changeset
  1714
     OR:
fdf215af772c added: #displayOn: (instead of #displaySting)
Stefan Vogel <sv@exept.de>
parents: 14117
diff changeset
  1715
     display the receiver in a graphicsContext at 0@0 (ST80).
fdf215af772c added: #displayOn: (instead of #displaySting)
Stefan Vogel <sv@exept.de>
parents: 14117
diff changeset
  1716
     This method allows for any object to be displayed in some view
fdf215af772c added: #displayOn: (instead of #displaySting)
Stefan Vogel <sv@exept.de>
parents: 14117
diff changeset
  1717
     (although the fallBack is to display its printString ...)"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1718
14117
299ac69e8f9d added: #displayOn:
Stefan Vogel <sv@exept.de>
parents: 13558
diff changeset
  1719
    "/ what a kludge - Dolphin and Squeak mean: printOn: a stream;
21535
f1ca40b3f545 #OTHER by cg
Claus Gittinger <cg@exept.de>
parents: 21520
diff changeset
  1720
    "/ old ST80 means: draw-yourself on a GC.
16744
eef6b34f19d7 displayOn: cleanup
Claus Gittinger <cg@exept.de>
parents: 16727
diff changeset
  1721
    (aGCOrStream isStream) ifFalse:[
21484
183245e23a14 #REFACTORING by stefan
Stefan Vogel <sv@exept.de>
parents: 21374
diff changeset
  1722
        ^ super displayOn:aGCOrStream
14117
299ac69e8f9d added: #displayOn:
Stefan Vogel <sv@exept.de>
parents: 13558
diff changeset
  1723
    ].
299ac69e8f9d added: #displayOn:
Stefan Vogel <sv@exept.de>
parents: 13558
diff changeset
  1724
299ac69e8f9d added: #displayOn:
Stefan Vogel <sv@exept.de>
parents: 13558
diff changeset
  1725
    self storeOn:aGCOrStream.
299ac69e8f9d added: #displayOn:
Stefan Vogel <sv@exept.de>
parents: 13558
diff changeset
  1726
    aGCOrStream nextPutAll:' "16r'.
299ac69e8f9d added: #displayOn:
Stefan Vogel <sv@exept.de>
parents: 13558
diff changeset
  1727
    asciivalue printOn:aGCOrStream base:16.
21484
183245e23a14 #REFACTORING by stefan
Stefan Vogel <sv@exept.de>
parents: 21374
diff changeset
  1728
    aGCOrStream space.
19195
32f423697432 #DOCUMENTATION
Claus Gittinger <cg@exept.de>
parents: 19120
diff changeset
  1729
    asciivalue printOn:aGCOrStream.
14117
299ac69e8f9d added: #displayOn:
Stefan Vogel <sv@exept.de>
parents: 13558
diff changeset
  1730
    aGCOrStream nextPut:$".
21484
183245e23a14 #REFACTORING by stefan
Stefan Vogel <sv@exept.de>
parents: 21374
diff changeset
  1731
183245e23a14 #REFACTORING by stefan
Stefan Vogel <sv@exept.de>
parents: 21374
diff changeset
  1732
    "Modified: / 17-02-2017 / 10:53:26 / stefan"
21535
f1ca40b3f545 #OTHER by cg
Claus Gittinger <cg@exept.de>
parents: 21520
diff changeset
  1733
    "Modified (comment): / 22-02-2017 / 16:49:35 / cg"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1734
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1735
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1736
isLiteral
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1737
    "return true, if the receiver can be used as a literal constant in ST syntax
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1738
     (i.e. can be used in constant arrays)"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1739
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1740
    ^ true
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1741
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1742
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1743
print
5746
cf5e42cb72ef do not overwrite the standard printing conventions
Claus Gittinger <cg@exept.de>
parents: 5566
diff changeset
  1744
    "print myself on stdout.
19378
93bf71d3fc81 #UI_ENHANCEMENT
Claus Gittinger <cg@exept.de>
parents: 19195
diff changeset
  1745
     If Stdout is nil, this method does NOT (by purpose) use the stream classes and
93bf71d3fc81 #UI_ENHANCEMENT
Claus Gittinger <cg@exept.de>
parents: 19195
diff changeset
  1746
     will therefore work even in case of emergency or very early startup (but only, if Stdout is nil)."
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1747
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1748
%{  /* NOCONTEXT */
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1749
5746
cf5e42cb72ef do not overwrite the standard printing conventions
Claus Gittinger <cg@exept.de>
parents: 5566
diff changeset
  1750
    if (@global(Stdout) == nil) {
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1751
	putchar(__intVal(__INST(asciivalue)));
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1752
	RETURN(self);
5746
cf5e42cb72ef do not overwrite the standard printing conventions
Claus Gittinger <cg@exept.de>
parents: 5566
diff changeset
  1753
    }
5452
71fd110c347a allow print, printCR during early initialization
Claus Gittinger <cg@exept.de>
parents: 5433
diff changeset
  1754
%}.
71fd110c347a allow print, printCR during early initialization
Claus Gittinger <cg@exept.de>
parents: 5433
diff changeset
  1755
    super print
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1756
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1757
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1758
printOn:aStream
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1759
    "print myself on aStream"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1760
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1761
    aStream nextPut:self
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1762
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1763
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1764
printString
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1765
    "return a string to print me"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1766
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1767
    ^ self asString
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1768
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1769
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1770
storeOn:aStream
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1771
    "store myself on aStream"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1772
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1773
    |special|
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1774
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
  1775
    (asciivalue between:33 and:127) ifTrue:[
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
  1776
	aStream nextPut:$$; nextPut:self
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
  1777
    ] ifFalse:[
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1778
	(self == Character space) ifTrue:[
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
  1779
	    special := #space
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1780
	] ifFalse:[
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1781
	    (self == Character cr) ifTrue:[
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
  1782
		special := #cr.
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1783
	    ] ifFalse:[
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1784
		(self == Character tab) ifTrue:[
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
  1785
		    special := #tab.
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1786
		] ifFalse:[
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1787
		    (self == Character esc) ifTrue:[
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
  1788
			special := #esc.
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1789
		    ]
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1790
		]
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1791
	    ]
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1792
	].
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1793
	special notNil ifTrue:[
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1794
	    aStream nextPutAll:'(Character '; nextPutAll:special; nextPut:$).
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1795
	    ^ self
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1796
	].
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
  1797
	aStream nextPutAll:'(Character codePoint:16r'.
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1798
	asciivalue printOn:aStream base:16.
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1799
	aStream nextPut:$)
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
  1800
     ].
995
b018368b3a94 asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents: 819
diff changeset
  1801
3190
81ffb25d1d86 Use #printOn: instead of #printString
Stefan Vogel <sv@exept.de>
parents: 3072
diff changeset
  1802
    "Modified: / 23.2.1996 / 23:27:32 / cg"
81ffb25d1d86 Use #printOn: instead of #printString
Stefan Vogel <sv@exept.de>
parents: 3072
diff changeset
  1803
    "Modified: / 20.1.1998 / 14:10:46 / stefan"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1804
! !
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1805
7257
b9f0fb923c72 method category rename
Claus Gittinger <cg@exept.de>
parents: 7250
diff changeset
  1806
!Character methodsFor:'private-accessing'!
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1807
8100
1bcc9d4ee4bc Fix #identityHash
Stefan Vogel <sv@exept.de>
parents: 8097
diff changeset
  1808
setCodePoint:anInteger
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1809
    "very private - set the codePoint.
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
  1810
     - use this only for newly created characters with codes > MAX_IMMEDIATE_CHARACTER.
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1811
     DANGER alert:
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1812
	funny things happen, if this is applied to
10936
9381620deb4d use #codePoint: instead of #value:
Stefan Vogel <sv@exept.de>
parents: 10428
diff changeset
  1813
	one of the shared characters with codePoints 0..MAX_IMMEDIATE_CHARACTER."
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1814
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1815
    asciivalue := anInteger
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1816
! !
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1817
8004
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1818
!Character methodsFor:'queries'!
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1819
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1820
bitsPerCharacter
20698
6838c575c1c6 #DOCUMENTATION by cg
Claus Gittinger <cg@exept.de>
parents: 20111
diff changeset
  1821
    "return the number of bits I require for storage.
6838c575c1c6 #DOCUMENTATION by cg
Claus Gittinger <cg@exept.de>
parents: 20111
diff changeset
  1822
     (i.e. am I an Ascii/ISO8859-1 Character or will I need more
6838c575c1c6 #DOCUMENTATION by cg
Claus Gittinger <cg@exept.de>
parents: 20111
diff changeset
  1823
     bits for storage."
8004
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1824
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1825
    asciivalue <= 16rFF ifTrue:[^ 8].
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1826
    asciivalue <= 16rFFFF ifTrue:[^ 16].
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1827
    ^ 32
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1828
!
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1829
19756
94cdf82295d7 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19754
diff changeset
  1830
bytesPerCharacter
94cdf82295d7 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19754
diff changeset
  1831
    "return the number of bytes I require for storage"
94cdf82295d7 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19754
diff changeset
  1832
94cdf82295d7 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19754
diff changeset
  1833
    asciivalue <= 16rFF ifTrue:[^ 1].
94cdf82295d7 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19754
diff changeset
  1834
    asciivalue <= 16rFFFF ifTrue:[^ 2].
94cdf82295d7 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19754
diff changeset
  1835
    ^ 4
21374
f7820456972c #FEATURE by stefan
Stefan Vogel <sv@exept.de>
parents: 20706
diff changeset
  1836
f7820456972c #FEATURE by stefan
Stefan Vogel <sv@exept.de>
parents: 20706
diff changeset
  1837
    "Modified: / 07-02-2017 / 14:36:05 / stefan"
19756
94cdf82295d7 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19754
diff changeset
  1838
!
94cdf82295d7 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19754
diff changeset
  1839
18595
aa02a050b351 class: Character
Stefan Vogel <sv@exept.de>
parents: 18588
diff changeset
  1840
characterSize
18588
ed9d4665224c class: Character
Stefan Vogel <sv@exept.de>
parents: 18298
diff changeset
  1841
    "return the number of bits I require for storage.
ed9d4665224c class: Character
Stefan Vogel <sv@exept.de>
parents: 18298
diff changeset
  1842
     Protocol compatibility with CharacterArray."
18656
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
  1843
18595
aa02a050b351 class: Character
Stefan Vogel <sv@exept.de>
parents: 18588
diff changeset
  1844
    asciivalue <= 16r80 ifTrue:[
18656
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
  1845
	^ 7
18595
aa02a050b351 class: Character
Stefan Vogel <sv@exept.de>
parents: 18588
diff changeset
  1846
    ].
18588
ed9d4665224c class: Character
Stefan Vogel <sv@exept.de>
parents: 18298
diff changeset
  1847
    asciivalue <= 16rFF ifTrue:[
18656
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
  1848
	^ 8
18588
ed9d4665224c class: Character
Stefan Vogel <sv@exept.de>
parents: 18298
diff changeset
  1849
    ].
ed9d4665224c class: Character
Stefan Vogel <sv@exept.de>
parents: 18298
diff changeset
  1850
    asciivalue <= 16rFFFF ifTrue:[
18656
fab035561c68 fix: assign instead of compare
Claus Gittinger <cg@exept.de>
parents: 18605
diff changeset
  1851
	^ 16
18588
ed9d4665224c class: Character
Stefan Vogel <sv@exept.de>
parents: 18298
diff changeset
  1852
    ].
ed9d4665224c class: Character
Stefan Vogel <sv@exept.de>
parents: 18298
diff changeset
  1853
    ^ 32
ed9d4665224c class: Character
Stefan Vogel <sv@exept.de>
parents: 18298
diff changeset
  1854
!
ed9d4665224c class: Character
Stefan Vogel <sv@exept.de>
parents: 18298
diff changeset
  1855
8004
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1856
stringSpecies
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1857
    "return the type of string that is needed to store me"
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1858
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1859
    asciivalue <= 16rFF ifTrue:[^ String].
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1860
    asciivalue <= 16rFFFF ifTrue:[^ Unicode16String].
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1861
    ^ Unicode32String
20015
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1862
!
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1863
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1864
unicodeBlock
20016
40b9d147b855 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 20015
diff changeset
  1865
    "return the name of the unicode block in which this character is.
40b9d147b855 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 20015
diff changeset
  1866
     incomplete"
20015
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1867
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1868
    asciivalue <= 16r007F ifTrue:[^ #BASIC_LATIN ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1869
    asciivalue <= 16r00FF ifTrue:[^ #LATIN1_SUPPLEMENT ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1870
    asciivalue <= 16r017F ifTrue:[^ #LATIN1_EXTENDED_A].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1871
    asciivalue <= 16r024F ifTrue:[^ #LATIN1_EXTENDED_B].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1872
    asciivalue <= 16r02AF ifTrue:[^ #IPA_EXTENSIONS].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1873
    asciivalue <= 16r02FF ifTrue:[^ #SPACING_MODIFIER_LETTERS].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1874
    asciivalue <= 16r036f ifTrue:[ ^ #COMBINING_DIACRITICAL_MARKS ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1875
    asciivalue <= 16r03FF ifTrue:[ ^ #GREEK_AND_COPTIC ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1876
    asciivalue <= 16r04FF ifTrue:[ ^ #CYRILLIC ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1877
    asciivalue <= 16r052F ifTrue:[ ^ #CYRILLIC_SUPPLEMENT ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1878
    asciivalue <= 16r058F ifTrue:[ ^ #ARMENIAN ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1879
    asciivalue <= 16r05FF ifTrue:[ ^ #HEBREW ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1880
    asciivalue <= 16r06FF ifTrue:[ ^ #ARABIC ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1881
    asciivalue <= 16r074F ifTrue:[ ^ #SYRIAC ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1882
    asciivalue <= 16r077F ifTrue:[ ^ #ARABIC_SUPPLEMENT ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1883
    asciivalue <= 16r07BF ifTrue:[ ^ #THAANA ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1884
    asciivalue <= 16r07FF ifTrue:[ ^ #NKO ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1885
    asciivalue <= 16r083F ifTrue:[ ^ #SAMARITAN ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1886
    asciivalue <= 16r085F ifTrue:[ ^ #MANDAIC ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1887
    asciivalue <= 16r087F ifTrue:[ ^ nil ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1888
    asciivalue <= 16r08FF ifTrue:[ ^ #ARABIC_EXTENDED_A ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1889
    asciivalue <= 16r097F ifTrue:[ ^ #DEVANAGARI ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1890
    asciivalue <= 16r09FF ifTrue:[ ^ #BENGALI ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1891
    asciivalue <= 16r0A7F ifTrue:[ ^ #GURMUKHI ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1892
    asciivalue <= 16r0AFF ifTrue:[ ^ #GUJARATI ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1893
    asciivalue <= 16r0B7F ifTrue:[ ^ #ORIYA ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1894
    asciivalue <= 16r0BFF ifTrue:[ ^ #TAMIL ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1895
    asciivalue <= 16r0C7F ifTrue:[ ^ #TELUGU ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1896
    asciivalue <= 16r0CFF ifTrue:[ ^ #KANNADA ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1897
    asciivalue <= 16r0D7F ifTrue:[ ^ #MALAYALAM ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1898
    asciivalue <= 16r0DFF ifTrue:[ ^ #SINHALA ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1899
    asciivalue <= 16r0E7F ifTrue:[ ^ #THAI ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1900
    asciivalue <= 16r0EFF ifTrue:[ ^ #LAO ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1901
    asciivalue <= 16r0FFF ifTrue:[ ^ #TIBETAN ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1902
    asciivalue <= 16r109F ifTrue:[ ^ #MYANMAR ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1903
    asciivalue <= 16r10FF ifTrue:[ ^ #GEORGIAN ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1904
    asciivalue <= 16r11FF ifTrue:[ ^ #HANGUL_JAMO ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1905
    asciivalue <= 16r137F ifTrue:[ ^ #ETHIOPIC ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1906
    asciivalue <= 16r139F ifTrue:[ ^ #ETHIOPIC_SUPPLEMENT ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1907
    asciivalue <= 16r13FF ifTrue:[ ^ #CHEROKEE ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1908
    asciivalue <= 16r167F ifTrue:[ ^ #UNIFIED_CANADIAN_ABORIGINAL_SYLLABICS ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1909
    asciivalue < 16r1AB0 ifTrue:[ ^ #OTHER ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1910
    asciivalue <= 16r1AFF ifTrue:[ ^ #COMBINING_DIACRITICAL_MARKS_EXTENDED ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1911
    asciivalue < 16r1DC0 ifTrue:[ ^ #OTHER ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1912
    asciivalue <= 16r1DFF ifTrue:[ ^ #COMBINING_DIACRITICAL_MARKS_SUPPLEMENT ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1913
    asciivalue <= 16r1EFF ifTrue:[ ^ #LATIN_EXTENDED_ADDITIONAL ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1914
    asciivalue <= 16r1FFF ifTrue:[ ^ #GREEK_EXTENDED ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1915
    asciivalue <= 16r206F ifTrue:[ ^ #GENERAL_PUNKTUATION ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1916
    asciivalue <= 16r209F ifTrue:[ ^ #SUPERSCRIPTS_AND_SUBSCRIPTS ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1917
    asciivalue <= 16r20CF ifTrue:[ ^ #CURRENCY_SYMBOLS ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1918
    asciivalue < 16r2190 ifTrue:[ ^ #OTHER ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1919
    asciivalue <= 16r21FF ifTrue:[ ^ #ARROWS ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1920
    asciivalue <= 16r22FF ifTrue:[ ^ #MATHEMATICAL_OPERATORS ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1921
    asciivalue <= 16r23FF ifTrue:[ ^ #MISCELLANEOUS_TECHNICAL ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1922
    asciivalue < 16r2600 ifTrue:[ ^ #OTHER ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1923
    asciivalue <= 16r26FF ifTrue:[ ^ #MISCELLANEOUS_SYMBOLS ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1924
    asciivalue <= 16r27BF ifTrue:[ ^ #DINGBATS ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1925
    asciivalue <= 16r27EF ifTrue:[ ^ #MISCELLANEOUS_MATHEMATICAL_SYMBOLS_A ].
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1926
    ^ #OTHER
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1927
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1928
    "
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1929
     (Character value:16r200) unicodeBlock
e91971b89af2 #FEATURE by cg
Claus Gittinger <cg@exept.de>
parents: 19810
diff changeset
  1930
    "
21374
f7820456972c #FEATURE by stefan
Stefan Vogel <sv@exept.de>
parents: 20706
diff changeset
  1931
!
f7820456972c #FEATURE by stefan
Stefan Vogel <sv@exept.de>
parents: 20706
diff changeset
  1932
f7820456972c #FEATURE by stefan
Stefan Vogel <sv@exept.de>
parents: 20706
diff changeset
  1933
utf8BytesPerCharacter
f7820456972c #FEATURE by stefan
Stefan Vogel <sv@exept.de>
parents: 20706
diff changeset
  1934
    "return the number of bytes I require for storage in utf-8 encoding"
f7820456972c #FEATURE by stefan
Stefan Vogel <sv@exept.de>
parents: 20706
diff changeset
  1935
f7820456972c #FEATURE by stefan
Stefan Vogel <sv@exept.de>
parents: 20706
diff changeset
  1936
    asciivalue <= 16r7F ifTrue:[^ 1].
f7820456972c #FEATURE by stefan
Stefan Vogel <sv@exept.de>
parents: 20706
diff changeset
  1937
    asciivalue <= 16r7FF ifTrue:[^ 2].
f7820456972c #FEATURE by stefan
Stefan Vogel <sv@exept.de>
parents: 20706
diff changeset
  1938
    asciivalue <= 16rFFFF ifTrue:[^ 3].
f7820456972c #FEATURE by stefan
Stefan Vogel <sv@exept.de>
parents: 20706
diff changeset
  1939
    asciivalue <= 16r1FFFFF ifTrue:[^ 4].
22251
a1ebd9c9af37 #BUGFIX by sr
sr
parents: 22018
diff changeset
  1940
    asciivalue <= 16r3FFFFFF ifTrue:[^ 5].
a1ebd9c9af37 #BUGFIX by sr
sr
parents: 22018
diff changeset
  1941
    asciivalue <= 16r3FFFFFFF ifTrue:[^ 6].
a1ebd9c9af37 #BUGFIX by sr
sr
parents: 22018
diff changeset
  1942
a1ebd9c9af37 #BUGFIX by sr
sr
parents: 22018
diff changeset
  1943
    self error:'character cannot represented as utf8 (too large, only 31bit supported)'
21374
f7820456972c #FEATURE by stefan
Stefan Vogel <sv@exept.de>
parents: 20706
diff changeset
  1944
f7820456972c #FEATURE by stefan
Stefan Vogel <sv@exept.de>
parents: 20706
diff changeset
  1945
    "Created: / 07-02-2017 / 14:35:56 / stefan"
8004
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1946
! !
a09a6a745e65 asLowercase/asUppercase and friends care for 16-bit chars
Claus Gittinger <cg@exept.de>
parents: 7997
diff changeset
  1947
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1948
!Character methodsFor:'testing'!
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1949
54
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
  1950
isCharacter
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
  1951
    "return true,  if the receiver is some kind of character"
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
  1952
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
  1953
    ^ true
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
  1954
!
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
  1955
3667
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1956
isControlCharacter
8097
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
  1957
    "return true if I am a control character (i.e. ascii value < 32 or == 16rFF)"
3667
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1958
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1959
%{  /* NOCONTEXT */
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1960
    REGISTER INT val;
3667
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1961
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1962
    val = __intVal(__INST(asciivalue));
8097
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
  1963
    if (val < ' ' || val == 0xFF) {
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  1964
	RETURN ( true );
3667
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1965
    }
5423
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1966
    RETURN (false);
3667
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1967
%}.
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1968
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1969
    "
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1970
     (Character value:1) isControlCharacter
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1971
     $a isControlCharacter
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1972
    "
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1973
!
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1974
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1975
isDigit
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1976
    "return true, if I am a digit (i.e. $0 .. $9)"
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1977
7980
058af5174df2 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 7979
diff changeset
  1978
%{  /* NOCONTEXT */
6527
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1979
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1980
    REGISTER INT val;
6527
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1981
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1982
    val = __intVal(__INST(asciivalue));
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  1983
    if ((unsigned INT)(val - '0') <= ('9' - '0')) {
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1984
	RETURN ( true );
6527
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1985
    }
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1986
    RETURN ( false );
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1987
%}.
8097
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
  1988
    ^ asciivalue between:$0 codePoint and:$9 codePoint
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1989
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1990
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1991
isDigitRadix:r
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1992
    "return true, if I am a digit of a base r number"
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1993
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1994
    (asciivalue < $0 codePoint) ifTrue:[^ false].
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1995
    (r > 10) ifTrue:[
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1996
	(asciivalue <= $9 codePoint) ifTrue:[
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1997
	    ^ true
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1998
	].
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  1999
	((asciivalue - $a codePoint) between:0 and:(r - 11)) ifTrue:[
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2000
	    ^ true
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2001
	].
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2002
	^ (asciivalue - $A codePoint) between:0 and:(r - 11)
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2003
    ].
8097
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
  2004
    (asciivalue - $0 codePoint) < r ifTrue:[^ true].
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2005
    ^ false
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2006
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2007
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2008
isEndOfLineCharacter
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2009
    "return true if I am a line delimitting character"
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2010
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2011
%{  /* NOCONTEXT */
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2012
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2013
    REGISTER INT val;
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2014
1133
961f2b095c22 underline cleanup
Claus Gittinger <cg@exept.de>
parents: 1007
diff changeset
  2015
    val = __intVal(__INST(asciivalue));
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2016
    if ((val == '\n')
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2017
     || (val == '\r')
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2018
     || (val == '\f')) {
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2019
	RETURN ( true );
54
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
  2020
    }
5423
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  2021
    RETURN (false);
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  2022
%}.
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  2023
    ^ asciivalue == 16r0A
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  2024
      or:[asciivalue == 16r0D
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  2025
      or:[asciivalue == 16r0C]]
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  2026
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2027
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2028
5473
c48d8c45c740 isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents: 5471
diff changeset
  2029
isImmediate
c48d8c45c740 isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents: 5471
diff changeset
  2030
    "return true if I am an immediate object
c48d8c45c740 isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents: 5471
diff changeset
  2031
     i.e. I am represented in the pointer itself and
c48d8c45c740 isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents: 5471
diff changeset
  2032
     no real object header/storage is used me.
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2033
     For VW compatibility, shared characters (i.e. in the range 0..MAX_IMMEDIATE_CHARACTER)
5473
c48d8c45c740 isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents: 5471
diff changeset
  2034
     also return true here"
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2035
8097
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
  2036
%{  /* NOCONTEXT */
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
  2037
    if (__smallIntegerVal(__INST(asciivalue)) <= MAX_IMMEDIATE_CHARACTER) {
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2038
	RETURN ( true );
8097
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
  2039
    }
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
  2040
%}.
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
  2041
    ^ false
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
  2042
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
  2043
    "
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2044
	$a isImmediate.
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2045
	(Character value:255) isImmediate.
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2046
	(Character value:256) isImmediate.
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2047
	(Character value:1566) isImmediate.
8097
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
  2048
    "
5473
c48d8c45c740 isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents: 5471
diff changeset
  2049
!
c48d8c45c740 isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents: 5471
diff changeset
  2050
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2051
isLetter
7979
7515722ccfb1 isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents: 7976
diff changeset
  2052
    "return true, if I am a letter in the 'a'..'z' range.
7515722ccfb1 isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents: 7976
diff changeset
  2053
     Use isNationalLetter, if you are interested in those."
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2054
7980
058af5174df2 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 7979
diff changeset
  2055
%{  /* NOCONTEXT */
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2056
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2057
    REGISTER INT val;
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2058
1133
961f2b095c22 underline cleanup
Claus Gittinger <cg@exept.de>
parents: 1007
diff changeset
  2059
    val = __intVal(__INST(asciivalue));
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2060
    if ((unsigned INT)(val - 'a') <= ('z' - 'a')) {
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2061
	RETURN ( true );
6527
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  2062
    }
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2063
    if ((unsigned INT)(val - 'A') <= ('Z' - 'A')) {
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2064
	RETURN ( true );
6527
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  2065
    }
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  2066
    RETURN ( false );
5423
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  2067
%}.
8097
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
  2068
    ^ (asciivalue between:($a codePoint) and:($z codePoint))
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
  2069
      or:[(asciivalue between:($A codePoint) and:($Z codePoint))]
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2070
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2071
154
d4236ec280a6 *** empty log message ***
claus
parents: 92
diff changeset
  2072
isLetterOrDigit
7979
7515722ccfb1 isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents: 7976
diff changeset
  2073
    "return true, if I am a letter (a..z or A..Z) or a digit (0..9)
7515722ccfb1 isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents: 7976
diff changeset
  2074
     Use isNationalAlphaNumeric, if you are interested in those."
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2075
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2076
%{  /* NOCONTEXT */
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2077
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2078
    REGISTER INT val;
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2079
1133
961f2b095c22 underline cleanup
Claus Gittinger <cg@exept.de>
parents: 1007
diff changeset
  2080
    val = __intVal(__INST(asciivalue));
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2081
    if ((unsigned INT)(val - 'a') <= ('z' - 'a')) {
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2082
	RETURN ( true );
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2083
    }
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2084
    if ((unsigned INT)(val - 'A') <= ('Z' - 'A')) {
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2085
	RETURN ( true );
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2086
    }
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2087
    if ((unsigned INT)(val - '0') <= ('9' - '0')) {
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2088
	RETURN ( true );
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2089
    }
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2090
    RETURN ( false );
5423
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  2091
%}.
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  2092
    ^ self isLetter or:[self isDigit]
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2093
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2094
17440
6c2d9887ba90 class: Character
Claus Gittinger <cg@exept.de>
parents: 17346
diff changeset
  2095
isLetterOrUnderline
6c2d9887ba90 class: Character
Claus Gittinger <cg@exept.de>
parents: 17346
diff changeset
  2096
    "return true, if I am a letter or $_"
6c2d9887ba90 class: Character
Claus Gittinger <cg@exept.de>
parents: 17346
diff changeset
  2097
6c2d9887ba90 class: Character
Claus Gittinger <cg@exept.de>
parents: 17346
diff changeset
  2098
    ^ self == $_ or:[ self isLetter ]
6c2d9887ba90 class: Character
Claus Gittinger <cg@exept.de>
parents: 17346
diff changeset
  2099
!
6c2d9887ba90 class: Character
Claus Gittinger <cg@exept.de>
parents: 17346
diff changeset
  2100
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2101
isLowercase
7979
7515722ccfb1 isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents: 7976
diff changeset
  2102
    "return true, if I am a lower-case letter.
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  2103
     This one does care for national characters.
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2104
     Caveat:
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2105
	only returns the correct value for codes up to u+1d6ff (Unicode3.1).
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2106
	(which is more than mozilla does, btw. ;-)"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2107
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2108
%{  /* NOCONTEXT */
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2109
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2110
    REGISTER unsigned INT val;
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2111
    REGISTER unsigned int lowByte;
7989
7907420b2fab asUppercase / asLowercase for U0100..U04FF
Claus Gittinger <cg@exept.de>
parents: 7988
diff changeset
  2112
#define TRUE_IF_ODD(x)      ((x & 1) ? true : false)
7907420b2fab asUppercase / asLowercase for U0100..U04FF
Claus Gittinger <cg@exept.de>
parents: 7988
diff changeset
  2113
#define TRUE_IF_EVEN(x)     ((x & 1) ? false : true)
154
d4236ec280a6 *** empty log message ***
claus
parents: 92
diff changeset
  2114
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2115
    /* because used so often, this is open coded, instead of table driven */
1133
961f2b095c22 underline cleanup
Claus Gittinger <cg@exept.de>
parents: 1007
diff changeset
  2116
    val = __intVal(__INST(asciivalue));
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2117
    lowByte = val & 0xFF;
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2118
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2119
    /* the most likely case here, outside the switch */
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2120
    if (val <= 0xFF) {
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2121
	if ((unsigned)(lowByte - 'a') <= ('z' - 'a')) {
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2122
	    RETURN ( true );
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2123
	}
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2124
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2125
	/* iso8859 puts national lower case characters at e0 .. ff */
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2126
	if ((lowByte >= 0xDF) && (lowByte <= 0xFF)) {
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2127
	    if (lowByte != 0xF7) {
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2128
		RETURN(true);
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2129
	    }
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2130
	}
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2131
	if (lowByte == 0xAA) RETURN(true);     /* FEMININE ORDINAL INDICATOR (high a-underline) */
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2132
	if (lowByte == 0xB5) RETURN(true);     /* MICRO SIGN */
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2133
	if (lowByte == 0xBA) RETURN(true);     /* MASCULINE ORDINAL INDICATOR (high o-underline) */
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2134
	RETURN (false);
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2135
    }
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2136
7989
7907420b2fab asUppercase / asLowercase for U0100..U04FF
Claus Gittinger <cg@exept.de>
parents: 7988
diff changeset
  2137
    switch (val >> 8) {
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2138
	case 0x01:
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2139
	    if (lowByte <= 0x37) { RETURN (TRUE_IF_ODD(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2140
	    if (lowByte <= 0x48) { RETURN (TRUE_IF_EVEN(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2141
	    if (lowByte <= 0x78) { RETURN (TRUE_IF_ODD(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2142
	    if (lowByte <= 0x7E) { RETURN (TRUE_IF_EVEN(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2143
	    if (lowByte <= 0x80) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2144
	    if (lowByte < 0xCD) {
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2145
		if (lowByte == 0x81) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2146
		if (lowByte <= 0x85) {
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2147
		    RETURN (TRUE_IF_ODD(lowByte));
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2148
		}
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2149
		if (lowByte == 0x88) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2150
		if (lowByte == 0x8C) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2151
		if (lowByte == 0x8D) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2152
		if (lowByte == 0x92) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2153
		if (lowByte == 0x95) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2154
		if (lowByte == 0x99) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2155
		if (lowByte == 0x9A) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2156
		if (lowByte == 0x9B) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2157
		if (lowByte == 0x9E) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2158
		if (lowByte <= 0xA0) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2159
		if (lowByte <= 0xA6) { RETURN (TRUE_IF_ODD(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2160
		if (lowByte <= 0xAA) { RETURN (TRUE_IF_EVEN(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2161
		if (lowByte <= 0xAE) { RETURN (TRUE_IF_ODD(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2162
		if (lowByte == 0xB2) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2163
		if (lowByte <= 0xB6) { RETURN (TRUE_IF_EVEN(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2164
		if (lowByte == 0xB9) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2165
		if (lowByte == 0xBA) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2166
		if (lowByte == 0xBD) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2167
		if (lowByte == 0xBE) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2168
		if (lowByte == 0xBF) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2169
		if (lowByte == 0xC6) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2170
		if (lowByte == 0xC9) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2171
		if (lowByte == 0xCC) { RETURN (true); }
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2172
		RETURN (false);
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2173
	    }
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2174
	    if (lowByte <= 0xDC) { RETURN (TRUE_IF_EVEN(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2175
	    if (lowByte <= 0xEF) { RETURN (TRUE_IF_ODD(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2176
	    if (lowByte == 0xF0) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2177
	    if (lowByte == 0xF1) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2178
	    if (lowByte == 0xF2) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2179
	    if (lowByte == 0xF3) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2180
	    if (lowByte <= 0xF6) { RETURN (TRUE_IF_ODD(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2181
	    if (lowByte == 0xF7) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2182
	    RETURN (TRUE_IF_ODD(lowByte));
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2183
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2184
	case 0x02:
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2185
	    if (lowByte <= 0x33) { RETURN (TRUE_IF_ODD(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2186
	    if (lowByte <= 0x36) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2187
	    if (lowByte < 0x50) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2188
	    if (lowByte < 0xB0) { RETURN (true); }
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2189
	    RETURN (false);
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2190
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2191
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2192
	case 0x03:
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2193
	    if (lowByte == 0x90) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2194
	    if (lowByte <= 0xAB) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2195
	    if (lowByte <= 0xD1) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2196
	    if (lowByte == 0xD5) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2197
	    if (lowByte == 0xD6) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2198
	    if (lowByte < 0xD7) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2199
	    if (lowByte <= 0xEF) { RETURN (TRUE_IF_ODD(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2200
	    if (lowByte <= 0xF3) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2201
	    if (lowByte == 0xF5) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2202
#ifndef UNICODE_3_2
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2203
	    if (lowByte == 0xF8) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2204
	    if (lowByte == 0xFB) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2205
#endif
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2206
	    RETURN (false);
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2207
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2208
	case 0x04:
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2209
	    if (lowByte <= 0x2F) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2210
	    if (lowByte <= 0x5F) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2211
	    if (lowByte <= 0x81) { RETURN (TRUE_IF_ODD(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2212
	    if (lowByte < 0x8A) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2213
	    if (lowByte <= 0xC0) { RETURN (TRUE_IF_ODD(lowByte)); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2214
#ifdef UNICODE_3_2
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2215
	    if (lowByte == 0xC5) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2216
#endif
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2217
#ifdef UNICODE_3_2
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2218
	    if (lowByte <= 0xC8) { RETURN (TRUE_IF_EVEN(lowByte)); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2219
#else
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2220
	    if (lowByte <= 0xCA) { RETURN (TRUE_IF_EVEN(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2221
	    if (lowByte == 0xCD) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2222
	    if (lowByte == 0xCE) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2223
#endif
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2224
	    if (lowByte == 0xCB) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2225
	    if (lowByte == 0xCC) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2226
	    RETURN (TRUE_IF_ODD(lowByte));
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2227
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2228
	case 0x05:
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2229
	    if (lowByte <= 0x0F) { RETURN (TRUE_IF_ODD(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2230
	    if (lowByte < 0x61) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2231
	    if (lowByte <= 0x87) { RETURN (true); }
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2232
	    RETURN (false);
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2233
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2234
	case 0x1D:
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2235
#ifndef UNICODE_3_2
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2236
	    if (lowByte <= 0x2B) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2237
	    if (lowByte <= 0x61) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2238
	    if (lowByte <= 0x70) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2239
#endif
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2240
	    RETURN (false);
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2241
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2242
	case 0x1E:
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2243
	    if (lowByte < 0x96) { RETURN (TRUE_IF_ODD(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2244
	    if (lowByte <= 0x9F) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2245
	    RETURN (TRUE_IF_ODD(lowByte));
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2246
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2247
	case 0x1F:
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2248
	    if (lowByte <= 0x6F) {
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2249
		if (lowByte & 0x0008) { RETURN (false); }
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2250
		RETURN (true);
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2251
	    }
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2252
	    if (lowByte <= 0x87) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2253
	    if (lowByte < 0xB8) {
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2254
		if (lowByte & 0x0008) { RETURN (false); }
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2255
		RETURN (true);
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2256
	    }
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2257
	    if (lowByte == 0xBE) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2258
	    if (lowByte == 0xD4) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2259
	    if (lowByte == 0xC5) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2260
	    if (lowByte == 0xD5) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2261
	    if (lowByte == 0xC1) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2262
	    if (lowByte == 0xF1) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2263
	    if (lowByte == 0xC0) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2264
	    if (lowByte == 0xF0) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2265
	    if (((lowByte & 0x000F) >= 0x0000) && ((lowByte & 0x000F) <= 0x0007)) { RETURN (true); }
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2266
	    RETURN (false);
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2267
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2268
	case 0x20:
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2269
#ifndef UNICODE_3_2
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2270
	    if (lowByte == 0x71) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2271
#endif
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2272
	    if (lowByte == 0x7F) { RETURN (true); }
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2273
	    RETURN (false);
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2274
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2275
	case 0x21:
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2276
	    if (lowByte == 0x0A) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2277
	    if (lowByte < 0x0E) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2278
	    if (lowByte <= 0x0F) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2279
	    if (lowByte == 0x13) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2280
	    if (lowByte == 0x2F) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2281
	    if (lowByte == 0x34) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2282
	    if (lowByte == 0x39) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2283
#ifndef UNICODE_3_2
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2284
	    if (lowByte == 0x3D) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2285
	    if (lowByte <= 0x45) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2286
	    if (lowByte <= 0x49) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2287
#endif
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2288
	    RETURN (false);
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2289
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2290
	case 0xFB:
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2291
	    if (lowByte <= 0x1C) { RETURN (true); }
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2292
	    RETURN (false);
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2293
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2294
	case 0xFF:
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2295
	    if ((lowByte >= 0x41) && (lowByte <= 0x5A)) { RETURN (true); }
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2296
	    RETURN (false);
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2297
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2298
	case 0x104:
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2299
	    if (lowByte <= 0x27) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2300
	    if (lowByte <= 0x4D) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2301
#ifdef UNICODE_3_2
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2302
	    if (lowByte <= 0x4D) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2303
#else
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2304
	    if (lowByte <= 0x4F) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2305
#endif
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2306
	    RETURN (false);
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2307
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2308
	case 0x1D4:
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2309
	case 0x1D5:
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2310
	case 0x1D6:
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2311
	    if (val <= 0x1D419) { RETURN (false); }
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2312
	    if (val < 0x1D6be) {
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2313
		if (((val - 0x1D41A) % 52) <= 25) { RETURN (true); }
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2314
		RETURN (false);
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2315
	    }
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2316
	    if (val < 0x1D6c2) { RETURN (false); }
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2317
	    if (val < 0x1D6db) { RETURN (true); }
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2318
	    if (val == 0x1D6db) { RETURN (false); }
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2319
	    if (val < 0x1D6e2) { RETURN (true); }
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2320
	    if (val <= 0x1D6fb) { RETURN (false); }
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2321
	    RETURN (true);
13558
8257cf70c967 changed: #isLowercase
Claus Gittinger <cg@exept.de>
parents: 13499
diff changeset
  2322
8257cf70c967 changed: #isLowercase
Claus Gittinger <cg@exept.de>
parents: 13499
diff changeset
  2323
#ifdef UNICODE_4
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2324
	case 0x1D7:
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2325
	    if (lowByte <= 0x1b) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2326
	    if (lowByte <= 0x35) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2327
	    if (lowByte <= 0x55) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2328
	    if (lowByte <= 0x6f) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2329
	    if (lowByte <= 0x8F) { RETURN (true); }
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2330
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2331
	    RETURN (false);
13558
8257cf70c967 changed: #isLowercase
Claus Gittinger <cg@exept.de>
parents: 13499
diff changeset
  2332
#endif
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2333
    }
7989
7907420b2fab asUppercase / asLowercase for U0100..U04FF
Claus Gittinger <cg@exept.de>
parents: 7988
diff changeset
  2334
#undef TRUE_IF_ODD
7907420b2fab asUppercase / asLowercase for U0100..U04FF
Claus Gittinger <cg@exept.de>
parents: 7988
diff changeset
  2335
#undef TRUE_IF_EVEN
7907420b2fab asUppercase / asLowercase for U0100..U04FF
Claus Gittinger <cg@exept.de>
parents: 7988
diff changeset
  2336
    RETURN (false);
5423
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  2337
%}.
13558
8257cf70c967 changed: #isLowercase
Claus Gittinger <cg@exept.de>
parents: 13499
diff changeset
  2338
8257cf70c967 changed: #isLowercase
Claus Gittinger <cg@exept.de>
parents: 13499
diff changeset
  2339
    "Modified: / 05-08-2011 / 18:56:33 / cg"
154
d4236ec280a6 *** empty log message ***
claus
parents: 92
diff changeset
  2340
!
d4236ec280a6 *** empty log message ***
claus
parents: 92
diff changeset
  2341
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2342
isPrintable
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2343
    "return true, if the receiver is a useful printable character
19810
eb006895e0f9 #DOCUMENTATION by cg
Claus Gittinger <cg@exept.de>
parents: 19756
diff changeset
  2344
     (see fileBrowser's showFile:-method on how it can be used)"
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2345
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2346
    (asciivalue between:32 and:127) ifTrue:[^ true].
6398
1a32afa7b397 ff isPrintable -> true
Claus Gittinger <cg@exept.de>
parents: 6327
diff changeset
  2347
    asciivalue == 12 ifTrue:[^ true].   "/ FF
1a32afa7b397 ff isPrintable -> true
Claus Gittinger <cg@exept.de>
parents: 6327
diff changeset
  2348
    asciivalue == 13 ifTrue:[^ true].   "/ CR
1a32afa7b397 ff isPrintable -> true
Claus Gittinger <cg@exept.de>
parents: 6327
diff changeset
  2349
    asciivalue == 9 ifTrue:[^ true].    "/ TAB
1a32afa7b397 ff isPrintable -> true
Claus Gittinger <cg@exept.de>
parents: 6327
diff changeset
  2350
    asciivalue == 10 ifTrue:[^ true].   "/ NL
2840
493b24293589 isPrintable
Claus Gittinger <cg@exept.de>
parents: 2672
diff changeset
  2351
8097
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
  2352
    (asciivalue between:16rA0 and:16rBF) ifTrue:[^ true]. "/ ISO-8859
abec585462e8 Fix c-compiler warnings.
Stefan Vogel <sv@exept.de>
parents: 8030
diff changeset
  2353
    ^ self isNationalAlphaNumeric
2840
493b24293589 isPrintable
Claus Gittinger <cg@exept.de>
parents: 2672
diff changeset
  2354
493b24293589 isPrintable
Claus Gittinger <cg@exept.de>
parents: 2672
diff changeset
  2355
    "Modified: 7.8.1997 / 17:05:24 / cg"
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2356
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2357
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2358
isSeparator
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2359
    "return true if I am a space, cr, tab, nl, or newPage"
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2360
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2361
%{  /* NOCONTEXT */
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2362
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2363
    REGISTER INT val;
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2364
1133
961f2b095c22 underline cleanup
Claus Gittinger <cg@exept.de>
parents: 1007
diff changeset
  2365
    val = __intVal(__INST(asciivalue));
328
claus
parents: 314
diff changeset
  2366
    if (val <= ' ')
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2367
	if ((val == ' ')
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2368
	 || (val == '\n')
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2369
	 || (val == '\t')
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2370
	 || (val == '\r')
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2371
	 || (val == '\f')) {
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2372
	    RETURN ( true );
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2373
	}
5423
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  2374
    RETURN (false);
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2375
%}.
5423
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  2376
    ^ (asciivalue == 16r20)
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  2377
      or:[asciivalue == 16r0D
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  2378
      or:[asciivalue == 16r0A
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  2379
      or:[asciivalue == 16r09
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  2380
      or:[asciivalue == 16r0C]]]]
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2381
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2382
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2383
isUppercase
7979
7515722ccfb1 isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents: 7976
diff changeset
  2384
    "return true, if I am an upper-case letter.
8022
d901d41171a5 isXXX and asXXX are now valid up to U+1d6FF
Claus Gittinger <cg@exept.de>
parents: 8012
diff changeset
  2385
     This one does care for national characters.
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2386
     Caveat:
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2387
	only returns the correct value for codes up to u+1d6ff (Unicode3.1).
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2388
	(which is more than mozilla does, btw. ;-)"
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2389
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2390
%{  /* NOCONTEXT */
7988
cb1c920e67eb isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents: 7987
diff changeset
  2391
#define TRUE_IF_ODD(x)      ((x & 1) ? true : false)
cb1c920e67eb isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents: 7987
diff changeset
  2392
#define TRUE_IF_EVEN(x)     ((x & 1) ? false : true)
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2393
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2394
    /* because used so often, this is open coded, instead of table driven */
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2395
    REGISTER unsigned INT val;
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2396
    REGISTER int lowByte;
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2397
1133
961f2b095c22 underline cleanup
Claus Gittinger <cg@exept.de>
parents: 1007
diff changeset
  2398
    val = __intVal(__INST(asciivalue));
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2399
    lowByte = val & 0xFF;
7979
7515722ccfb1 isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents: 7976
diff changeset
  2400
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2401
    /* the most likely case here, outside the switch */
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2402
    if (val <= 0xFF) {
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2403
	if ((unsigned int)(lowByte - 'A') <= ('Z' - 'A')) {
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2404
	    RETURN ( true );
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2405
	}
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2406
	/* iso8859 puts national upper case characters at c0 .. df */
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2407
	if ((lowByte >= 0xC0) && (lowByte <= 0xDE)) {
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2408
	    if (lowByte != 0xD7) {
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2409
		RETURN(true);
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2410
	    }
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2411
	}
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2412
	RETURN (false);
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2413
    }
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2414
7988
cb1c920e67eb isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents: 7987
diff changeset
  2415
    switch (val >> 8) {
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2416
	case 0x01:
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2417
	    if (lowByte <= 0x37) { RETURN (TRUE_IF_EVEN(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2418
	    if (lowByte <= 0x48) { RETURN (TRUE_IF_ODD(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2419
	    if (lowByte <= 0x78) { RETURN (TRUE_IF_EVEN(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2420
	    if (lowByte <= 0x7E) { RETURN (TRUE_IF_ODD(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2421
	    if (lowByte < 0xCD) {
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2422
		if (lowByte == 0x80) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2423
		if (lowByte == 0x81) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2424
		if (lowByte <= 0x86) {
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2425
		    RETURN (TRUE_IF_EVEN(lowByte));
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2426
		}
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2427
		if (lowByte <= 0x89) {
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2428
		    RETURN (TRUE_IF_ODD(lowByte));
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2429
		}
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2430
		if (lowByte <= 0x8B) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2431
		if (lowByte <= 0x8D) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2432
		if (lowByte <= 0x91) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2433
		if (lowByte == 0x93) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2434
		if (lowByte == 0x94) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2435
		if (lowByte == 0x96) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2436
		if (lowByte == 0x97) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2437
		if (lowByte == 0x98) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2438
		if (lowByte == 0x9C) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2439
		if (lowByte == 0x9D) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2440
		if (lowByte == 0x9F) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2441
		if (lowByte < 0xA0) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2442
		if (lowByte <= 0xA6) { RETURN (TRUE_IF_EVEN(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2443
		if (lowByte <= 0xAA) { RETURN (TRUE_IF_ODD(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2444
		if (lowByte <= 0xAE) { RETURN (TRUE_IF_EVEN(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2445
		if (lowByte == 0xB2) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2446
		if (lowByte <= 0xB7) { RETURN (TRUE_IF_ODD(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2447
		if (lowByte == 0xB8) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2448
		if (lowByte == 0xBC) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2449
		if (lowByte == 0xC4) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2450
		if (lowByte == 0xC7) { RETURN (true); }
8308
19a958b21ac0 no const for hpux
Claus Gittinger <cg@exept.de>
parents: 8222
diff changeset
  2451
#if 0
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2452
		if (lowByte == 0xC8) { RETURN (true); }
8308
19a958b21ac0 no const for hpux
Claus Gittinger <cg@exept.de>
parents: 8222
diff changeset
  2453
#endif
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2454
		if (lowByte == 0xCA) { RETURN (true); }
8308
19a958b21ac0 no const for hpux
Claus Gittinger <cg@exept.de>
parents: 8222
diff changeset
  2455
#if 0
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2456
		if (lowByte == 0xCB) { RETURN (true); }
8308
19a958b21ac0 no const for hpux
Claus Gittinger <cg@exept.de>
parents: 8222
diff changeset
  2457
#endif
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2458
		RETURN (false);         /* WRONG !!! */
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2459
	    }
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2460
	    if (lowByte <= 0xDC) { RETURN (TRUE_IF_ODD(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2461
	    if (lowByte <= 0xEF) { RETURN (TRUE_IF_EVEN(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2462
	    if (lowByte == 0xF0) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2463
	    if (lowByte == 0xF1) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2464
	    if (lowByte == 0xF2) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2465
	    if (lowByte == 0xF3) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2466
	    if (lowByte == 0xF4) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2467
	    if (lowByte == 0xF5) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2468
	    if (lowByte == 0xF6) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2469
	    if (lowByte == 0xF7) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2470
	    RETURN (TRUE_IF_EVEN(lowByte));
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2471
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2472
	case 0x02:
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2473
	    if (lowByte <= 0x33) { RETURN (TRUE_IF_EVEN(lowByte)); }
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2474
	    RETURN (false);
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2475
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2476
	case 0x03:
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2477
	    if (lowByte < 0x86) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2478
	    if (lowByte == 0x87) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2479
	    if (lowByte == 0x90) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2480
	    if (lowByte <= 0xAB) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2481
	    if (lowByte <= 0xD1) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2482
	    if (lowByte <= 0xD4) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2483
	    if (lowByte <= 0xD7) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2484
	    if (lowByte <= 0xEF) { RETURN (TRUE_IF_EVEN(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2485
	    if (lowByte == 0xF4) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2486
#ifndef UNICODE_3_2
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2487
	    if (lowByte == 0xF7) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2488
	    if (lowByte == 0xF9) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2489
	    if (lowByte == 0xFa) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2490
#endif
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2491
	    RETURN (false);
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2492
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2493
	case 0x04:
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2494
	    if (lowByte <= 0x2F) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2495
	    if (lowByte <= 0x5F) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2496
	    if (lowByte <= 0x81) { RETURN (TRUE_IF_EVEN(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2497
	    if (lowByte < 0x8A) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2498
	    if (lowByte <= 0xC0) { RETURN (TRUE_IF_EVEN(lowByte)); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2499
#ifdef UNICODE_3_2
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2500
	    if (lowByte == 0xC5) { RETURN (false); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2501
#endif
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2502
#ifdef UNICODE_3_2
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2503
	    if (lowByte <= 0xC8) { RETURN (TRUE_IF_ODD(lowByte)); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2504
#else
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2505
	    if (lowByte <= 0xCA) { RETURN (TRUE_IF_ODD(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2506
	    if (lowByte == 0xCD) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2507
	    if (lowByte == 0xCE) { RETURN (false); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2508
#endif
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2509
	    if (lowByte == 0xCB) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2510
	    if (lowByte == 0xCC) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2511
	    RETURN (TRUE_IF_EVEN(lowByte));
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2512
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2513
	case 0x05:
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2514
	    if (lowByte <= 0x0F) { RETURN (TRUE_IF_EVEN(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2515
	    if (lowByte < 0x31) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2516
	    if (lowByte <= 0x56) { RETURN (true); }
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2517
	    RETURN (false);
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2518
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2519
	case 0x10:
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2520
	    if (lowByte < 0xA0) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2521
	    if (lowByte <= 0xCF) { RETURN (true); }
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2522
	    RETURN (false);
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2523
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2524
	case 0x1E:
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2525
	    if (lowByte < 0x96) { RETURN (TRUE_IF_EVEN(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2526
	    if (lowByte < 0xA0) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2527
	    RETURN (TRUE_IF_EVEN(lowByte));
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2528
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2529
	case 0x1F:
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2530
	    if (lowByte <= 0x6F) {
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2531
		if (lowByte & 0x0008) { RETURN (true); }
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2532
	    }
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2533
	    if (lowByte <= 0x87) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2534
	    if (lowByte < 0xB8) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2535
	    if (lowByte < 0xBC) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2536
	    if (lowByte == 0xEC) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2537
	    if (((lowByte & 0x000F) >= 0x0008) && ((lowByte & 0x000F) <= 0x000B)) { RETURN (true); }
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2538
	    RETURN (false);
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2539
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2540
	case 0x21:
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2541
	    if (lowByte == 0x02) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2542
	    if (lowByte == 0x07) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2543
	    if (lowByte < 0x0B) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2544
	    if (lowByte < 0x0E) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2545
	    if (lowByte == 0x10) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2546
	    if (lowByte == 0x11) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2547
	    if (lowByte == 0x12) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2548
	    if (lowByte == 0x15) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2549
	    if (lowByte == 0x19) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2550
	    if (lowByte == 0x1A) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2551
	    if (lowByte == 0x1B) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2552
	    if (lowByte == 0x1C) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2553
	    if (lowByte == 0x1D) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2554
	    if (lowByte < 0x24) { RETURN (false); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2555
	    if (lowByte <= 0x2A) { RETURN (TRUE_IF_EVEN(lowByte)); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2556
	    if (lowByte == 0x2B) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2557
	    if (lowByte == 0x2C) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2558
	    if (lowByte == 0x2D) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2559
	    if (lowByte == 0x30) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2560
	    if (lowByte == 0x31) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2561
	    if (lowByte == 0x33) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2562
#ifndef UNICODE_3_2
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2563
	    if (lowByte == 0x3E) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2564
	    if (lowByte == 0x3F) { RETURN (true); }
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2565
	    if (lowByte == 0x45) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2566
#endif
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2567
	    RETURN (false);
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2568
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2569
	case 0xFF:
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2570
	    if ((lowByte >= 0x21) && (lowByte <= 0x3A)) { RETURN (true); }
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2571
	    RETURN (false);
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2572
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2573
	case 0x104:
18658
d1665870d020 better code for asLowercase and asUppercase
Claus Gittinger <cg@exept.de>
parents: 18656
diff changeset
  2574
	    if (lowByte <= 0x27) { RETURN (true); }
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2575
	    RETURN (false);
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2576
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2577
	case 0x1D4:
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2578
	case 0x1D5:
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2579
	case 0x1D6:
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2580
	    if (val <= 0x1D419) { RETURN (true); }
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2581
	    if (val < 0x1D6be) {
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2582
		if (((val - 0x1D41A) % 52) <= 25) { RETURN (false); }
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2583
		RETURN (true);
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2584
	    }
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2585
	    if (val < 0x1D6c1) { RETURN (true); }
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2586
	    if (val < 0x1D6e2) { RETURN (false); }
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2587
	    if (val < 0x1D6fb) { RETURN (true); }
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2588
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2589
	    RETURN (false);
7979
7515722ccfb1 isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents: 7976
diff changeset
  2590
    }
7988
cb1c920e67eb isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents: 7987
diff changeset
  2591
    RETURN (false);
7985
4a28f0e3832d checkin from browser
Claus Gittinger <cg@exept.de>
parents: 7980
diff changeset
  2592
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2593
#undef TRUE_IF_ODD
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2594
#undef TRUE_IF_EVEN
7988
cb1c920e67eb isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents: 7987
diff changeset
  2595
%}
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2596
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2597
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2598
isVowel
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2599
    "return true, if I am a vowel (lower- or uppercase)"
333
claus
parents: 328
diff changeset
  2600
6066
ca508f7658dc comment
Claus Gittinger <cg@exept.de>
parents: 6029
diff changeset
  2601
    "/ I know the code is ugly;
ca508f7658dc comment
Claus Gittinger <cg@exept.de>
parents: 6029
diff changeset
  2602
    "/ better code is:
ca508f7658dc comment
Claus Gittinger <cg@exept.de>
parents: 6029
diff changeset
  2603
    "/     'aeiou' includes:self asLowercase
ca508f7658dc comment
Claus Gittinger <cg@exept.de>
parents: 6029
diff changeset
  2604
    "/ or:
ca508f7658dc comment
Claus Gittinger <cg@exept.de>
parents: 6029
diff changeset
  2605
    "/     'aeiouAEIOU' includes:self
ca508f7658dc comment
Claus Gittinger <cg@exept.de>
parents: 6029
diff changeset
  2606
    "/
ca508f7658dc comment
Claus Gittinger <cg@exept.de>
parents: 6029
diff changeset
  2607
    "/ until I have a smart compiler, I use the shorter (codewise):
ca508f7658dc comment
Claus Gittinger <cg@exept.de>
parents: 6029
diff changeset
  2608
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2609
    (self == $a) ifTrue:[^ true].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2610
    (self == $e) ifTrue:[^ true].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2611
    (self == $i) ifTrue:[^ true].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2612
    (self == $o) ifTrue:[^ true].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2613
    (self == $u) ifTrue:[^ true].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2614
    (self == $A) ifTrue:[^ true].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2615
    (self == $E) ifTrue:[^ true].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2616
    (self == $I) ifTrue:[^ true].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2617
    (self == $O) ifTrue:[^ true].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2618
    (self == $U) ifTrue:[^ true].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2619
    ^ false
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  2620
! !
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  2621
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2622
!Character methodsFor:'testing - national'!
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2623
17188
6a19e6954c3e class: Character
Claus Gittinger <cg@exept.de>
parents: 17184
diff changeset
  2624
asNonDiacritical
6a19e6954c3e class: Character
Claus Gittinger <cg@exept.de>
parents: 17184
diff changeset
  2625
    "return a new character which represents the receiver without diacritics.
6a19e6954c3e class: Character
Claus Gittinger <cg@exept.de>
parents: 17184
diff changeset
  2626
     This is used with string search and when lists are to be ordered/sorted by base character order.
6a19e6954c3e class: Character
Claus Gittinger <cg@exept.de>
parents: 17184
diff changeset
  2627
     CAVEAT:
18215
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2628
	for now, this method is only correct for unicode characters up to u+2FF,
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2629
	i.e. latin languages"
17188
6a19e6954c3e class: Character
Claus Gittinger <cg@exept.de>
parents: 17184
diff changeset
  2630
6a19e6954c3e class: Character
Claus Gittinger <cg@exept.de>
parents: 17184
diff changeset
  2631
%{  /* NOCONTEXT */
6a19e6954c3e class: Character
Claus Gittinger <cg@exept.de>
parents: 17184
diff changeset
  2632
6a19e6954c3e class: Character
Claus Gittinger <cg@exept.de>
parents: 17184
diff changeset
  2633
    REGISTER INT val;
6a19e6954c3e class: Character
Claus Gittinger <cg@exept.de>
parents: 17184
diff changeset
  2634
6a19e6954c3e class: Character
Claus Gittinger <cg@exept.de>
parents: 17184
diff changeset
  2635
    /* because used so often, this is open coded, instead of table driven */
6a19e6954c3e class: Character
Claus Gittinger <cg@exept.de>
parents: 17184
diff changeset
  2636
    val = __intVal(__INST(asciivalue));
6a19e6954c3e class: Character
Claus Gittinger <cg@exept.de>
parents: 17184
diff changeset
  2637
    switch (val >> 8) {
18215
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2638
	case 0x00:
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2639
	    if (val < 0xC0) { RETURN(self); }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2640
	    if (val <= 0xC6) { val = 'A'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2641
	    if (val == 0xC7) { val = 'C'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2642
	    if (val <= 0xCB) { val = 'E'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2643
	    if (val <= 0xCF) { val = 'I'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2644
	    if (val == 0xD0) { val = 'D'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2645
	    if (val == 0xD1) { val = 'N'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2646
	    if (val <= 0xD6) { val = 'O'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2647
	    if (val == 0xD7) { RETURN(self) }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2648
	    if (val == 0xD8) { val = 'O'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2649
	    if (val <= 0xDC) { val = 'U'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2650
	    if (val == 0xDD) { val = 'Y'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2651
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2652
	    if (val < 0xE0) { RETURN(self) }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2653
	    if (val <= 0xE6) { val = 'a'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2654
	    if (val == 0xE7) { val = 'c'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2655
	    if (val <= 0xEB) { val = 'e'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2656
	    if (val <= 0xEF) { val = 'i'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2657
	    if (val == 0xF0) { val = 'd'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2658
	    if (val == 0xF1) { val = 'n'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2659
	    if (val <= 0xF6) { val = 'o'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2660
	    if (val == 0xF7) { RETURN(self) }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2661
	    if (val == 0xF8) { val = 'o'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2662
	    if (val <= 0xFC) { val = 'u'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2663
	    if (val == 0xFD) { val = 'y'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2664
	    if (val == 0xFF) { val = 'y'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2665
	    RETURN (self);
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2666
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2667
	case 0x01:
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2668
	    if (val <= 0x105) { val = (val & 1) ? 'a' : 'A'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2669
	    if (val <= 0x10D) { val = (val & 1) ? 'c' : 'C'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2670
	    if (val <= 0x111) { val = (val & 1) ? 'd' : 'D'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2671
	    if (val <= 0x11B) { val = (val & 1) ? 'e' : 'E'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2672
	    if (val <= 0x123) { val = (val & 1) ? 'g' : 'G'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2673
	    if (val <= 0x127) { val = (val & 1) ? 'h' : 'H'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2674
	    if (val <= 0x133) { val = (val & 1) ? 'i' : 'I'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2675
	    if (val <= 0x137) { val = (val & 1) ? 'k' : 'K'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2676
	    if (val == 0x138) { val = 'K'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2677
	    if (val <= 0x142) { val = (val & 1) ? 'L' : 'l'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2678
	    if (val <= 0x148) { val = (val & 1) ? 'N' : 'n'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2679
	    if (val <= 0x14B) { val = (val & 1) ? 'n' : 'N'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2680
	    if (val <= 0x153) { val = (val & 1) ? 'o' : 'O'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2681
	    if (val <= 0x159) { val = (val & 1) ? 'r' : 'R'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2682
	    if (val <= 0x161) { val = (val & 1) ? 's' : 'S'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2683
	    if (val <= 0x167) { val = (val & 1) ? 't' : 'T'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2684
	    if (val <= 0x173) { val = (val & 1) ? 'u' : 'U'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2685
	    if (val <= 0x175) { val = (val & 1) ? 'w' : 'W'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2686
	    if (val <= 0x178) { val = (val & 1) ? 'y' : 'Y'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2687
	    if (val <= 0x17E) { val = (val & 1) ? 'Z' : 'z'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2688
	    RETURN (self);
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2689
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2690
	case 0x02:
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2691
	    if (val <= 0x203) { val = (val & 1) ? 'a' : 'A'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2692
	    if (val <= 0x207) { val = (val & 1) ? 'e' : 'E'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2693
	    if (val <= 0x20B) { val = (val & 1) ? 'i' : 'I'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2694
	    if (val <= 0x20F) { val = (val & 1) ? 'o' : 'O'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2695
	    if (val <= 0x213) { val = (val & 1) ? 'r' : 'R'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2696
	    if (val <= 0x217) { val = (val & 1) ? 'u' : 'U'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2697
	    if (val <= 0x219) { val = (val & 1) ? 's' : 'S'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2698
	    if (val <= 0x21B) { val = (val & 1) ? 't' : 'T'; break; }
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2699
	    RETURN (self);
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2700
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2701
	case 0x03:
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2702
	    // to be done
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2703
	    RETURN (self);
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2704
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2705
	case 0x04:
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2706
	    // to be done
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2707
	    RETURN (self);
17188
6a19e6954c3e class: Character
Claus Gittinger <cg@exept.de>
parents: 17184
diff changeset
  2708
    }
6a19e6954c3e class: Character
Claus Gittinger <cg@exept.de>
parents: 17184
diff changeset
  2709
    if (val <= MAX_IMMEDIATE_CHARACTER) {
18215
5940d5eff81b java preps
Claus Gittinger <cg@exept.de>
parents: 17440
diff changeset
  2710
	RETURN (__MKCHARACTER(val)) ;
17188
6a19e6954c3e class: Character
Claus Gittinger <cg@exept.de>
parents: 17184
diff changeset
  2711
    }
6a19e6954c3e class: Character
Claus Gittinger <cg@exept.de>
parents: 17184
diff changeset
  2712
    RETURN (__MKUCHARACTER(val)) ;
6a19e6954c3e class: Character
Claus Gittinger <cg@exept.de>
parents: 17184
diff changeset
  2713
%}
6a19e6954c3e class: Character
Claus Gittinger <cg@exept.de>
parents: 17184
diff changeset
  2714
6a19e6954c3e class: Character
Claus Gittinger <cg@exept.de>
parents: 17184
diff changeset
  2715
    "
6a19e6954c3e class: Character
Claus Gittinger <cg@exept.de>
parents: 17184
diff changeset
  2716
     $e asNonDiacritical
21673
8e83b97e0909 #TUNING by stefan
Stefan Vogel <sv@exept.de>
parents: 21535
diff changeset
  2717
     $é asNonDiacritical
8e83b97e0909 #TUNING by stefan
Stefan Vogel <sv@exept.de>
parents: 21535
diff changeset
  2718
     $ä asNonDiacritical
8e83b97e0909 #TUNING by stefan
Stefan Vogel <sv@exept.de>
parents: 21535
diff changeset
  2719
     $Ã¥ asNonDiacritical
17188
6a19e6954c3e class: Character
Claus Gittinger <cg@exept.de>
parents: 17184
diff changeset
  2720
    "
6a19e6954c3e class: Character
Claus Gittinger <cg@exept.de>
parents: 17184
diff changeset
  2721
!
6a19e6954c3e class: Character
Claus Gittinger <cg@exept.de>
parents: 17184
diff changeset
  2722
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2723
isNationalAlphaNumeric
18268
70af591a198c class: Character
Stefan Vogel <sv@exept.de>
parents: 18240
diff changeset
  2724
    "return true, if the receiver is a letter or digit.
70af591a198c class: Character
Stefan Vogel <sv@exept.de>
parents: 18240
diff changeset
  2725
     This assumes unicode encoding."
70af591a198c class: Character
Stefan Vogel <sv@exept.de>
parents: 18240
diff changeset
  2726
70af591a198c class: Character
Stefan Vogel <sv@exept.de>
parents: 18240
diff changeset
  2727
    ^ self isNationalLetter or:[self isNationalDigit]
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2728
!
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2729
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2730
isNationalDigit
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2731
    "return true, if the receiver is a digit.
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2732
     This assumes unicode encoding.
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2733
     WARNING: this method is not complete."
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2734
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2735
    |codePoint "{ Class SmallInteger }"|
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2736
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2737
    codePoint := asciivalue.
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2738
9153
197d00a1e50a *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 9148
diff changeset
  2739
    codePoint <= 16rFF ifTrue:[                "/ u00xx - unicode latin1 page
18298
541b114e644a schteam
Claus Gittinger <cg@exept.de>
parents: 18268
diff changeset
  2740
	^ codePoint between:$0 codePoint and:$9 codePoint.
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2741
    ].
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2742
18268
70af591a198c class: Character
Stefan Vogel <sv@exept.de>
parents: 18240
diff changeset
  2743
    ^ codePoint between:16rFF10 and:16rFF19
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2744
!
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2745
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2746
isNationalLetter
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2747
    "return true, if the receiver is a letter.
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2748
     CAVEAT:
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2749
	for now, this method is only correct for unicode characters up to u+1d6ff (Unicode3.1).
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2750
	(which is more than mozilla does, btw. ;-)"
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2751
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2752
%{  /* NOCONTEXT */
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2753
14684
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2754
    REGISTER INT val;
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2755
dfe58a6be944 64bit fix;
Claus Gittinger <cg@exept.de>
parents: 14663
diff changeset
  2756
    /* because used so often, this is open coded, instead of table driven */
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2757
    val = __intVal(__INST(asciivalue));
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2758
    switch (val >> 8) {
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2759
	case 0x00:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2760
	    if ((unsigned INT)(val - 'A') <= ('Z' - 'A')) {
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2761
		RETURN ( true );
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2762
	    }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2763
	    if ((unsigned INT)(val - 'a') <= ('z' - 'a')) {
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2764
		RETURN ( true );
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2765
	    }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2766
	    if (val == 0xAA) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2767
	    if (val == 0xB5) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2768
	    if (val == 0xBA) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2769
	    if (val < 0xC0) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2770
	    if (val == 0xD7) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2771
	    if (val == 0xF7) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2772
	    RETURN (true);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2773
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2774
	case 0x01:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2775
	    RETURN (true);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2776
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2777
	case 0x02:
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2778
#ifdef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2779
	    if (val <= 0x2B8) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2780
	    if (val == 0x2B9) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2781
	    if (val == 0x2BA) { RETURN (false); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2782
#else
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2783
	    if (val <= 0x2BA) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2784
#endif
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2785
	    if (val <= 0x2C1) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2786
#ifndef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2787
	    if (val <= 0x2C5) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2788
	    if (val <= 0x2CF) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2789
#endif
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2790
	    if (val == 0x2D0) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2791
	    if (val == 0x2D1) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2792
	    if (val <= 0x2DF) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2793
	    if (val <= 0x2E4) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2794
	    if (val == 0x2EE) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2795
	    RETURN (false);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2796
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2797
	case 0x03:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2798
	    if (val == 0x37A) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2799
	    if (val <= 0x385) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2800
	    if (val == 0x387) { RETURN (false); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2801
#ifndef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2802
	    if (val == 0x3F6) { RETURN (false); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2803
#endif
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2804
	    RETURN (true);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2805
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2806
	case 0x04:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2807
	    if (val <= 0x481) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2808
	    if (val <= 0x486) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2809
	    if (val == 0x487) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2810
#ifdef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2811
	    if (val <= 0x48A) { RETURN (false); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2812
#else
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2813
	    if (val <= 0x489) { RETURN (false); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2814
#endif
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2815
	    RETURN (true);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2816
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2817
	case 0x05:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2818
	    if (val <= 0x50f) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2819
	    if (val <= 0x530) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2820
	    if (val <= 0x556) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2821
	    if (val <= 0x558) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2822
	    if (val <= 0x559) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2823
	    if (val <= 0x55F) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2824
	    if (val <= 0x587) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2825
	    if (val <= 0x5cf) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2826
	    if (val <= 0x5f2) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2827
	    RETURN (false);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2828
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2829
	case 0x06:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2830
	    if (val <= 0x620) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2831
	    if (val <= 0x64A) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2832
	    if (val <= 0x66D) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2833
	    if (val == 0x670) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2834
	    if (val <= 0x6D3) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2835
	    if (val == 0x6D5) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2836
	    if (val == 0x6E5) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2837
	    if (val == 0x6E6) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2838
	    if (val == 0x6EE) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2839
	    if (val == 0x6EF) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2840
	    if (val == 0x6FA) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2841
	    if (val == 0x6FB) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2842
	    if (val == 0x6FC) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2843
#ifndef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2844
	    if (val == 0x6FF) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2845
#endif
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2846
	    RETURN (false);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2847
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2848
	case 0x07:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2849
	    if (val <= 0x70F) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2850
	    if (val == 0x711) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2851
	    if (val <= 0x72F) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2852
#ifdef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2853
	    if (val <= 0x74d) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2854
	    if (val <= 0x74e) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2855
#else
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2856
	    if (val <= 0x74c) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2857
	    if (val <= 0x74f) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2858
#endif
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2859
	    if (val <= 0x77F) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2860
	    if (val <= 0x7a5) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2861
	    if (val <= 0x7af) { RETURN (false); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2862
#ifndef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2863
	    if (val == 0x7B1) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2864
#endif
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2865
	    RETURN (false);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2866
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2867
	case 0x09:
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2868
#ifdef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2869
	    if (val <= 0x904) { RETURN (false); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2870
#else
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2871
	    if (val <= 0x903) { RETURN (false); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2872
#endif
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2873
	    if (val <= 0x93B) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2874
	    if (val == 0x93D) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2875
	    if (val == 0x950) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2876
	    if (val <= 0x957) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2877
	    if (val <= 0x961) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2878
	    if (val <= 0x984) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2879
	    if (val <= 0x9BB) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2880
#ifndef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2881
	    if (val == 0x9BD) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2882
#endif
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2883
	    if (val <= 0x9DB) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2884
	    if (val <= 0x9E1) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2885
	    if (val <= 0x9EF) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2886
	    if (val <= 0x9F1) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2887
	    RETURN (false);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2888
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2889
	case 0x0A:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2890
	    if (val <= 0xa04) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2891
	    if (val <= 0xa3B) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2892
	    if (val <= 0xa58) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2893
	    if (val <= 0xa65) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2894
	    if (val <= 0xa71) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2895
	    if (val <= 0xa80) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2896
	    if (val <= 0xa84) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2897
	    if (val <= 0xaBB) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2898
	    if (val == 0xaBD) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2899
	    if (val <= 0xaCF) { RETURN (false); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2900
#ifndef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2901
	    if (val == 0xAE2) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2902
	    if (val == 0xAE3) { RETURN (false); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2903
#endif
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2904
	    if (val <= 0xaE5) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2905
	    RETURN (false);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2906
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2907
	case 0x0B:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2908
	    if (val <= 0xB04) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2909
	    if (val <= 0xb3B) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2910
	    if (val == 0xb3d) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2911
	    if (val <= 0xb5B) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2912
	    if (val <= 0xb65) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2913
#ifndef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2914
	    if (val == 0xB71) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2915
	    if (val == 0xB83) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2916
#endif
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2917
	    if (val <= 0xb84) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2918
	    if (val <= 0xbBB) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2919
	    RETURN (false);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2920
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2921
	case 0x0c:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2922
	    if (val <= 0xc04) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2923
	    if (val <= 0xc3d) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2924
	    if (val <= 0xc5f) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2925
	    if (val <= 0xc65) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2926
	    if (val <= 0xc84) { RETURN (false); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2927
#ifndef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2928
	    if (val == 0xcbc) { RETURN (false); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2929
#endif
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2930
	    if (val <= 0xcbd) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2931
	    if (val <= 0xcdc) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2932
	    if (val <= 0xce5) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2933
	    RETURN (false);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2934
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2935
	case 0x0d:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2936
	    if (val <= 0xd04) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2937
	    if (val <= 0xd3d) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2938
	    if (val <= 0xd5f) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2939
	    if (val <= 0xd65) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2940
	    if (val <= 0xd84) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2941
	    if (val <= 0xdc9) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2942
	    RETURN (false);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2943
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2944
	case 0x0E:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2945
	    if (val == 0xE31) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2946
	    if (val <= 0xE33) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2947
	    if (val <= 0xE3F) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2948
	    if (val <= 0xE46) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2949
	    if (val <= 0xE7f) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2950
	    if (val <= 0xEb0) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2951
	    if (val == 0xEb1) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2952
	    if (val <= 0xEb3) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2953
	    if (val <= 0xEbc) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2954
	    if (val <= 0xEc7) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2955
	    if (val <= 0xEdb) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2956
	    RETURN (true);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2957
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2958
	case 0x0F:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2959
	    if (val == 0xf00) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2960
	    if (val <= 0xf3F) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2961
	    if (val <= 0xf70) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2962
	    if (val <= 0xf87) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2963
	    if (val <= 0xf8f) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2964
	    RETURN (false);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2965
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2966
	case 0x10:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2967
	    if (val <= 0x102b) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2968
	    if (val <= 0x104f) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2969
	    if (val <= 0x1055) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2970
	    if (val <= 0x109f) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2971
	    if (val <= 0x10fa) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2972
	    RETURN (false);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2973
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2974
	case 0x11:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2975
	case 0x12:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2976
	    RETURN (true);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2977
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2978
	case 0x13:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2979
	    if (val <= 0x1360) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2980
	    if (val <= 0x139f) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2981
	    RETURN (true);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2982
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2983
	case 0x14:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2984
	case 0x15:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2985
	    RETURN (true);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2986
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2987
	case 0x16:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2988
	    if (val == 0x166d) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2989
	    if (val == 0x166e) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2990
	    if (val == 0x1680) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2991
	    if (val == 0x169b) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2992
	    if (val == 0x169c) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2993
	    if (val <= 0x16ea) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2994
	    RETURN (false);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2995
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2996
	case 0x17:
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  2997
#ifndef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2998
	    if (val == 0x1712) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  2999
	    if (val == 0x1713) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3000
	    if (val == 0x1714) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3001
	    if (val == 0x1732) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3002
	    if (val == 0x1733) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3003
	    if (val == 0x1734) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3004
	    if (val == 0x1735) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3005
	    if (val == 0x1736) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3006
	    if (val == 0x1752) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3007
	    if (val == 0x1753) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3008
	    if (val == 0x1772) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3009
	    if (val == 0x1773) { RETURN (false); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3010
#endif
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3011
	    if (val <= 0x17b3) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3012
#ifndef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3013
	    if (val == 0x17D7) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3014
	    if (val == 0x17DC) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3015
#endif
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3016
	    RETURN (false);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3017
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3018
	case 0x18:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3019
	    if (val <= 0x181f) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3020
	    if (val <= 0x18a8) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3021
	    RETURN (false);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3022
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3023
	case 0x19:
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3024
#ifndef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3025
	    if (val <= 0x191F) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3026
	    if (val <= 0x194F) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3027
	    if (val <= 0x197F) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3028
#endif
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3029
	    RETURN (false);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3030
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3031
	case 0x1d:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3032
	    if (val <= 0x1d6B) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3033
	    RETURN (false);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3034
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3035
	case 0x1e:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3036
	    RETURN (true);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3037
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3038
	case 0x1f:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3039
	    if (val <= 0x1fbc) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3040
	    if (val == 0x1fbe) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3041
	    if (val <= 0x1fc1) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3042
	    if (val <= 0x1fcc) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3043
	    if (val <= 0x1fcf) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3044
	    if (val <= 0x1fdc) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3045
	    if (val <= 0x1fdf) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3046
	    if (val <= 0x1fec) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3047
	    if (val <= 0x1ff1) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3048
	    if (val <= 0x1ffc) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3049
	    RETURN (false);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3050
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3051
	case 0x20:
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3052
#ifndef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3053
	    if (val == 0x2071) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3054
#endif
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3055
	    if (val == 0x207f) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3056
	    RETURN (false);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3057
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3058
	case 0x21:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3059
	    if (val == 0x2102) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3060
	    if (val == 0x2107) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3061
	    if (val <= 0x2109) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3062
	    if (val <= 0x2113) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3063
	    if (val == 0x2115) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3064
	    if (val <= 0x2118) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3065
	    if (val <= 0x211d) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3066
	    if (val <= 0x2123) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3067
	    if (val == 0x2125) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3068
	    if (val == 0x2127) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3069
	    if (val == 0x2129) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3070
	    if (val == 0x212E) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3071
	    if (val == 0x2132) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3072
	    if (val == 0x213A) { RETURN (false); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3073
#ifndef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3074
	    if (val == 0x213B) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3075
	    if (val <= 0x213F) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3076
	    if (val <= 0x2144) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3077
	    if (val == 0x214A) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3078
	    if (val == 0x214B) { RETURN (false); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3079
#endif
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3080
	    if (val <= 0x2152) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3081
	    RETURN (false);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3082
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3083
	case 0x30:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3084
	    if (val == 0x3005) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3085
	    if (val == 0x3006) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3086
	    if (val <= 0x3030) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3087
	    if (val <= 0x3035) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3088
#ifndef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3089
	    if (val == 0x303B) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3090
	    if (val == 0x303C) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3091
#endif
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3092
	    if (val <= 0x3040) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3093
	    if (val <= 0x3098) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3094
	    if (val <= 0x309c) { RETURN (false); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3095
#ifndef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3096
	    if (val == 0x30A0) { RETURN (false); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3097
#endif
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3098
	    if (val == 0x30Fb) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3099
	    RETURN ((true));
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3100
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3101
	case 0x31:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3102
	    if (val <= 0x318f) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3103
	    if (val <= 0x319F) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3104
	    RETURN ((true));
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3105
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3106
	case 0x34:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3107
	    RETURN ((true));
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3108
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3109
	case 0x4d:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3110
	    if (val <= 0x4DB4) { RETURN (false); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3111
#ifndef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3112
	    if (val <= 0x4DBF) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3113
	    RETURN (false);
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3114
#else
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3115
	    RETURN (true);
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3116
#endif
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3117
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3118
	case 0x4e:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3119
	    RETURN ((true));
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3120
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3121
	case 0x9f:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3122
	    if (val <= 0x9fa4) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3123
	    RETURN (true);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3124
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3125
	case 0xA0:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3126
	case 0xA1:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3127
	case 0xA2:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3128
	case 0xA3:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3129
	    RETURN (true);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3130
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3131
	case 0xA4:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3132
	    if (val <= 0xa48f) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3133
	    RETURN (false);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3134
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3135
	case 0xA5:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3136
	    RETURN (true);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3137
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3138
	case 0xAC:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3139
	    RETURN (true);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3140
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3141
	case 0xD7:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3142
	    RETURN (true);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3143
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3144
	case 0xF9:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3145
	case 0xFA:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3146
	    RETURN (true);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3147
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3148
	case 0xFB:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3149
	    if (val == 0xfb1e) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3150
	    if (val == 0xfb29) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3151
	    RETURN (true);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3152
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3153
	case 0xFC:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3154
	    RETURN (true);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3155
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3156
	case 0xFD:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3157
	    if (val <= 0xFD3d) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3158
	    if (val <= 0xFD4F) { RETURN (false); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3159
#ifndef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3160
	    if (val == 0xFDFC) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3161
	    if (val == 0xFDFD) { RETURN (false); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3162
#endif
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3163
	    RETURN (true);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3164
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3165
	case 0xFE:
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3166
#ifndef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3167
	    if (val <= 0xFE0F) { RETURN (false); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3168
#endif
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3169
	    if (val <= 0xFE1f) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3170
	    if (val <= 0xFE6F) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3171
	    if (val <= 0xFEFE) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3172
	    RETURN (false);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3173
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3174
	case 0xFF:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3175
	    if (val <= 0xFF20) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3176
	    if (val <= 0xFF3a) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3177
	    if (val <= 0xFF40) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3178
	    if (val <= 0xFF5a) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3179
	    if (val <= 0xFF65) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3180
	    if (val <= 0xFFdC) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3181
	    RETURN (false);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3182
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3183
	case 0x100:
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3184
#ifndef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3185
	    RETURN (true);
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3186
#else
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3187
	    RETURN (false);
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3188
#endif
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3189
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3190
	case 0x103:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3191
	    if (val <= 0x1031f) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3192
	    if (val <= 0x1032F) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3193
	    if (val <= 0x10349) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3194
#ifndef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3195
	    if (val <= 0x1037F) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3196
	    if (val <= 0x1039E) { RETURN (true); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3197
#endif
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3198
	    RETURN (false);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3199
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3200
	case 0x104:
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3201
#ifndef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3202
	    if (val <= 0x1049F) { RETURN (true); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3203
	    if (val <= 0x104aF) { RETURN (false); }
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3204
#endif
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3205
	    RETURN (true);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3206
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3207
	case 0x108:
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3208
#ifndef UNICODE_3_2
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3209
	    RETURN (true);
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3210
#else
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3211
	    RETURN (false);
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3212
#endif
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3213
20706
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3214
	case 0x1D4:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3215
	case 0x1D5:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3216
	    RETURN (true);
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3217
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3218
	case 0x1D6:
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3219
	    if (val == 0x1d6c1) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3220
	    if (val == 0x1d6db) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3221
	    if (val == 0x1d6fb) { RETURN (false); }
009b8269bd08 we have class based exceptions - use them
Claus Gittinger <cg@exept.de>
parents: 20698
diff changeset
  3222
	    RETURN (true);
8030
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3223
    }
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3224
    RETURN (false);
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3225
%}
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3226
! !
5a4323d0280f updated to Unicode4.0.0 spec
Claus Gittinger <cg@exept.de>
parents: 8029
diff changeset
  3227
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  3228
!Character methodsFor:'tracing'!
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  3229
4682
4158042a9c8c *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 4655
diff changeset
  3230
traceInto:aRequestor level:level from:referrer
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  3231
    "double dispatch into tracer, passing my type implicitely in the selector"
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  3232
4682
4158042a9c8c *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 4655
diff changeset
  3233
    ^ aRequestor traceCharacter:self level:level from:referrer
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  3234
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  3235
! !
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  3236
8394
da194de43766 Generalize visitor pattern and define #visit...:with: -methods instead
Stefan Vogel <sv@exept.de>
parents: 8308
diff changeset
  3237
!Character methodsFor:'visiting'!
da194de43766 Generalize visitor pattern and define #visit...:with: -methods instead
Stefan Vogel <sv@exept.de>
parents: 8308
diff changeset
  3238
da194de43766 Generalize visitor pattern and define #visit...:with: -methods instead
Stefan Vogel <sv@exept.de>
parents: 8308
diff changeset
  3239
acceptVisitor:aVisitor with:aParameter
16727
75e88a387e62 comment/format only
Claus Gittinger <cg@exept.de>
parents: 16225
diff changeset
  3240
    "dispatch for visitor pattern; send #visitCharacter:with: to aVisitor"
8394
da194de43766 Generalize visitor pattern and define #visit...:with: -methods instead
Stefan Vogel <sv@exept.de>
parents: 8308
diff changeset
  3241
da194de43766 Generalize visitor pattern and define #visit...:with: -methods instead
Stefan Vogel <sv@exept.de>
parents: 8308
diff changeset
  3242
    ^ aVisitor visitCharacter:self with:aParameter
da194de43766 Generalize visitor pattern and define #visit...:with: -methods instead
Stefan Vogel <sv@exept.de>
parents: 8308
diff changeset
  3243
! !
da194de43766 Generalize visitor pattern and define #visit...:with: -methods instead
Stefan Vogel <sv@exept.de>
parents: 8308
diff changeset
  3244
2124
6238280f6120 new infoMessage scheme
Claus Gittinger <cg@exept.de>
parents: 1491
diff changeset
  3245
!Character class methodsFor:'documentation'!
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  3246
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  3247
version
18588
ed9d4665224c class: Character
Stefan Vogel <sv@exept.de>
parents: 18298
diff changeset
  3248
    ^ '$Header$'
14120
fdf215af772c added: #displayOn: (instead of #displaySting)
Stefan Vogel <sv@exept.de>
parents: 14117
diff changeset
  3249
!
fdf215af772c added: #displayOn: (instead of #displaySting)
Stefan Vogel <sv@exept.de>
parents: 14117
diff changeset
  3250
fdf215af772c added: #displayOn: (instead of #displaySting)
Stefan Vogel <sv@exept.de>
parents: 14117
diff changeset
  3251
version_CVS
18588
ed9d4665224c class: Character
Stefan Vogel <sv@exept.de>
parents: 18298
diff changeset
  3252
    ^ '$Header$'
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  3253
! !
21374
f7820456972c #FEATURE by stefan
Stefan Vogel <sv@exept.de>
parents: 20706
diff changeset
  3254