Character.st
author Claus Gittinger <cg@exept.de>
Thu, 19 Feb 2004 15:13:00 +0100
changeset 7970 2ae69eb663eb
parent 7951 062ac899852e
child 7976 1ce9243a8601
permissions -rw-r--r--
*** empty log message ***
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
     1
"
5
67342904af11 *** empty log message ***
claus
parents: 3
diff changeset
     2
 COPYRIGHT (c) 1988 by Claus Gittinger
154
d4236ec280a6 *** empty log message ***
claus
parents: 92
diff changeset
     3
	      All Rights Reserved
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
     4
a27a279701f8 Initial revision
claus
parents:
diff changeset
     5
 This software is furnished under a license and may be used
a27a279701f8 Initial revision
claus
parents:
diff changeset
     6
 only in accordance with the terms of that license and with the
a27a279701f8 Initial revision
claus
parents:
diff changeset
     7
 inclusion of the above copyright notice.   This software may not
a27a279701f8 Initial revision
claus
parents:
diff changeset
     8
 be provided or otherwise made available to, or used by, any
a27a279701f8 Initial revision
claus
parents:
diff changeset
     9
 other person.  No title to or ownership of the software is
a27a279701f8 Initial revision
claus
parents:
diff changeset
    10
 hereby transferred.
a27a279701f8 Initial revision
claus
parents:
diff changeset
    11
"
a27a279701f8 Initial revision
claus
parents:
diff changeset
    12
5407
d6729266a95b *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 5315
diff changeset
    13
"{ Package: 'stx:libbasic' }"
d6729266a95b *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 5315
diff changeset
    14
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
    15
Magnitude subclass:#Character
995
b018368b3a94 asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents: 819
diff changeset
    16
	instanceVariableNames:'asciivalue'
b018368b3a94 asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents: 819
diff changeset
    17
	classVariableNames:'CharacterTable'
b018368b3a94 asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents: 819
diff changeset
    18
	poolDictionaries:''
b018368b3a94 asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents: 819
diff changeset
    19
	category:'Magnitude-General'
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
    20
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
    21
2124
6238280f6120 new infoMessage scheme
Claus Gittinger <cg@exept.de>
parents: 1491
diff changeset
    22
!Character class methodsFor:'documentation'!
54
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
    23
88
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    24
copyright
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    25
"
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    26
 COPYRIGHT (c) 1988 by Claus Gittinger
154
d4236ec280a6 *** empty log message ***
claus
parents: 92
diff changeset
    27
	      All Rights Reserved
88
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    28
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    29
 This software is furnished under a license and may be used
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    30
 only in accordance with the terms of that license and with the
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    31
 inclusion of the above copyright notice.   This software may not
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    32
 be provided or otherwise made available to, or used by, any
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    33
 other person.  No title to or ownership of the software is
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    34
 hereby transferred.
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    35
"
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    36
!
81dacba7a63a *** empty log message ***
claus
parents: 68
diff changeset
    37
54
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
    38
documentation
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
    39
"
1491
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
    40
    This class represents characters.
7897
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
    41
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
    42
    For now, unicode characters can be created and also put-into/taken from
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
    43
    Unicode strings, but the attribute queries (such as isLetter, isDigit etc. are
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
    44
    not yet unicode aware).
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
    45
1491
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
    46
    Notice, that actual character objects are not used when characters
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
    47
    are stored in strings, symbols or twoByteStrings; these only store
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
    48
    a characters asciiValue for a more compact representation.
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
    49
    The word 'asciiValue' is a historic leftover - actually, any integer
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
    50
    code is allowed (i.e. characters are not limited to 8bit).
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
    51
1229
ace799c5765e commentary
Claus Gittinger <cg@exept.de>
parents: 1138
diff changeset
    52
    Single byte Characters are unique; 
1491
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
    53
    i.e. for every asciiValue (0..255) there exists exactly one instance of 
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
    54
    Character, which is shared (Character value:xxx checks for this, and returns 
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
    55
    a reference to an existing instance).
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
    56
    Other characters (i.e. asciivalue > 255) are not shared; i.e. these
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
    57
    are created as required.
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
    58
1229
ace799c5765e commentary
Claus Gittinger <cg@exept.de>
parents: 1138
diff changeset
    59
    This means: you may compare characters using #== ONLY IFF you are certain,
357
claus
parents: 345
diff changeset
    60
    that the characters ranges is 0..255. Otherwise, you HAVE TO compare
claus
parents: 345
diff changeset
    61
    using #=. (if in doubt, always compare using #=).
1491
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
    62
    Sorry for this inconvenience, but it is (practically) impossible to keep
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
    63
    the possible maximum of 2^32 characters (Unicode) around, for that
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
    64
    convenience alone.
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
    65
357
claus
parents: 345
diff changeset
    66
68
59faa75185ba *** empty log message ***
claus
parents: 54
diff changeset
    67
    Methods marked as (JS) come from the manchester Character goody
59faa75185ba *** empty log message ***
claus
parents: 54
diff changeset
    68
    (CharacterComparing) by Jan Steinman, which allow Characters to be used as
1229
ace799c5765e commentary
Claus Gittinger <cg@exept.de>
parents: 1138
diff changeset
    69
    Interval elements (i.e. ($a to:$z) do:[...] ); 
ace799c5765e commentary
Claus Gittinger <cg@exept.de>
parents: 1138
diff changeset
    70
    They are not a big deal, but convenient add-ons.
ace799c5765e commentary
Claus Gittinger <cg@exept.de>
parents: 1138
diff changeset
    71
    Some of these have been modified a bit.
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
    72
68
59faa75185ba *** empty log message ***
claus
parents: 54
diff changeset
    73
    WARNING: characters are known by compiler and runtime system -
7897
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
    74
             do not change the instance layout. 
357
claus
parents: 345
diff changeset
    75
claus
parents: 345
diff changeset
    76
    Also, although you can create subclasses of Character, the compiler always
claus
parents: 345
diff changeset
    77
    creates instances of Character for literals ...
814
d4d28ca7afcd made the global CharacterTable a classVar of Character
Claus Gittinger <cg@exept.de>
parents: 760
diff changeset
    78
    ... and other classes are hard-wired to always return instances of characters
357
claus
parents: 345
diff changeset
    79
    in some cases (i.e. String>>at:, Symbol>>at: etc.).
claus
parents: 345
diff changeset
    80
    Therefore, it may not make sense to create a character-subclass.
1295
83f594f05c52 documentation
Claus Gittinger <cg@exept.de>
parents: 1271
diff changeset
    81
83f594f05c52 documentation
Claus Gittinger <cg@exept.de>
parents: 1271
diff changeset
    82
    [author:]
7897
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
    83
        Claus Gittinger
1295
83f594f05c52 documentation
Claus Gittinger <cg@exept.de>
parents: 1271
diff changeset
    84
83f594f05c52 documentation
Claus Gittinger <cg@exept.de>
parents: 1271
diff changeset
    85
    [see also:]
7897
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
    86
        String TwoByteString
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
    87
        StringCollection
54
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
    88
"
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
    89
! !
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
    90
2124
6238280f6120 new infoMessage scheme
Claus Gittinger <cg@exept.de>
parents: 1491
diff changeset
    91
!Character class methodsFor:'instance creation'!
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
    92
a27a279701f8 Initial revision
claus
parents:
diff changeset
    93
basicNew
a27a279701f8 Initial revision
claus
parents:
diff changeset
    94
    "catch new - Characters cannot be created with new"
a27a279701f8 Initial revision
claus
parents:
diff changeset
    95
a27a279701f8 Initial revision
claus
parents:
diff changeset
    96
    ^ self error:'Characters cannot be created with new'
a27a279701f8 Initial revision
claus
parents:
diff changeset
    97
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
    98
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
    99
digitValue:anInteger
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   100
    "return a character that corresponds to anInteger.
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   101
     0-9 map to $0-$9, 10-35 map to $A-$Z"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   102
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   103
    |val "{ Class: SmallInteger }" |
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   104
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   105
    val := anInteger.
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   106
    (val between:0 and:9) ifTrue:[
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   107
	^ Character value:(val + ($0 asciiValue))
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   108
    ].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   109
    (val between:10 and:35) ifTrue:[
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   110
	^ Character value:(val + ($A asciiValue - 10))
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   111
    ].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   112
    ^self error:'value not in range 0 to 35'
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   113
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   114
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   115
utf8DecodeFrom:aStream
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   116
    "read and return a single unicode character from an UTF8 encoded stream"
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   117
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   118
    |fetchNext c1 c2 c3 c4 c5 ascii|
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   119
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   120
    c1 := aStream next.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   121
    ascii := c1 asciiValue.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   122
    ascii <= 16r7F ifTrue:[
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   123
        "/ 0xxxxxxx - 7 bits   
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   124
        ^ c1.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   125
    ].
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   126
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   127
    (ascii bitAnd:2r11000000) == 2r10000000 ifTrue:[
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   128
        "/ out of sync (got an intermediate character)
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   129
        self error:'out of sync'.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   130
        ^ c1.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   131
    ].
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   132
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   133
    fetchNext := [  |ch|
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   134
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   135
                    ch := aStream next.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   136
                    (ch asciiValue bitAnd:2r11000000) == 2r10000000 ifFalse:[
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   137
                        "/ followup chars must have 2r10 in high bits
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   138
                        self error:'bad encoding'.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   139
                    ].
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   140
                    ch
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   141
                 ].
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   142
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   143
    (ascii bitAnd:2r11100000) == 2r11000000 ifTrue:[
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   144
        "/ 110xxxxx 10xxxxxx - 11 bits
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   145
        c2 := fetchNext value.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   146
        ascii := c1 asciiValue bitAnd:16r1F.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   147
        ascii := (ascii bitShift:6) bitOr:(c2 asciiValue bitAnd:16r3F).
6811
2fec3f732ac6 catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents: 6808
diff changeset
   148
        ascii <= 16r7F ifTrue:[
2fec3f732ac6 catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents: 6808
diff changeset
   149
            self error:'invalid encoding'.
2fec3f732ac6 catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents: 6808
diff changeset
   150
        ].
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   151
        ^ Character value:ascii        
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   152
    ].
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   153
    (ascii bitAnd:2r11110000) == 2r11100000 ifTrue:[
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   154
        "/ 1110xxxx 10xxxxxx 10xxxxxx - 16 bits
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   155
        c2 := fetchNext value.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   156
        c3 := fetchNext value.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   157
        ascii := c1 asciiValue bitAnd:16r0F.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   158
        ascii := (ascii bitShift:6) bitOr:(c2 asciiValue bitAnd:16r3F).
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   159
        ascii := (ascii bitShift:6) bitOr:(c3 asciiValue bitAnd:16r3F).
6811
2fec3f732ac6 catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents: 6808
diff changeset
   160
        ascii <= 16r7FF ifTrue:[
2fec3f732ac6 catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents: 6808
diff changeset
   161
            self error:'invalid encoding'.
2fec3f732ac6 catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents: 6808
diff changeset
   162
        ].
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   163
        ^ Character value:ascii        
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   164
    ].
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   165
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   166
    "/ notice: currently, characters can only have 16bit encoding;
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   167
    "/ therefore the following will raise a runtime exception,
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   168
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   169
    (ascii bitAnd:2r11111000) == 2r11110000 ifTrue:[
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   170
        "/ 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx - 21 bits
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   171
        c2 := fetchNext value.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   172
        c3 := fetchNext value.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   173
        c4 := fetchNext value.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   174
        ascii := c1 asciiValue bitAnd:16r07.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   175
        ascii := (ascii bitShift:6) bitOr:(c2 asciiValue bitAnd:16r3F).
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   176
        ascii := (ascii bitShift:6) bitOr:(c3 asciiValue bitAnd:16r3F).
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   177
        ascii := (ascii bitShift:6) bitOr:(c4 asciiValue bitAnd:16r3F).
6811
2fec3f732ac6 catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents: 6808
diff changeset
   178
        ascii <= 16rFFFF ifTrue:[
2fec3f732ac6 catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents: 6808
diff changeset
   179
            self error:'invalid encoding'.
2fec3f732ac6 catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents: 6808
diff changeset
   180
        ].
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   181
        ^ Character value:ascii        
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   182
    ].
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   183
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   184
    (ascii bitAnd:2r11111100) == 2r11111000 ifTrue:[
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   185
        "/ 111110xx 10xxxxxx 10xxxxxx 10xxxxxx 10xxxxxx - 26 bits
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   186
        c2 := fetchNext value.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   187
        c3 := fetchNext value.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   188
        c4 := fetchNext value.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   189
        c5 := fetchNext value.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   190
        ascii := c1 asciiValue bitAnd:16r03.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   191
        ascii := (ascii bitShift:6) bitOr:(c2 asciiValue bitAnd:16r3F).
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   192
        ascii := (ascii bitShift:6) bitOr:(c3 asciiValue bitAnd:16r3F).
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   193
        ascii := (ascii bitShift:6) bitOr:(c4 asciiValue bitAnd:16r3F).
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   194
        ascii := (ascii bitShift:6) bitOr:(c5 asciiValue bitAnd:16r3F).
6811
2fec3f732ac6 catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents: 6808
diff changeset
   195
        ascii <= 16r1FFFFF ifTrue:[
2fec3f732ac6 catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents: 6808
diff changeset
   196
            self error:'invalid encoding'.
2fec3f732ac6 catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents: 6808
diff changeset
   197
        ].
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   198
        ^ Character value:ascii        
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   199
    ].
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   200
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   201
    (ascii bitAnd:2r11111110) == 2r11111100 ifTrue:[
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   202
        "/ 1111110x ... 10xxxxxx - any number of bits
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   203
        ascii := c1 asciiValue bitAnd:16r01.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   204
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   205
        c2 := aStream peek.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   206
        [ (c2 asciiValue bitAnd:2r11000000) == 2r10000000 ] whileTrue:[
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   207
            ascii := (ascii bitShift:6) bitOr:(c2 asciiValue bitAnd:16r3F).
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   208
            aStream next.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   209
        ].
6811
2fec3f732ac6 catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents: 6808
diff changeset
   210
        ascii <= 16r3FFFFFF ifTrue:[
2fec3f732ac6 catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents: 6808
diff changeset
   211
            self error:'invalid encoding'.
2fec3f732ac6 catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents: 6808
diff changeset
   212
        ].
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   213
        ^ Character value:ascii        
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   214
    ].
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   215
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   216
    self error:'bad encoding'.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   217
    ^ c1
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   218
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   219
    "
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   220
      Character utf8DecodeFrom:'a' readStream     
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   221
      Character utf8DecodeFrom:#[195 188] asString readStream   
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   222
    "                                            
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   223
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   224
    "test:
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   225
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   226
      |utf8Encoding original readBack|
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   227
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   228
      1 to:16rFFFF do:[:ascii |
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   229
        original := Character value:ascii.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   230
        utf8Encoding := original asString utf8Encoded.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   231
        readBack := Character utf8DecodeFrom:(utf8Encoding readStream).
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   232
        readBack asciiValue = ascii ifFalse:[
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   233
            self halt
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   234
        ]
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   235
      ]
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   236
    "
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   237
!
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   238
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   239
value:anInteger
a27a279701f8 Initial revision
claus
parents:
diff changeset
   240
    "return a character with asciivalue anInteger"
a27a279701f8 Initial revision
claus
parents:
diff changeset
   241
a27a279701f8 Initial revision
claus
parents:
diff changeset
   242
%{  /* NOCONTEXT */
a27a279701f8 Initial revision
claus
parents:
diff changeset
   243
54
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
   244
    INT ascii;
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   245
249
claus
parents: 216
diff changeset
   246
    if (__isSmallInteger(anInteger)) {
7951
062ac899852e care for 4-byte characters
Claus Gittinger <cg@exept.de>
parents: 7897
diff changeset
   247
        ascii = __intVal(anInteger);
062ac899852e care for 4-byte characters
Claus Gittinger <cg@exept.de>
parents: 7897
diff changeset
   248
        if ((ascii & ~0xFF) == 0 /* (ascii >= 0) && (ascii <= 255) */)
062ac899852e care for 4-byte characters
Claus Gittinger <cg@exept.de>
parents: 7897
diff changeset
   249
            RETURN ( __MKCHARACTER(ascii) );
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   250
    }
819
f17cddc493f4 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 814
diff changeset
   251
%}.
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   252
    (anInteger between:0 and:16rFF) ifTrue:[
7951
062ac899852e care for 4-byte characters
Claus Gittinger <cg@exept.de>
parents: 7897
diff changeset
   253
        ^ CharacterTable at:(anInteger + 1)
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   254
    ].
7951
062ac899852e care for 4-byte characters
Claus Gittinger <cg@exept.de>
parents: 7897
diff changeset
   255
    (anInteger between:16r100 and:16r3FFFFFFF) ifTrue:[
062ac899852e care for 4-byte characters
Claus Gittinger <cg@exept.de>
parents: 7897
diff changeset
   256
        ^ super basicNew setAsciiValue:anInteger
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   257
    ].
357
claus
parents: 345
diff changeset
   258
    "
7951
062ac899852e care for 4-byte characters
Claus Gittinger <cg@exept.de>
parents: 7897
diff changeset
   259
     a characters ascii-code must be 0..16r3FFFFFFF.
062ac899852e care for 4-byte characters
Claus Gittinger <cg@exept.de>
parents: 7897
diff changeset
   260
     (i.e. only characters with up-to 30 bits are allowed.)
357
claus
parents: 345
diff changeset
   261
    "
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   262
    self error:'invalid ascii code for character'
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   263
! !
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   264
7300
461bb09c5954 category
Claus Gittinger <cg@exept.de>
parents: 7261
diff changeset
   265
!Character class methodsFor:'Compatibility-ANSI'!
5945
Claus Gittinger <cg@exept.de>
parents: 5847
diff changeset
   266
Claus Gittinger <cg@exept.de>
parents: 5847
diff changeset
   267
codePoint:asciiValue
Claus Gittinger <cg@exept.de>
parents: 5847
diff changeset
   268
    ^ self value:asciiValue
Claus Gittinger <cg@exept.de>
parents: 5847
diff changeset
   269
! !
Claus Gittinger <cg@exept.de>
parents: 5847
diff changeset
   270
2214
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   271
!Character class methodsFor:'accessing untypeable characters'!
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   272
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   273
endOfInput
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   274
	"Answer the Character representing ctrl-d ."
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   275
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   276
	^self value: 4
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   277
!
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   278
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   279
leftParenthesis
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   280
	"Answer the Character representing a left parenthesis."
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   281
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   282
	^self value: 40
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   283
!
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   284
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   285
period
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   286
	"Answer the Character representing a carriage period."
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   287
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   288
	^self value: 46
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   289
!
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   290
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   291
poundSign
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   292
	"Answer the Character representing a pound sign."
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   293
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   294
	^self value: 35
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   295
!
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   296
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   297
rightParenthesis
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   298
	"Answer the Character representing a right parenthesis."
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   299
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   300
	^self value: 41
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   301
! !
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   302
2124
6238280f6120 new infoMessage scheme
Claus Gittinger <cg@exept.de>
parents: 1491
diff changeset
   303
!Character class methodsFor:'constants'!
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   304
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   305
backspace
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   306
    "return the backspace character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   307
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   308
    ^ Character value:8
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   309
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   310
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   311
bell
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   312
    "return the bell character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   313
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   314
    ^ Character value:7
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   315
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   316
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   317
cr
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   318
    "return the lineEnd character 
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   319
     - actually (in unix) this is a newline character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   320
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   321
    ^ Character value:10
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   322
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   323
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   324
del 
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   325
    "return the delete character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   326
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   327
    ^ Character value:16r7F 
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   328
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   329
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   330
doubleQuote
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   331
    "return the double-quote character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   332
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   333
    ^ Character value:34
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   334
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   335
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   336
esc
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   337
    "return the escape character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   338
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   339
    ^ Character value:27
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   340
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   341
7688
8494a4155037 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7354
diff changeset
   342
euro
7689
3f4870a7f12e *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7688
diff changeset
   343
    "The Euro currency sign (notice: not all fonts support it).
3f4870a7f12e *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7688
diff changeset
   344
     Thats the ascii-character code; the Unicode encoding is 0x8364"
7688
8494a4155037 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7354
diff changeset
   345
8494a4155037 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7354
diff changeset
   346
    ^ Character value: 128
7689
3f4870a7f12e *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7688
diff changeset
   347
3f4870a7f12e *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7688
diff changeset
   348
    "
3f4870a7f12e *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7688
diff changeset
   349
     0 to:255 do:[:i |
3f4870a7f12e *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7688
diff changeset
   350
        Transcript 
3f4870a7f12e *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7688
diff changeset
   351
            show:'| '; show:((i printStringRadix:16) leftPaddedTo:2);
3f4870a7f12e *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7688
diff changeset
   352
            show:' | '; show:(i printStringPaddedTo:3);
3f4870a7f12e *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7688
diff changeset
   353
            show:' | '; show:(Character value:i);
3f4870a7f12e *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7688
diff changeset
   354
            cr.
3f4870a7f12e *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7688
diff changeset
   355
     ]
3f4870a7f12e *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7688
diff changeset
   356
    "
7688
8494a4155037 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7354
diff changeset
   357
!
8494a4155037 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7354
diff changeset
   358
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   359
excla
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   360
    "return the exclamation-mark character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   361
    ^ $!!
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   362
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
   363
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   364
ff
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   365
    "return the form-feed character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   366
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   367
    ^ Character value:12
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   368
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   369
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   370
lf
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   371
    "return the newline/linefeed character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   372
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   373
    ^ Character value:10
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   374
!
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   375
4340
523ef8410fad added #linefeed - squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4337
diff changeset
   376
linefeed
523ef8410fad added #linefeed - squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4337
diff changeset
   377
    "squeak compatibility: return the newline/linefeed character"
523ef8410fad added #linefeed - squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4337
diff changeset
   378
523ef8410fad added #linefeed - squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4337
diff changeset
   379
    ^ Character value:10
523ef8410fad added #linefeed - squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4337
diff changeset
   380
!
523ef8410fad added #linefeed - squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4337
diff changeset
   381
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   382
maxValue 
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   383
    "return the maximum asciiValue a character can have"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   384
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   385
    ^ 16rFFFF
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   386
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   387
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   388
newPage
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   389
    "return the form-feed character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   390
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   391
    ^ Character value:12
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   392
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   393
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   394
nl
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   395
    "return the newline character"
68
59faa75185ba *** empty log message ***
claus
parents: 54
diff changeset
   396
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   397
    ^ Character value:10
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   398
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   399
6324
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   400
null
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   401
    ^ Character value:0
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   402
!
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   403
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   404
quote
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   405
    "return the single-quote character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   406
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   407
    ^ Character value:39
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   408
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   409
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   410
return 
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   411
    "return the return character.
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   412
     In ST/X, this is different from cr - for Unix reasons."
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   413
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   414
    ^ Character value:13 
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   415
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   416
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   417
space
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   418
    "return the blank character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   419
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   420
    ^ Character value:32
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   421
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   422
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   423
tab
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   424
    "return the tabulator character"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   425
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   426
    ^ Character value:9
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   427
! !
a27a279701f8 Initial revision
claus
parents:
diff changeset
   428
2124
6238280f6120 new infoMessage scheme
Claus Gittinger <cg@exept.de>
parents: 1491
diff changeset
   429
!Character class methodsFor:'primitive input'!
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   430
a27a279701f8 Initial revision
claus
parents:
diff changeset
   431
fromUser
357
claus
parents: 345
diff changeset
   432
    "return a character from the keyboard (C's standard input stream)
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   433
     - this should only be used for emergency evaluators and the like."
a27a279701f8 Initial revision
claus
parents:
diff changeset
   434
a27a279701f8 Initial revision
claus
parents:
diff changeset
   435
%{  /* NOCONTEXT */
a27a279701f8 Initial revision
claus
parents:
diff changeset
   436
    int c;
a27a279701f8 Initial revision
claus
parents:
diff changeset
   437
a27a279701f8 Initial revision
claus
parents:
diff changeset
   438
    c = getchar();
a27a279701f8 Initial revision
claus
parents:
diff changeset
   439
    if (c < 0) {
5433
aa2365ee12cf migration support
Claus Gittinger <cg@exept.de>
parents: 5423
diff changeset
   440
        RETURN (nil);
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   441
    }
1133
961f2b095c22 underline cleanup
Claus Gittinger <cg@exept.de>
parents: 1007
diff changeset
   442
    RETURN ( __MKCHARACTER(c & 0xFF) );
5433
aa2365ee12cf migration support
Claus Gittinger <cg@exept.de>
parents: 5423
diff changeset
   443
%}.
aa2365ee12cf migration support
Claus Gittinger <cg@exept.de>
parents: 5423
diff changeset
   444
    ^ Stdin next
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   445
! !
a27a279701f8 Initial revision
claus
parents:
diff changeset
   446
2124
6238280f6120 new infoMessage scheme
Claus Gittinger <cg@exept.de>
parents: 1491
diff changeset
   447
!Character class methodsFor:'queries'!
3
24d81bf47225 *** empty log message ***
claus
parents: 2
diff changeset
   448
4337
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   449
allCharacters
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   450
    "added for squeak compatibility: return a collection of all chars"
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   451
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   452
    ^ CharacterTable
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   453
        
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   454
    "
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   455
     Character allCharacters 
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   456
    "
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   457
!
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   458
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
   459
hasSharedInstances
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
   460
    "return true if this class has shared instances, that is, instances
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
   461
     with the same value are identical.
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
   462
     False is returned here, only redefined in classes which have unified
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
   463
     instances (or should be treated so)."
2672
dc3662188b2c added #hasImmediateInstances for VW compatibility
Claus Gittinger <cg@exept.de>
parents: 2561
diff changeset
   464
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
   465
    ^ true
2672
dc3662188b2c added #hasImmediateInstances for VW compatibility
Claus Gittinger <cg@exept.de>
parents: 2561
diff changeset
   466
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
   467
2672
dc3662188b2c added #hasImmediateInstances for VW compatibility
Claus Gittinger <cg@exept.de>
parents: 2561
diff changeset
   468
!
dc3662188b2c added #hasImmediateInstances for VW compatibility
Claus Gittinger <cg@exept.de>
parents: 2561
diff changeset
   469
3
24d81bf47225 *** empty log message ***
claus
parents: 2
diff changeset
   470
isBuiltInClass
1271
a11ef7b9c780 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1229
diff changeset
   471
    "return true if this class is known by the run-time-system.
a11ef7b9c780 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1229
diff changeset
   472
     Here, true is returned for myself, false for subclasses."
3
24d81bf47225 *** empty log message ***
claus
parents: 2
diff changeset
   473
24d81bf47225 *** empty log message ***
claus
parents: 2
diff changeset
   474
    ^ self == Character
1271
a11ef7b9c780 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1229
diff changeset
   475
a11ef7b9c780 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1229
diff changeset
   476
    "Modified: 23.4.1996 / 15:56:39 / cg"
4337
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   477
!
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   478
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   479
separators
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   480
    "added for squeak compatibility: return a collection of separator chars"
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   481
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   482
    ^ Array 
7019
6591ea303fbd *** empty log message ***
penk
parents: 6811
diff changeset
   483
        with:Character space
6591ea303fbd *** empty log message ***
penk
parents: 6811
diff changeset
   484
        with:Character return
6591ea303fbd *** empty log message ***
penk
parents: 6811
diff changeset
   485
        "/ with:Character cr
6591ea303fbd *** empty log message ***
penk
parents: 6811
diff changeset
   486
        with:Character tab
6591ea303fbd *** empty log message ***
penk
parents: 6811
diff changeset
   487
        with:Character lf
6591ea303fbd *** empty log message ***
penk
parents: 6811
diff changeset
   488
        with:Character ff
4337
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   489
        
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   490
    "
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   491
     Character separators
07fad5b7af9b added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents: 4037
diff changeset
   492
    "
3
24d81bf47225 *** empty log message ***
claus
parents: 2
diff changeset
   493
! !
24d81bf47225 *** empty log message ***
claus
parents: 2
diff changeset
   494
7261
f35fc9cee675 method category rename
Claus Gittinger <cg@exept.de>
parents: 7257
diff changeset
   495
!Character methodsFor:'Compatibility-ANSI'!
5945
Claus Gittinger <cg@exept.de>
parents: 5847
diff changeset
   496
Claus Gittinger <cg@exept.de>
parents: 5847
diff changeset
   497
codePoint
7897
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
   498
    "return the codePoint of myself.
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
   499
     Traditionally, this was named 'asciiValue'; 
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
   500
     however, characters are not limited to 8bit characters."
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
   501
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
   502
    ^ asciivalue
5945
Claus Gittinger <cg@exept.de>
parents: 5847
diff changeset
   503
! !
5847
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   504
7261
f35fc9cee675 method category rename
Claus Gittinger <cg@exept.de>
parents: 7257
diff changeset
   505
!Character methodsFor:'Compatibility-Dolphin'!
6324
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   506
7351
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   507
isAlphaNumeric
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   508
    "return true, if I am a letter or a digit
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   509
     Please use isLetterOrDigit for compatibility reasons (which is ANSI)."
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   510
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   511
    ^ self isLetterOrDigit
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   512
!
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   513
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   514
isAlphabetic
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   515
    "return true, if I am a letter.
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   516
     Please use isLetter for compatibility reasons (which is ANSI)."
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   517
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   518
    ^ self isLetter
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   519
!
1f805a32d551 comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents: 7300
diff changeset
   520
6324
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   521
isControl
7353
a5227413a23a comment
Claus Gittinger <cg@exept.de>
parents: 7351
diff changeset
   522
    "return true if I am a control character (i.e. ascii value < 32)"
a5227413a23a comment
Claus Gittinger <cg@exept.de>
parents: 7351
diff changeset
   523
a5227413a23a comment
Claus Gittinger <cg@exept.de>
parents: 7351
diff changeset
   524
    ^ self isControlCharacter
6324
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   525
!
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   526
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   527
isHexDigit
7354
b22823ab2232 comment
Claus Gittinger <cg@exept.de>
parents: 7353
diff changeset
   528
    "return true if I am a valid hexadecimal digit"
b22823ab2232 comment
Claus Gittinger <cg@exept.de>
parents: 7353
diff changeset
   529
6324
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   530
    ^ '0123456789abcdefABCDEF' includes:self
7354
b22823ab2232 comment
Claus Gittinger <cg@exept.de>
parents: 7353
diff changeset
   531
b22823ab2232 comment
Claus Gittinger <cg@exept.de>
parents: 7353
diff changeset
   532
    "
b22823ab2232 comment
Claus Gittinger <cg@exept.de>
parents: 7353
diff changeset
   533
     $a isHexDigit
b22823ab2232 comment
Claus Gittinger <cg@exept.de>
parents: 7353
diff changeset
   534
    "
6324
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   535
!
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   536
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   537
isPunctuation
7897
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
   538
    "the code below is not unicode aware"
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
   539
6327
6f90155dd142 #isPunctuation
Claus Gittinger <cg@exept.de>
parents: 6324
diff changeset
   540
    ^ (asciivalue between:16r21 and:16r40)
6f90155dd142 #isPunctuation
Claus Gittinger <cg@exept.de>
parents: 6324
diff changeset
   541
      or:[ (asciivalue between:16r5B and:16r60)
6f90155dd142 #isPunctuation
Claus Gittinger <cg@exept.de>
parents: 6324
diff changeset
   542
      or:[ (asciivalue between:123 and:126)
6f90155dd142 #isPunctuation
Claus Gittinger <cg@exept.de>
parents: 6324
diff changeset
   543
      or:[ (asciivalue between:161 and:191)
6f90155dd142 #isPunctuation
Claus Gittinger <cg@exept.de>
parents: 6324
diff changeset
   544
      or:[ (asciivalue == 215 )
6f90155dd142 #isPunctuation
Claus Gittinger <cg@exept.de>
parents: 6324
diff changeset
   545
      or:[ (asciivalue == 247 ) ]]]]]
6324
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   546
! !
377c841da590 added #null
Claus Gittinger <cg@exept.de>
parents: 6066
diff changeset
   547
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   548
!Character methodsFor:'accessing'!
a27a279701f8 Initial revision
claus
parents:
diff changeset
   549
a27a279701f8 Initial revision
claus
parents:
diff changeset
   550
asciiValue
54
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
   551
    "return the asciivalue of myself.
7897
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
   552
     The name 'asciiValue' is a historic leftover:
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
   553
        characters are not limited to 8bit characters.
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
   554
     So the actual value returned is a codePoint (i.e. full potential for 31bit encoding).
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
   555
     PP has removed this method with 4.1 and providing asInteger instead."
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   556
7897
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
   557
    ^ asciivalue
1491
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   558
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   559
    "Modified: 27.6.1996 / 12:34:34 / cg"
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   560
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
   561
a27a279701f8 Initial revision
claus
parents:
diff changeset
   562
instVarAt:index put:anObject
54
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
   563
    "catch instvar access - asciivalue may not be changed"
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   564
a27a279701f8 Initial revision
claus
parents:
diff changeset
   565
    self error:'Characters may not be modified'
2214
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   566
!
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   567
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   568
newDigitValue
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   569
	"Answer 0-9 if the receiver is $0-$9, 10-35 if it is $A-$Z, and < 0 otherwise. 
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   570
	This is used to parse literal numbers of radix 2-36."
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   571
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   572
	| code value |
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   573
	code := self asInteger.
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   574
	value := code - $0 asInteger.
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   575
	value <= 9 ifTrue: [^value].
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   576
	value := code - $A asInteger.
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   577
	(value >= 0 and: [value < 26])
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   578
		ifTrue: [^value + 10].
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   579
	^-1
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   580
! !
a27a279701f8 Initial revision
claus
parents:
diff changeset
   581
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   582
!Character methodsFor:'arithmetic'!
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   583
a27a279701f8 Initial revision
claus
parents:
diff changeset
   584
+ aMagnitude
a27a279701f8 Initial revision
claus
parents:
diff changeset
   585
    "Return the Character that is <aMagnitude> higher than the receiver. 
a27a279701f8 Initial revision
claus
parents:
diff changeset
   586
     Wrap if the resulting value is not a legal Character value. (JS)"
a27a279701f8 Initial revision
claus
parents:
diff changeset
   587
6531
8798c9a1d1e6 arithmetic wraps at 16bit bounds (unicode)
Claus Gittinger <cg@exept.de>
parents: 6527
diff changeset
   588
    ^ Character value:(asciivalue + aMagnitude asInteger \\ 16rFFFF)
1491
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   589
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   590
    "
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   591
     $A + 5
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   592
    "
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   593
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   594
    "Modified: 27.6.1996 / 12:34:51 / cg"
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   595
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
   596
a27a279701f8 Initial revision
claus
parents:
diff changeset
   597
- aMagnitude
a27a279701f8 Initial revision
claus
parents:
diff changeset
   598
    "Return the Character that is <aMagnitude> lower than the receiver.  
68
59faa75185ba *** empty log message ***
claus
parents: 54
diff changeset
   599
     Wrap if the resulting value is not a legal Character value. (JS)
1491
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   600
     claus: 
6531
8798c9a1d1e6 arithmetic wraps at 16bit bounds (unicode)
Claus Gittinger <cg@exept.de>
parents: 6527
diff changeset
   601
        modified to return the difference as integer, if the argument
8798c9a1d1e6 arithmetic wraps at 16bit bounds (unicode)
Claus Gittinger <cg@exept.de>
parents: 6527
diff changeset
   602
        is another character. If the argument is a number, a character is
8798c9a1d1e6 arithmetic wraps at 16bit bounds (unicode)
Claus Gittinger <cg@exept.de>
parents: 6527
diff changeset
   603
        returned."
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   604
68
59faa75185ba *** empty log message ***
claus
parents: 54
diff changeset
   605
    aMagnitude isCharacter ifTrue:[
6531
8798c9a1d1e6 arithmetic wraps at 16bit bounds (unicode)
Claus Gittinger <cg@exept.de>
parents: 6527
diff changeset
   606
        ^ self asciiValue - aMagnitude asciiValue
68
59faa75185ba *** empty log message ***
claus
parents: 54
diff changeset
   607
    ].
6531
8798c9a1d1e6 arithmetic wraps at 16bit bounds (unicode)
Claus Gittinger <cg@exept.de>
parents: 6527
diff changeset
   608
    ^ Character value:(asciivalue - aMagnitude asInteger \\ 16rFFFF)
68
59faa75185ba *** empty log message ***
claus
parents: 54
diff changeset
   609
59faa75185ba *** empty log message ***
claus
parents: 54
diff changeset
   610
    "
59faa75185ba *** empty log message ***
claus
parents: 54
diff changeset
   611
     $z - $a  
59faa75185ba *** empty log message ***
claus
parents: 54
diff changeset
   612
     $d - 3
59faa75185ba *** empty log message ***
claus
parents: 54
diff changeset
   613
    "
1491
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   614
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   615
    "Modified: 27.6.1996 / 12:35:34 / cg"
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   616
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
   617
a27a279701f8 Initial revision
claus
parents:
diff changeset
   618
// aMagnitude
a27a279701f8 Initial revision
claus
parents:
diff changeset
   619
    "Return the Character who's value is the receiver divided by <aMagnitude>. 
a27a279701f8 Initial revision
claus
parents:
diff changeset
   620
     Wrap if the resulting value is not a legal Character value. (JS)"
a27a279701f8 Initial revision
claus
parents:
diff changeset
   621
6531
8798c9a1d1e6 arithmetic wraps at 16bit bounds (unicode)
Claus Gittinger <cg@exept.de>
parents: 6527
diff changeset
   622
    ^ Character value:(asciivalue // aMagnitude asInteger \\ 16rFFFF)
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   623
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
   624
a27a279701f8 Initial revision
claus
parents:
diff changeset
   625
\\ aMagnitude
a27a279701f8 Initial revision
claus
parents:
diff changeset
   626
    "Return the Character who's value is the receiver modulo <aMagnitude>.  
a27a279701f8 Initial revision
claus
parents:
diff changeset
   627
     Wrap if the resulting value is not a legal Character value. (JS)"
a27a279701f8 Initial revision
claus
parents:
diff changeset
   628
6531
8798c9a1d1e6 arithmetic wraps at 16bit bounds (unicode)
Claus Gittinger <cg@exept.de>
parents: 6527
diff changeset
   629
    ^ Character value:(asciivalue \\ aMagnitude asInteger \\ 16rFFFF)
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
   630
! !
a27a279701f8 Initial revision
claus
parents:
diff changeset
   631
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   632
!Character methodsFor:'binary storage'!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   633
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   634
hasSpecialBinaryRepresentation
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   635
    "return true, if the receiver has a special binary representation"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   636
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   637
    ^ true
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   638
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   639
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   640
storeBinaryOn:stream manager:manager
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   641
    "store a binary representation of the receiver on stream;
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   642
     redefined, since single-byte characters are stored more compact
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   643
     with a special type-code followed by the asciiValue."
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   644
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   645
    (asciivalue < 256) ifTrue:[
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
   646
	stream nextPut:manager codeForCharacter; nextPut:asciivalue.
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   647
    ] ifFalse:[
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
   648
	stream nextPut:manager codeForTwoByteCharacter.
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
   649
	stream nextPutShort:asciivalue MSB:true
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   650
    ]
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   651
! !
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   652
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   653
!Character methodsFor:'comparing'!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   654
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   655
< aCharacter
7799
99753997e677 comments
Claus Gittinger <cg@exept.de>
parents: 7715
diff changeset
   656
    "return true, if the arguments asciiValue is greater than the receiver's"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   657
5315
04052c5626bf allow compare against integers (asciiValue compare).
Claus Gittinger <cg@exept.de>
parents: 4728
diff changeset
   658
    ^ (asciivalue < aCharacter asInteger)
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   659
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   660
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   661
<= aCharacter
7799
99753997e677 comments
Claus Gittinger <cg@exept.de>
parents: 7715
diff changeset
   662
    "return true, if the arguments asciiValue is greater or equal to the receiver's"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   663
5315
04052c5626bf allow compare against integers (asciiValue compare).
Claus Gittinger <cg@exept.de>
parents: 4728
diff changeset
   664
    ^ (asciivalue <= aCharacter asInteger)
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   665
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   666
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   667
= aCharacter
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   668
    "return true, if the argument, aCharacter is the same character
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   669
     Redefined to take care of 16bit characters."
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   670
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   671
    self == aCharacter ifTrue:[^ true].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   672
    aCharacter isCharacter ifFalse:[^ false].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   673
    ^ (asciivalue = aCharacter asciiValue)
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   674
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   675
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   676
> aCharacter
7799
99753997e677 comments
Claus Gittinger <cg@exept.de>
parents: 7715
diff changeset
   677
    "return true, if the arguments asciiValue is less than the receiver's"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   678
5315
04052c5626bf allow compare against integers (asciiValue compare).
Claus Gittinger <cg@exept.de>
parents: 4728
diff changeset
   679
    ^ (asciivalue > aCharacter asInteger)
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   680
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   681
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   682
>= aCharacter
7799
99753997e677 comments
Claus Gittinger <cg@exept.de>
parents: 7715
diff changeset
   683
    "return true, if the arguments asciiValue is less or equal to the receiver's"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   684
5315
04052c5626bf allow compare against integers (asciiValue compare).
Claus Gittinger <cg@exept.de>
parents: 4728
diff changeset
   685
    ^ (asciivalue >= aCharacter asInteger)
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   686
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   687
5540
7e0811b2ded9 added #hash
Claus Gittinger <cg@exept.de>
parents: 5473
diff changeset
   688
hash
7e0811b2ded9 added #hash
Claus Gittinger <cg@exept.de>
parents: 5473
diff changeset
   689
    "return an integer useful for hashing"
7e0811b2ded9 added #hash
Claus Gittinger <cg@exept.de>
parents: 5473
diff changeset
   690
7e0811b2ded9 added #hash
Claus Gittinger <cg@exept.de>
parents: 5473
diff changeset
   691
    ^ asciivalue
7e0811b2ded9 added #hash
Claus Gittinger <cg@exept.de>
parents: 5473
diff changeset
   692
!
7e0811b2ded9 added #hash
Claus Gittinger <cg@exept.de>
parents: 5473
diff changeset
   693
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   694
identityHash
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   695
    "return an integer useful for hashing on identity"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   696
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   697
    asciivalue <= 255 ifTrue:[
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   698
	^ 4096 + asciivalue
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   699
    ].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   700
    ^ super identityHash
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   701
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   702
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   703
sameAs:aCharacter
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   704
    "return true, if the argument, aCharacter is the same character,
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   705
     ignoring case differences."
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   706
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   707
    self == aCharacter ifTrue:[^ true].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   708
    ^ self asLowercase = aCharacter asLowercase
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   709
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   710
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   711
~= aCharacter
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   712
    "return true, if the argument, aCharacter is not the same character
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   713
     Redefined to take care of 16bit characters."
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   714
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   715
    self == aCharacter ifTrue:[^ false].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   716
    aCharacter isCharacter ifFalse:[^ true].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   717
    ^ (asciivalue ~~ aCharacter asciiValue)
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   718
! !
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   719
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   720
!Character methodsFor:'converting'!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   721
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   722
asCharacter
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   723
    "usually sent to integers, but redefined here to allow integers
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   724
     and characters to be used commonly without a need for a test."
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   725
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   726
    ^ self
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   727
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   728
    "
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   729
     32 asCharacter  
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   730
    "
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   731
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   732
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   733
asInteger
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   734
    "return an Integer with my ascii-value.
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   735
     OWST4.2 compatibility (sigh)"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   736
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   737
    ^ asciivalue
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   738
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   739
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   740
asLowercase
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   741
    "return a character with same letter as the receiver,
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   742
     but lowercase (the receiver if its lowercase or nonLetter)"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   743
1491
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   744
    |code "{Class: SmallInteger }"|
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   745
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   746
"/ the old code:
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   747
"/    self isUppercase ifFalse:[^ self].
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   748
"/    ^ Character value:(asciivalue + 32)
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   749
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   750
    code := asciivalue.
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   751
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   752
    "/ ISO Latin-1
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   753
    ((code >= $A asciiValue) and:[code <= $Z asciiValue]) ifTrue:[
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
   754
	^ Character value:(code + ($a asciiValue - $A asciiValue))
1491
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   755
    ].
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   756
    code < 16r00C0 ifTrue:[^ self].
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   757
    code < 16r0100 ifTrue:[
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
   758
	code >= 16r00DF ifTrue:[^ self].
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
   759
	code == 16r00D7 ifTrue:[^ self].
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
   760
	^ Character value:(code + 16r20)
1491
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   761
    ].
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   762
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   763
    "/ mhmh - in which encoding is this character.
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   764
    "/ here, assume Unicode
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   765
2124
6238280f6120 new infoMessage scheme
Claus Gittinger <cg@exept.de>
parents: 1491
diff changeset
   766
    'Character [info]: Unicode support is under construction' infoPrintCR.
1491
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   767
    ^ self
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   768
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   769
    "
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   770
     $A asLowercase 
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   771
     $1 asLowercase  
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   772
    "
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   773
2124
6238280f6120 new infoMessage scheme
Claus Gittinger <cg@exept.de>
parents: 1491
diff changeset
   774
    "Modified: 10.1.1997 / 15:35:44 / cg"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   775
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   776
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   777
asString
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   778
    "return a string of len 1 with myself as contents"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   779
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   780
%{  /* NOCONTEXT */
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   781
    char buffer[2];
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   782
    OBJ s;
995
b018368b3a94 asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents: 819
diff changeset
   783
    unsigned val;
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   784
995
b018368b3a94 asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents: 819
diff changeset
   785
    val = __intVal(_characterVal(self));
b018368b3a94 asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents: 819
diff changeset
   786
    if (val <= 0xFF) {
5407
d6729266a95b *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 5315
diff changeset
   787
        buffer[0] = (char) val;
d6729266a95b *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 5315
diff changeset
   788
        buffer[1] = '\0';
d6729266a95b *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 5315
diff changeset
   789
        s = __MKSTRING_L(buffer, 1 COMMA_SND);
d6729266a95b *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 5315
diff changeset
   790
        if (s != nil) {
d6729266a95b *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 5315
diff changeset
   791
            RETURN (s);
d6729266a95b *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 5315
diff changeset
   792
        }
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   793
    }
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   794
%}.
7951
062ac899852e care for 4-byte characters
Claus Gittinger <cg@exept.de>
parents: 7897
diff changeset
   795
    asciivalue > 16rFF ifTrue:[
062ac899852e care for 4-byte characters
Claus Gittinger <cg@exept.de>
parents: 7897
diff changeset
   796
        asciivalue > 16rFFFF ifTrue:[
062ac899852e care for 4-byte characters
Claus Gittinger <cg@exept.de>
parents: 7897
diff changeset
   797
            ^ (FourByteString new:1) at:1 put:self; yourself
062ac899852e care for 4-byte characters
Claus Gittinger <cg@exept.de>
parents: 7897
diff changeset
   798
        ].
5407
d6729266a95b *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 5315
diff changeset
   799
        ^ (TwoByteString new:1) at:1 put:self; yourself
995
b018368b3a94 asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents: 819
diff changeset
   800
    ].
b018368b3a94 asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents: 819
diff changeset
   801
5407
d6729266a95b *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 5315
diff changeset
   802
    ^ (String new:1) at:1 put:self; yourself.
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   803
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   804
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   805
asSymbol
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   806
    "return a unique symbol which prints like I print"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   807
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   808
    ^ Symbol internCharacter:self
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   809
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   810
6029
594664bbbc47 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 5945
diff changeset
   811
asUnicodeString
594664bbbc47 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 5945
diff changeset
   812
    "return a unicode string of len 1 with myself as contents"
594664bbbc47 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 5945
diff changeset
   813
7951
062ac899852e care for 4-byte characters
Claus Gittinger <cg@exept.de>
parents: 7897
diff changeset
   814
    asciivalue > 16rFFFF ifTrue:[
062ac899852e care for 4-byte characters
Claus Gittinger <cg@exept.de>
parents: 7897
diff changeset
   815
        ^ (Unicode32String new:1) at:1 put:self; yourself.
062ac899852e care for 4-byte characters
Claus Gittinger <cg@exept.de>
parents: 7897
diff changeset
   816
    ].
062ac899852e care for 4-byte characters
Claus Gittinger <cg@exept.de>
parents: 7897
diff changeset
   817
    ^ (Unicode16String new:1) at:1 put:self; yourself.
6029
594664bbbc47 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 5945
diff changeset
   818
!
594664bbbc47 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 5945
diff changeset
   819
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   820
asUppercase
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   821
    "return a character with same letter as the receiver,
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   822
     but uppercase (the receiver if its uppercase or nonLetter)"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   823
1491
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   824
    |code "{Class: SmallInteger }"|
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   825
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   826
"/ the old code:
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   827
"/    self isLowercase ifFalse:[^ self].
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   828
"/    ^ Character value:(asciivalue - 32)
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   829
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   830
    code := asciivalue.
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   831
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   832
    "/ ISO Latin-1
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   833
    ((code >= $a asciiValue) and:[code <= $z asciiValue]) ifTrue:[
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
   834
	^ Character value:(code + ($A asciiValue - $a asciiValue))
1491
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   835
    ].
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   836
    code < 16r00E0 ifTrue:[^ self].
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   837
    code < 16r0100 ifTrue:[
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
   838
	code == 16r00F7 ifTrue:[^ self]. "/ division
1491
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   839
"/        code == 16r00FF ifTrue:[^ Character value:16r0178].  "/ y diaresis (no uppercase equivalent in ISO-latin 1
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
   840
	code == 16r00FF ifTrue:[^ self]. 
1491
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   841
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
   842
	^ Character value:(code - 16r20)
1491
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   843
    ].
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   844
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   845
    "/ mhmh - in which encoding is this character.
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   846
    "/ here, assume Unicode
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   847
2124
6238280f6120 new infoMessage scheme
Claus Gittinger <cg@exept.de>
parents: 1491
diff changeset
   848
    'Character [info]: Unicode support is under construction' infoPrintCR.
1491
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   849
    ^ self
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   850
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   851
    "
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   852
     $A asLowercase 
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   853
     $1 asLowercase  
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   854
    "
a42ae3fbb756 fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents: 1295
diff changeset
   855
2124
6238280f6120 new infoMessage scheme
Claus Gittinger <cg@exept.de>
parents: 1491
diff changeset
   856
    "Modified: 10.1.1997 / 15:35:51 / cg"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   857
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   858
3058
187372cf3eb8 added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents: 2908
diff changeset
   859
decodeAsLiteralArray
187372cf3eb8 added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents: 2908
diff changeset
   860
    "given a literalEncoding in the receiver,
187372cf3eb8 added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents: 2908
diff changeset
   861
     create & return the corresponding object.
187372cf3eb8 added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents: 2908
diff changeset
   862
     The inverse operation to #literalArrayEncoding."
187372cf3eb8 added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents: 2908
diff changeset
   863
187372cf3eb8 added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents: 2908
diff changeset
   864
    ^ self
187372cf3eb8 added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents: 2908
diff changeset
   865
187372cf3eb8 added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents: 2908
diff changeset
   866
    "Created: / 27.10.1997 / 14:40:16 / cg"
187372cf3eb8 added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents: 2908
diff changeset
   867
!
187372cf3eb8 added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents: 2908
diff changeset
   868
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   869
digitValue
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   870
    "return my digitValue for any base"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   871
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   872
    |ascii "{ Class: SmallInteger }" |
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   873
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   874
    ascii := asciivalue.
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   875
    (ascii between:($0 asciiValue) and:($9 asciiValue)) ifTrue:[
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   876
	^ ascii - $0 asciiValue
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   877
    ].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   878
    (ascii between:($a asciiValue) and:($z asciiValue)) ifTrue:[
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   879
	^ ascii + (10 - $a asciiValue)
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   880
    ]. 
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   881
    (ascii between:($A asciiValue) and:($Z asciiValue)) ifTrue:[
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   882
	^ ascii + (10 - $A asciiValue)
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   883
    ]. 
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   884
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   885
"remove error below for X3J20 conformance ... "
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   886
    self error:'bad character'.
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   887
" "
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   888
    ^ -1
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   889
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   890
3058
187372cf3eb8 added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents: 2908
diff changeset
   891
literalArrayEncoding
187372cf3eb8 added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents: 2908
diff changeset
   892
    "encode myself as an array literal, from which a copy of the receiver
187372cf3eb8 added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents: 2908
diff changeset
   893
     can be reconstructed with #decodeAsLiteralArray."
187372cf3eb8 added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents: 2908
diff changeset
   894
187372cf3eb8 added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents: 2908
diff changeset
   895
    ^ self
187372cf3eb8 added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents: 2908
diff changeset
   896
187372cf3eb8 added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents: 2908
diff changeset
   897
    "Created: / 27.10.1997 / 14:40:37 / cg"
187372cf3eb8 added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents: 2908
diff changeset
   898
!
187372cf3eb8 added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents: 2908
diff changeset
   899
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   900
to:aMagnitude
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   901
    "Return an Interval over the characters from the receiver to <aMagnitude>. 
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   902
     Wrap <aMagnitude> if it is not a legal Character value. (JS)"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   903
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   904
    ^ Interval from:self to:(aMagnitude \\ 256)
5847
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   905
!
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   906
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   907
utf8Encoded
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   908
    "convert a character from ISO8859-1 to its UTF-8 encoding.
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   909
     this returns a String"
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   910
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   911
    |s|
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   912
7897
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
   913
    asciivalue <= 16r7F ifTrue:[
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
   914
        ^ self asString.
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
   915
    ].
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
   916
5847
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   917
    s := (String new:6) writeStream.
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   918
    self utf8EncodedOn:s.
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   919
    ^ s contents
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   920
!
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   921
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   922
utf8EncodedOn:aStream
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   923
    "append my UTF-8 representation to the argument, aStream.
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   924
     Up to 31 bits can be encoded in up to 6 bytes. 
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   925
     However, currently, characters are limited to 16 bits."
5847
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   926
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   927
    |b1 b2 b3 b4 b5 v|
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   928
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   929
    asciivalue <= 16r7F ifTrue:[
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   930
        aStream nextPut:self.
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   931
        ^ self.
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   932
    ].
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   933
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   934
    b1 := Character value:((asciivalue bitAnd:16r3F) bitOr:2r10000000).
5847
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   935
    v := asciivalue bitShift:-6.
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   936
    v <= 16r1F ifTrue:[
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   937
        aStream nextPut:(Character value:(v bitOr:2r11000000)).
5847
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   938
        aStream nextPut:b1.
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   939
        ^ self.
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   940
    ].
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   941
    b2 := Character value:((v bitAnd:16r3F) bitOr:2r10000000).
5847
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   942
    v := v bitShift:-6.
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   943
    v <= 16r0F ifTrue:[
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   944
        aStream nextPut:(Character value:(v bitOr:2r11100000)).
7897
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
   945
        aStream nextPut:b2; nextPut:b1.
5847
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   946
        ^ self.
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   947
    ].
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   948
    b3 := Character value:((v bitAnd:16r3F) bitOr:2r10000000).
5847
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   949
    v := v bitShift:-6.
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   950
    v <= 16r07 ifTrue:[
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   951
        aStream nextPut:(Character value:(v bitOr:2r11110000)).
7897
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
   952
        aStream nextPut:b3; nextPut:b2; nextPut:b1.
5847
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   953
        ^ self.
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   954
    ].
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   955
    b4 := Character value:((v bitAnd:16r3F) bitOr:2r10000000).
5847
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   956
    v := v bitShift:-6.
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   957
    v <= 16r03 ifTrue:[
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   958
        aStream nextPut:(Character value:(v bitOr:2r11111000)).
7897
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
   959
        aStream nextPut:b4; nextPut:b3; nextPut:b2; nextPut:b1.
5847
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   960
        ^ self.
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   961
    ].
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   962
    b5 := Character value:((v bitAnd:16r3F) bitOr:2r10000000).
5847
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   963
    v := v bitShift:-6.
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   964
    v <= 16r01 ifTrue:[
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   965
        aStream nextPut:(Character value:(v bitOr:2r11111100)).
7897
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
   966
        aStream nextPut:b5; nextPut:b4; nextPut:b3; nextPut:b2; nextPut:b1.
5847
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   967
        ^ self.
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   968
    ].
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   969
    "/ cannot happen - we only support up to 16 bit characters
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   970
    self error:'ascii value > 31bit in utf8Encode'.
5847
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   971
        
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   972
    "
6527
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
   973
      $a utf8Encoded   
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
   974
      $ü utf8Encoded asByteArray  
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
   975
      (Character value:16r1fff) utf8Encoded asByteArray      
6808
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   976
      (Character value:16rffff) utf8Encoded asByteArray      
f45445c6dfdb utf8 encoding fixed;
Claus Gittinger <cg@exept.de>
parents: 6531
diff changeset
   977
      (Character value:16r800) utf8Encoded asByteArray      
5847
aab6e1f01f55 UTF-8 Stuff
Stefan Vogel <sv@exept.de>
parents: 5746
diff changeset
   978
    "
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   979
! !
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   980
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   981
!Character methodsFor:'copying'!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   982
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   983
copy
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   984
    "return a copy of myself
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   985
     reimplemented since characters are unique"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   986
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   987
     ^ self
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   988
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   989
2214
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   990
copyUpToLast: char 
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   991
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   992
	^self
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   993
!
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   994
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   995
deepCopyUsing:aDictionary
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   996
    "return a deep copy of myself
4728
37eaa8241422 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 4682
diff changeset
   997
     reimplemented since characters are immutable"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   998
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
   999
     ^ self
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1000
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1001
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1002
shallowCopy
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1003
    "return a shallow copy of myself
4728
37eaa8241422 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 4682
diff changeset
  1004
     reimplemented since characters are immutable"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1005
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1006
     ^ self
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1007
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1008
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1009
simpleDeepCopy
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1010
    "return a deep copy of myself
4728
37eaa8241422 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 4682
diff changeset
  1011
     reimplemented since characters are immutable"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1012
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1013
     ^ self
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1014
! !
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1015
5471
a57eeb01c5ab General encoding method (#encodeOn:with:)
Stefan Vogel <sv@exept.de>
parents: 5452
diff changeset
  1016
!Character methodsFor:'encoding'!
a57eeb01c5ab General encoding method (#encodeOn:with:)
Stefan Vogel <sv@exept.de>
parents: 5452
diff changeset
  1017
a57eeb01c5ab General encoding method (#encodeOn:with:)
Stefan Vogel <sv@exept.de>
parents: 5452
diff changeset
  1018
encodeOn:anEncoder with:aParameter
a57eeb01c5ab General encoding method (#encodeOn:with:)
Stefan Vogel <sv@exept.de>
parents: 5452
diff changeset
  1019
a57eeb01c5ab General encoding method (#encodeOn:with:)
Stefan Vogel <sv@exept.de>
parents: 5452
diff changeset
  1020
    anEncoder encodeCharacter:self with:aParameter
a57eeb01c5ab General encoding method (#encodeOn:with:)
Stefan Vogel <sv@exept.de>
parents: 5452
diff changeset
  1021
a57eeb01c5ab General encoding method (#encodeOn:with:)
Stefan Vogel <sv@exept.de>
parents: 5452
diff changeset
  1022
6508
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1023
!
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1024
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1025
rot13
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1026
     "Usenet: from `rotate alphabet 13 places'] 
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1027
      The simple Caesar-cypher encryption that replaces each English
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1028
      letter with the one 13 places forward or back along the alphabet, 
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1029
      so that 'The butler did it!!' becomes 'Gur ohgyre qvq vg!!'
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1030
      Most Usenet news reading and posting programs include a rot13 feature. 
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1031
      It is used to enclose the text in a sealed wrapper that the reader must choose
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1032
      to open -- e.g., for posting things that might offend some readers, or spoilers. 
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1033
      A major advantage of rot13 over rot(N) for other N is that it
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1034
      is self-inverse, so the same code can be used for encoding and decoding."
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1035
7715
0e69a830f5d8 use #and: - not #& you lazy bone, you
Claus Gittinger <cg@exept.de>
parents: 7689
diff changeset
  1036
    (((self >= $a) and:[ self < $n]) or:[(self >= $A) and:[ self < $N]])
6508
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1037
        ifTrue: [ ^ Character value:(self asciiValue + 13) ].
7715
0e69a830f5d8 use #and: - not #& you lazy bone, you
Claus Gittinger <cg@exept.de>
parents: 7689
diff changeset
  1038
    (((self > $m) and:[ self <= $z]) or:[(self > $M) and:[ self <= $Z]])
6508
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1039
        ifTrue: [ ^ Character value:(self asciiValue - 13) ].
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1040
    ^ self
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1041
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1042
    "
7715
0e69a830f5d8 use #and: - not #& you lazy bone, you
Claus Gittinger <cg@exept.de>
parents: 7689
diff changeset
  1043
     $h rot13      
0e69a830f5d8 use #and: - not #& you lazy bone, you
Claus Gittinger <cg@exept.de>
parents: 7689
diff changeset
  1044
     $h rot13 rot13           
0e69a830f5d8 use #and: - not #& you lazy bone, you
Claus Gittinger <cg@exept.de>
parents: 7689
diff changeset
  1045
     'The butler did it!!' rot13             -> 'Gur ohgyre qvq vg!!'
0e69a830f5d8 use #and: - not #& you lazy bone, you
Claus Gittinger <cg@exept.de>
parents: 7689
diff changeset
  1046
     'The butler did it!!' rot13 rot13       -> 'The butler did it!!'
6508
Claus Gittinger <cg@exept.de>
parents: 6398
diff changeset
  1047
    "
5471
a57eeb01c5ab General encoding method (#encodeOn:with:)
Stefan Vogel <sv@exept.de>
parents: 5452
diff changeset
  1048
! !
a57eeb01c5ab General encoding method (#encodeOn:with:)
Stefan Vogel <sv@exept.de>
parents: 5452
diff changeset
  1049
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1050
!Character methodsFor:'national testing'!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1051
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1052
isNationalAlphaNumeric
2214
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
  1053
    "return true, if the receiver is a letter or digit in the
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1054
     current language (Language variable)"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1055
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1056
    "stupid - should be configurable from a table ...
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1057
     ... good thing is, that iso8859 puts all national
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1058
	 characters above 16rC0"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1059
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1060
    self isLetterOrDigit ifTrue:[^ true].
2214
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
  1061
    ^ self isNationalLetter
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
  1062
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
  1063
    "Modified: 21.1.1997 / 11:10:54 / cg"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1064
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1065
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1066
isNationalLetter
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1067
    "return true, if the receiver is a letter in the
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1068
     current language (Language variable)"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1069
7897
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
  1070
    "stupid - the code below is not unicode aware"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1071
    "stupid - should be configurable from a table ...
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1072
     ... good thing is, that iso8859 puts all national
7897
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
  1073
         characters above 16rC0.
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
  1074
    "
2214
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
  1075
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1076
    self isLetter ifTrue:[^ true].
2214
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
  1077
    (asciivalue between:16rC0 and:16rD6) ifTrue:[^ true].
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
  1078
    (asciivalue between:16rD8 and:16rF6) ifTrue:[^ true].
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
  1079
    (asciivalue between:16rF8 and:16rFF) ifTrue:[^ true].
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
  1080
    ^ false.
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
  1081
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
  1082
    "Modified: 21.1.1997 / 11:13:02 / cg"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1083
! !
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1084
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1085
!Character methodsFor:'printing & storing'!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1086
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1087
displayString
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1088
    "return a string used when the receiver is to be displayed
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1089
     in an inspector kind-of-thing"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1090
2908
b3be3629fcb1 vms stuff
Claus Gittinger <cg@exept.de>
parents: 2840
diff changeset
  1091
    ^ (self storeString , ' "16r' , (asciivalue printStringRadix:16) , '"')
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1092
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1093
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1094
isLiteral
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1095
    "return true, if the receiver can be used as a literal constant in ST syntax
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1096
     (i.e. can be used in constant arrays)"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1097
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1098
    ^ true
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1099
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1100
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1101
print
5746
cf5e42cb72ef do not overwrite the standard printing conventions
Claus Gittinger <cg@exept.de>
parents: 5566
diff changeset
  1102
    "print myself on stdout.
cf5e42cb72ef do not overwrite the standard printing conventions
Claus Gittinger <cg@exept.de>
parents: 5566
diff changeset
  1103
     This method does NOT (by purpose) use the stream classes and 
cf5e42cb72ef do not overwrite the standard printing conventions
Claus Gittinger <cg@exept.de>
parents: 5566
diff changeset
  1104
     will therefore work even in case of emergency (but only, if Stdout is nil)."
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1105
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1106
%{  /* NOCONTEXT */
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1107
5746
cf5e42cb72ef do not overwrite the standard printing conventions
Claus Gittinger <cg@exept.de>
parents: 5566
diff changeset
  1108
    if (@global(Stdout) == nil) {
cf5e42cb72ef do not overwrite the standard printing conventions
Claus Gittinger <cg@exept.de>
parents: 5566
diff changeset
  1109
        putchar(__intVal(__INST(asciivalue)));
cf5e42cb72ef do not overwrite the standard printing conventions
Claus Gittinger <cg@exept.de>
parents: 5566
diff changeset
  1110
        RETURN(self);
cf5e42cb72ef do not overwrite the standard printing conventions
Claus Gittinger <cg@exept.de>
parents: 5566
diff changeset
  1111
    }
5452
71fd110c347a allow print, printCR during early initialization
Claus Gittinger <cg@exept.de>
parents: 5433
diff changeset
  1112
%}.
71fd110c347a allow print, printCR during early initialization
Claus Gittinger <cg@exept.de>
parents: 5433
diff changeset
  1113
    super print
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1114
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1115
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1116
printOn:aStream
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1117
    "print myself on aStream"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1118
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1119
    aStream nextPut:self
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1120
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1121
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1122
printString
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1123
    "return a string to print me"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1124
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1125
    ^ self asString
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1126
!
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1127
2214
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
  1128
reconstructOn:aStream
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
  1129
    aStream nextPut:$$; nextPut:self
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
  1130
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
  1131
    "Created: 6.2.1996 / 09:46:12 / stefan"
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
  1132
    "Modified: 6.2.1996 / 10:57:35 / stefan"
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
  1133
!
2c73e6907cbc fixed #isNationalLetter
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
  1134
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1135
storeOn:aStream
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1136
    "store myself on aStream"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1137
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1138
    |special|
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1139
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1140
    (asciivalue between:33 and:127) ifFalse:[
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1141
	(self == Character space) ifTrue:[
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1142
	    special := 'space'
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1143
	] ifFalse:[
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1144
	    (self == Character cr) ifTrue:[
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1145
		special := 'cr'.
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1146
	    ] ifFalse:[
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1147
		(self == Character tab) ifTrue:[
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1148
		    special := 'tab'.
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1149
		] ifFalse:[
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1150
		    (self == Character esc) ifTrue:[
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1151
			special := 'esc'.
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1152
		    ]
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1153
		]
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1154
	    ]
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1155
	].
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1156
	special notNil ifTrue:[
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1157
	    aStream nextPutAll:'(Character '; nextPutAll:special; nextPut:$).
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1158
	    ^ self
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1159
	].
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1160
	aStream nextPutAll:'(Character value:16r'.
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1161
	asciivalue printOn:aStream base:16.
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1162
	aStream nextPut:$)
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1163
    ] ifTrue:[
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1164
	aStream nextPut:$$; nextPut:self
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1165
    ]
995
b018368b3a94 asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents: 819
diff changeset
  1166
3190
81ffb25d1d86 Use #printOn: instead of #printString
Stefan Vogel <sv@exept.de>
parents: 3072
diff changeset
  1167
    "Modified: / 23.2.1996 / 23:27:32 / cg"
81ffb25d1d86 Use #printOn: instead of #printString
Stefan Vogel <sv@exept.de>
parents: 3072
diff changeset
  1168
    "Modified: / 20.1.1998 / 14:10:46 / stefan"
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1169
! !
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1170
7257
b9f0fb923c72 method category rename
Claus Gittinger <cg@exept.de>
parents: 7250
diff changeset
  1171
!Character methodsFor:'private-accessing'!
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1172
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1173
setAsciiValue:anInteger
7897
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
  1174
    "very private - set the asciivalue / codePoint. 
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1175
     - use this only for characters with codes > 16rFF.
7897
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
  1176
     DANGER alert: 
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
  1177
        funny things happen, if this is applied to
08fd96f5c1d2 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 7799
diff changeset
  1178
        one of the shared characters with codePoints 0..255."
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1179
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1180
    asciivalue := anInteger
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1181
! !
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1182
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1183
!Character methodsFor:'testing'!
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1184
54
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
  1185
isCharacter
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
  1186
    "return true,  if the receiver is some kind of character"
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
  1187
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
  1188
    ^ true
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
  1189
!
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
  1190
3667
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1191
isControlCharacter
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1192
    "return true if I am a control character (i.e. ascii value < 32)"
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1193
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1194
%{  /* NOCONTEXT */
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1195
#ifdef NON_ASCII       /* i.e. EBCDIC ;-) */
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1196
    not yet implemented - fails when compiled
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1197
#else
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1198
    REGISTER int val;
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1199
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1200
    val = __intVal(__INST(asciivalue));
3881
e8fd1f890897 fixed isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3667
diff changeset
  1201
    if (val < ' ') {
5423
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1202
        RETURN ( true );
3667
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1203
    }
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1204
#endif
5423
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1205
    RETURN (false);
3667
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1206
%}.
5423
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1207
    ^ asciivalue < 16r20.
3667
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1208
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1209
    "
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1210
     (Character value:1) isControlCharacter
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1211
     $a isControlCharacter
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1212
    "
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1213
!
83b1cebac287 added #isControlCharacter
Claus Gittinger <cg@exept.de>
parents: 3190
diff changeset
  1214
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1215
isDigit
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1216
    "return true, if I am a digit (i.e. $0 .. $9)"
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1217
6527
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1218
%{  /*NOCONTEXT */
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1219
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1220
    REGISTER int val;
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1221
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1222
    val = __intVal(__INST(asciivalue));
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1223
    if ((unsigned)(val - '0') <= ('9' - '0')) {
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1224
        RETURN ( true );
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1225
    }
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1226
    RETURN ( false );
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1227
%}.
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1228
    ^ asciivalue between:$0 asciiValue and:$9 asciiValue
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1229
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1230
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1231
isDigitRadix:r
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1232
    "return true, if I am a digit of a base r number"
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1233
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1234
    (asciivalue < $0 asciiValue) ifTrue:[^ false]. 
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1235
    (r > 10) ifTrue:[
6527
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1236
        (asciivalue <= $9 asciiValue) ifTrue:[
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1237
            ^ true
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1238
        ].
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1239
        ((asciivalue - $a asciiValue) between:0 and:(r - 11)) ifTrue:[
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1240
            ^ true
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1241
        ].
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1242
        ^ (asciivalue - $A asciiValue) between:0 and:(r - 11)
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1243
    ].
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1244
    (asciivalue - $0 asciiValue) < r ifTrue:[^ true].
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1245
    ^ false
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1246
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1247
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1248
isEndOfLineCharacter
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1249
    "return true if I am a line delimitting character"
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1250
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1251
%{  /* NOCONTEXT */
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1252
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1253
    REGISTER int val;
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1254
1133
961f2b095c22 underline cleanup
Claus Gittinger <cg@exept.de>
parents: 1007
diff changeset
  1255
    val = __intVal(__INST(asciivalue));
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1256
    if ((val == '\n')
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1257
     || (val == '\r')
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1258
     || (val == '\f')) {
5423
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1259
        RETURN ( true );
54
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
  1260
    }
5423
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1261
    RETURN (false);
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1262
%}.
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1263
    ^ asciivalue == 16r0A
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1264
      or:[asciivalue == 16r0D
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1265
      or:[asciivalue == 16r0C]]
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1266
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1267
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1268
5473
c48d8c45c740 isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents: 5471
diff changeset
  1269
isImmediate
c48d8c45c740 isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents: 5471
diff changeset
  1270
    "return true if I am an immediate object
c48d8c45c740 isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents: 5471
diff changeset
  1271
     i.e. I am represented in the pointer itself and
c48d8c45c740 isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents: 5471
diff changeset
  1272
     no real object header/storage is used me.
c48d8c45c740 isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents: 5471
diff changeset
  1273
     For VW compatibility, shared characters (i.e. in the range 0..255) 
c48d8c45c740 isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents: 5471
diff changeset
  1274
     also return true here"
c48d8c45c740 isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents: 5471
diff changeset
  1275
c48d8c45c740 isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents: 5471
diff changeset
  1276
    ^ asciivalue <= 255
c48d8c45c740 isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents: 5471
diff changeset
  1277
c48d8c45c740 isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents: 5471
diff changeset
  1278
!
c48d8c45c740 isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents: 5471
diff changeset
  1279
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1280
isLetter
54
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
  1281
    "return true, if I am a letter 
760
7eeb5f12e7fe interest is written with one 'r' (shame on me)
Claus Gittinger <cg@exept.de>
parents: 700
diff changeset
  1282
     - use isNationalLetter, if you are interested in those."
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1283
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1284
%{  /*NOCONTEXT */
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1285
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1286
    REGISTER int val;
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1287
1133
961f2b095c22 underline cleanup
Claus Gittinger <cg@exept.de>
parents: 1007
diff changeset
  1288
    val = __intVal(__INST(asciivalue));
6527
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1289
    if ((unsigned)(val - 'a') <= ('z' - 'a')) {
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1290
        RETURN ( true );
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1291
    }
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1292
    if ((unsigned)(val - 'A') <= ('Z' - 'A')) {
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1293
        RETURN ( true );
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1294
    }
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1295
    RETURN ( false );
5423
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1296
%}.
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1297
    ^ (asciivalue between:($a asciiValue) and:($z asciiValue))
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1298
      or:[(asciivalue between:($A asciiValue) and:($Z asciiValue))]
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1299
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1300
154
d4236ec280a6 *** empty log message ***
claus
parents: 92
diff changeset
  1301
isLetterOrDigit
54
06dbdeeed4f9 *** empty log message ***
claus
parents: 37
diff changeset
  1302
    "return true, if I am a letter or a digit
760
7eeb5f12e7fe interest is written with one 'r' (shame on me)
Claus Gittinger <cg@exept.de>
parents: 700
diff changeset
  1303
     - use isNationalAlphaNumeric, if you are interested in those."
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1304
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1305
%{  /* NOCONTEXT */
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1306
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1307
    REGISTER int val;
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1308
1133
961f2b095c22 underline cleanup
Claus Gittinger <cg@exept.de>
parents: 1007
diff changeset
  1309
    val = __intVal(__INST(asciivalue));
6527
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1310
    if ((unsigned)(val - 'a') <= ('z' - 'a')) {
5423
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1311
        RETURN ( true );
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1312
    }
6527
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1313
    if ((unsigned)(val - 'A') <= ('Z' - 'A')) {
5423
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1314
        RETURN ( true );
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1315
    }
6527
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1316
    if ((unsigned)(val - '0') <= ('9' - '0')) {
5423
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1317
        RETURN ( true );
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1318
    }
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1319
    RETURN ( false );
5423
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1320
%}.
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1321
    ^ self isLetter or:[self isDigit]
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1322
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1323
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1324
isLowercase
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1325
    "return true, if I am a lower-case letter"
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1326
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1327
%{  /* NOCONTEXT */
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1328
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1329
    REGISTER int val;
154
d4236ec280a6 *** empty log message ***
claus
parents: 92
diff changeset
  1330
1133
961f2b095c22 underline cleanup
Claus Gittinger <cg@exept.de>
parents: 1007
diff changeset
  1331
    val = __intVal(__INST(asciivalue));
6527
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1332
    if ((unsigned)(val - 'a') <= ('z' - 'a')) {
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1333
        RETURN ( true );
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1334
    }
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1335
#ifndef OLD
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1336
    /* iso8859 puts national lower case characters at e0 .. ff */
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1337
    if ((val >= 0xE0) && (val <= 0xFF)) {
5423
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1338
        RETURN(true);
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1339
    }
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1340
#endif
6527
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1341
    RETURN ( false );
5423
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1342
%}.
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1343
    ^ (asciivalue between:($a asciiValue) and:($z asciiValue))
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1344
      or:[asciivalue between:16rE0 and:16rFF]
154
d4236ec280a6 *** empty log message ***
claus
parents: 92
diff changeset
  1345
!
d4236ec280a6 *** empty log message ***
claus
parents: 92
diff changeset
  1346
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1347
isPrintable
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1348
    "return true, if the receiver is a useful printable character
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1349
     (see fileBrowsers showFile:-method on how it can be used)"
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1350
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1351
    (asciivalue between:32 and:127) ifTrue:[^ true].
6398
1a32afa7b397 ff isPrintable -> true
Claus Gittinger <cg@exept.de>
parents: 6327
diff changeset
  1352
    asciivalue == 12 ifTrue:[^ true].   "/ FF
1a32afa7b397 ff isPrintable -> true
Claus Gittinger <cg@exept.de>
parents: 6327
diff changeset
  1353
    asciivalue == 13 ifTrue:[^ true].   "/ CR
1a32afa7b397 ff isPrintable -> true
Claus Gittinger <cg@exept.de>
parents: 6327
diff changeset
  1354
    asciivalue == 9 ifTrue:[^ true].    "/ TAB
1a32afa7b397 ff isPrintable -> true
Claus Gittinger <cg@exept.de>
parents: 6327
diff changeset
  1355
    asciivalue == 10 ifTrue:[^ true].   "/ NL
2840
493b24293589 isPrintable
Claus Gittinger <cg@exept.de>
parents: 2672
diff changeset
  1356
493b24293589 isPrintable
Claus Gittinger <cg@exept.de>
parents: 2672
diff changeset
  1357
    (asciivalue between:16rA0 and:16rBF) ifTrue:[^ true]. "/ ISO8859
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1358
    ^ self isNationalLetter
2840
493b24293589 isPrintable
Claus Gittinger <cg@exept.de>
parents: 2672
diff changeset
  1359
493b24293589 isPrintable
Claus Gittinger <cg@exept.de>
parents: 2672
diff changeset
  1360
    "Modified: 7.8.1997 / 17:05:24 / cg"
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1361
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1362
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1363
isSeparator
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1364
    "return true if I am a space, cr, tab, nl, or newPage"
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1365
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1366
%{  /* NOCONTEXT */
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1367
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1368
    REGISTER int val;
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1369
1133
961f2b095c22 underline cleanup
Claus Gittinger <cg@exept.de>
parents: 1007
diff changeset
  1370
    val = __intVal(__INST(asciivalue));
328
claus
parents: 314
diff changeset
  1371
#ifndef NON_ASCII       /* i.e. EBCDIC ;-) */
claus
parents: 314
diff changeset
  1372
    if (val <= ' ')
claus
parents: 314
diff changeset
  1373
#endif
5423
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1374
        if ((val == ' ')
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1375
         || (val == '\n') 
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1376
         || (val == '\t')
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1377
         || (val == '\r')
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1378
         || (val == '\f')) {
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1379
            RETURN ( true );
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1380
        }
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1381
    RETURN (false);
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1382
%}.                                  
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1383
    ^ (asciivalue == 16r20)
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1384
      or:[asciivalue == 16r0D
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1385
      or:[asciivalue == 16r0A
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1386
      or:[asciivalue == 16r09
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1387
      or:[asciivalue == 16r0C]]]]
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1388
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1389
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1390
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1391
isUppercase
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1392
    "return true, if I am an upper-case letter"
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1393
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1394
%{  /* NOCONTEXT */
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1395
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1396
    REGISTER int val;
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1397
1133
961f2b095c22 underline cleanup
Claus Gittinger <cg@exept.de>
parents: 1007
diff changeset
  1398
    val = __intVal(__INST(asciivalue));
6527
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1399
    if ((unsigned)(val - 'A') <= ('Z' - 'A')) {
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1400
        RETURN ( true );
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1401
    }
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1402
#ifndef OLD
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1403
    /* iso8859 puts national upper case characters at c0 .. df */
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1404
    if ((val >= 0xC0) && (val <= 0xDF)) {
5423
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1405
        RETURN(true);
314
7581a5c57224 *** empty log message ***
claus
parents: 302
diff changeset
  1406
    }
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1407
#endif
6527
e6528da50c43 isXXX tuned
Claus Gittinger <cg@exept.de>
parents: 6526
diff changeset
  1408
    RETURN ( false );
5423
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1409
%}.
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1410
    ^ (asciivalue between:($A asciiValue) and:($Z asciiValue))
e33decc83182 non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents: 5407
diff changeset
  1411
      or:[asciivalue between:16rC0 and:16rDF]
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1412
!
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1413
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1414
isVowel
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1415
    "return true, if I am a vowel (lower- or uppercase)"
333
claus
parents: 328
diff changeset
  1416
6066
ca508f7658dc comment
Claus Gittinger <cg@exept.de>
parents: 6029
diff changeset
  1417
    "/ I know the code is ugly;
ca508f7658dc comment
Claus Gittinger <cg@exept.de>
parents: 6029
diff changeset
  1418
    "/ better code is:
ca508f7658dc comment
Claus Gittinger <cg@exept.de>
parents: 6029
diff changeset
  1419
    "/     'aeiou' includes:self asLowercase
ca508f7658dc comment
Claus Gittinger <cg@exept.de>
parents: 6029
diff changeset
  1420
    "/ or:
ca508f7658dc comment
Claus Gittinger <cg@exept.de>
parents: 6029
diff changeset
  1421
    "/     'aeiouAEIOU' includes:self
ca508f7658dc comment
Claus Gittinger <cg@exept.de>
parents: 6029
diff changeset
  1422
    "/
ca508f7658dc comment
Claus Gittinger <cg@exept.de>
parents: 6029
diff changeset
  1423
    "/ until I have a smart compiler, I use the shorter (codewise):
ca508f7658dc comment
Claus Gittinger <cg@exept.de>
parents: 6029
diff changeset
  1424
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1425
    (self == $a) ifTrue:[^ true].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1426
    (self == $e) ifTrue:[^ true].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1427
    (self == $i) ifTrue:[^ true].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1428
    (self == $o) ifTrue:[^ true].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1429
    (self == $u) ifTrue:[^ true].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1430
    (self == $A) ifTrue:[^ true].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1431
    (self == $E) ifTrue:[^ true].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1432
    (self == $I) ifTrue:[^ true].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1433
    (self == $O) ifTrue:[^ true].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1434
    (self == $U) ifTrue:[^ true].
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1435
    ^ false
1
a27a279701f8 Initial revision
claus
parents:
diff changeset
  1436
! !
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1437
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1438
!Character methodsFor:'tracing'!
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1439
4682
4158042a9c8c *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 4655
diff changeset
  1440
traceInto:aRequestor level:level from:referrer
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1441
    "double dispatch into tracer, passing my type implicitely in the selector"
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1442
4682
4158042a9c8c *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 4655
diff changeset
  1443
    ^ aRequestor traceCharacter:self level:level from:referrer
4655
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1444
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1445
! !
b9405ca0bb4e added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents: 4340
diff changeset
  1446
2124
6238280f6120 new infoMessage scheme
Claus Gittinger <cg@exept.de>
parents: 1491
diff changeset
  1447
!Character class methodsFor:'documentation'!
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1448
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1449
version
7951
062ac899852e care for 4-byte characters
Claus Gittinger <cg@exept.de>
parents: 7897
diff changeset
  1450
    ^ '$Header: /cvs/stx/stx/libbasic/Character.st,v 1.93 2004-02-13 13:41:23 cg Exp $'
699
12f456343eea checkin from browser
Claus Gittinger <cg@exept.de>
parents: 544
diff changeset
  1451
! !