author | Claus Gittinger <cg@exept.de> |
Mon, 23 Feb 2004 14:27:27 +0100 | |
changeset 7988 | cb1c920e67eb |
parent 7987 | eec82f92ad08 |
child 7989 | 7907420b2fab |
permissions | -rw-r--r-- |
1 | 1 |
" |
5 | 2 |
COPYRIGHT (c) 1988 by Claus Gittinger |
154 | 3 |
All Rights Reserved |
1 | 4 |
|
5 |
This software is furnished under a license and may be used |
|
6 |
only in accordance with the terms of that license and with the |
|
7 |
inclusion of the above copyright notice. This software may not |
|
8 |
be provided or otherwise made available to, or used by, any |
|
9 |
other person. No title to or ownership of the software is |
|
10 |
hereby transferred. |
|
11 |
" |
|
12 |
||
5407 | 13 |
"{ Package: 'stx:libbasic' }" |
14 |
||
1 | 15 |
Magnitude subclass:#Character |
995
b018368b3a94
asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents:
819
diff
changeset
|
16 |
instanceVariableNames:'asciivalue' |
b018368b3a94
asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents:
819
diff
changeset
|
17 |
classVariableNames:'CharacterTable' |
b018368b3a94
asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents:
819
diff
changeset
|
18 |
poolDictionaries:'' |
b018368b3a94
asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents:
819
diff
changeset
|
19 |
category:'Magnitude-General' |
1 | 20 |
! |
21 |
||
2124 | 22 |
!Character class methodsFor:'documentation'! |
54 | 23 |
|
88 | 24 |
copyright |
25 |
" |
|
26 |
COPYRIGHT (c) 1988 by Claus Gittinger |
|
154 | 27 |
All Rights Reserved |
88 | 28 |
|
29 |
This software is furnished under a license and may be used |
|
30 |
only in accordance with the terms of that license and with the |
|
31 |
inclusion of the above copyright notice. This software may not |
|
32 |
be provided or otherwise made available to, or used by, any |
|
33 |
other person. No title to or ownership of the software is |
|
34 |
hereby transferred. |
|
35 |
" |
|
36 |
! |
|
37 |
||
54 | 38 |
documentation |
39 |
" |
|
1491
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
40 |
This class represents characters. |
7897 | 41 |
|
42 |
For now, unicode characters can be created and also put-into/taken from |
|
43 |
Unicode strings, but the attribute queries (such as isLetter, isDigit etc. are |
|
44 |
not yet unicode aware). |
|
45 |
||
1491
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
46 |
Notice, that actual character objects are not used when characters |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
47 |
are stored in strings, symbols or twoByteStrings; these only store |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
48 |
a characters asciiValue for a more compact representation. |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
49 |
The word 'asciiValue' is a historic leftover - actually, any integer |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
50 |
code is allowed (i.e. characters are not limited to 8bit). |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
51 |
|
1229 | 52 |
Single byte Characters are unique; |
1491
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
53 |
i.e. for every asciiValue (0..255) there exists exactly one instance of |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
54 |
Character, which is shared (Character value:xxx checks for this, and returns |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
55 |
a reference to an existing instance). |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
56 |
Other characters (i.e. asciivalue > 255) are not shared; i.e. these |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
57 |
are created as required. |
1 | 58 |
|
1229 | 59 |
This means: you may compare characters using #== ONLY IFF you are certain, |
357 | 60 |
that the characters ranges is 0..255. Otherwise, you HAVE TO compare |
61 |
using #=. (if in doubt, always compare using #=). |
|
1491
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
62 |
Sorry for this inconvenience, but it is (practically) impossible to keep |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
63 |
the possible maximum of 2^32 characters (Unicode) around, for that |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
64 |
convenience alone. |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
65 |
|
357 | 66 |
|
68 | 67 |
Methods marked as (JS) come from the manchester Character goody |
68 |
(CharacterComparing) by Jan Steinman, which allow Characters to be used as |
|
1229 | 69 |
Interval elements (i.e. ($a to:$z) do:[...] ); |
70 |
They are not a big deal, but convenient add-ons. |
|
71 |
Some of these have been modified a bit. |
|
1 | 72 |
|
68 | 73 |
WARNING: characters are known by compiler and runtime system - |
7897 | 74 |
do not change the instance layout. |
357 | 75 |
|
76 |
Also, although you can create subclasses of Character, the compiler always |
|
77 |
creates instances of Character for literals ... |
|
814
d4d28ca7afcd
made the global CharacterTable a classVar of Character
Claus Gittinger <cg@exept.de>
parents:
760
diff
changeset
|
78 |
... and other classes are hard-wired to always return instances of characters |
357 | 79 |
in some cases (i.e. String>>at:, Symbol>>at: etc.). |
80 |
Therefore, it may not make sense to create a character-subclass. |
|
1295 | 81 |
|
82 |
[author:] |
|
7897 | 83 |
Claus Gittinger |
1295 | 84 |
|
85 |
[see also:] |
|
7897 | 86 |
String TwoByteString |
87 |
StringCollection |
|
54 | 88 |
" |
89 |
! ! |
|
1 | 90 |
|
2124 | 91 |
!Character class methodsFor:'instance creation'! |
1 | 92 |
|
93 |
basicNew |
|
94 |
"catch new - Characters cannot be created with new" |
|
95 |
||
96 |
^ self error:'Characters cannot be created with new' |
|
97 |
! |
|
98 |
||
699 | 99 |
digitValue:anInteger |
100 |
"return a character that corresponds to anInteger. |
|
101 |
0-9 map to $0-$9, 10-35 map to $A-$Z" |
|
102 |
||
103 |
|val "{ Class: SmallInteger }" | |
|
104 |
||
105 |
val := anInteger. |
|
106 |
(val between:0 and:9) ifTrue:[ |
|
107 |
^ Character value:(val + ($0 asciiValue)) |
|
108 |
]. |
|
109 |
(val between:10 and:35) ifTrue:[ |
|
110 |
^ Character value:(val + ($A asciiValue - 10)) |
|
111 |
]. |
|
112 |
^self error:'value not in range 0 to 35' |
|
113 |
! |
|
114 |
||
6808 | 115 |
utf8DecodeFrom:aStream |
116 |
"read and return a single unicode character from an UTF8 encoded stream" |
|
117 |
||
118 |
|fetchNext c1 c2 c3 c4 c5 ascii| |
|
119 |
||
120 |
c1 := aStream next. |
|
121 |
ascii := c1 asciiValue. |
|
122 |
ascii <= 16r7F ifTrue:[ |
|
123 |
"/ 0xxxxxxx - 7 bits |
|
124 |
^ c1. |
|
125 |
]. |
|
126 |
||
127 |
(ascii bitAnd:2r11000000) == 2r10000000 ifTrue:[ |
|
128 |
"/ out of sync (got an intermediate character) |
|
129 |
self error:'out of sync'. |
|
130 |
^ c1. |
|
131 |
]. |
|
132 |
||
133 |
fetchNext := [ |ch| |
|
134 |
||
135 |
ch := aStream next. |
|
136 |
(ch asciiValue bitAnd:2r11000000) == 2r10000000 ifFalse:[ |
|
137 |
"/ followup chars must have 2r10 in high bits |
|
138 |
self error:'bad encoding'. |
|
139 |
]. |
|
140 |
ch |
|
141 |
]. |
|
142 |
||
143 |
(ascii bitAnd:2r11100000) == 2r11000000 ifTrue:[ |
|
144 |
"/ 110xxxxx 10xxxxxx - 11 bits |
|
145 |
c2 := fetchNext value. |
|
146 |
ascii := c1 asciiValue bitAnd:16r1F. |
|
147 |
ascii := (ascii bitShift:6) bitOr:(c2 asciiValue bitAnd:16r3F). |
|
6811
2fec3f732ac6
catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents:
6808
diff
changeset
|
148 |
ascii <= 16r7F ifTrue:[ |
2fec3f732ac6
catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents:
6808
diff
changeset
|
149 |
self error:'invalid encoding'. |
2fec3f732ac6
catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents:
6808
diff
changeset
|
150 |
]. |
6808 | 151 |
^ Character value:ascii |
152 |
]. |
|
153 |
(ascii bitAnd:2r11110000) == 2r11100000 ifTrue:[ |
|
154 |
"/ 1110xxxx 10xxxxxx 10xxxxxx - 16 bits |
|
155 |
c2 := fetchNext value. |
|
156 |
c3 := fetchNext value. |
|
157 |
ascii := c1 asciiValue bitAnd:16r0F. |
|
158 |
ascii := (ascii bitShift:6) bitOr:(c2 asciiValue bitAnd:16r3F). |
|
159 |
ascii := (ascii bitShift:6) bitOr:(c3 asciiValue bitAnd:16r3F). |
|
6811
2fec3f732ac6
catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents:
6808
diff
changeset
|
160 |
ascii <= 16r7FF ifTrue:[ |
2fec3f732ac6
catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents:
6808
diff
changeset
|
161 |
self error:'invalid encoding'. |
2fec3f732ac6
catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents:
6808
diff
changeset
|
162 |
]. |
6808 | 163 |
^ Character value:ascii |
164 |
]. |
|
165 |
||
166 |
"/ notice: currently, characters can only have 16bit encoding; |
|
167 |
"/ therefore the following will raise a runtime exception, |
|
168 |
||
169 |
(ascii bitAnd:2r11111000) == 2r11110000 ifTrue:[ |
|
170 |
"/ 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx - 21 bits |
|
171 |
c2 := fetchNext value. |
|
172 |
c3 := fetchNext value. |
|
173 |
c4 := fetchNext value. |
|
174 |
ascii := c1 asciiValue bitAnd:16r07. |
|
175 |
ascii := (ascii bitShift:6) bitOr:(c2 asciiValue bitAnd:16r3F). |
|
176 |
ascii := (ascii bitShift:6) bitOr:(c3 asciiValue bitAnd:16r3F). |
|
177 |
ascii := (ascii bitShift:6) bitOr:(c4 asciiValue bitAnd:16r3F). |
|
6811
2fec3f732ac6
catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents:
6808
diff
changeset
|
178 |
ascii <= 16rFFFF ifTrue:[ |
2fec3f732ac6
catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents:
6808
diff
changeset
|
179 |
self error:'invalid encoding'. |
2fec3f732ac6
catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents:
6808
diff
changeset
|
180 |
]. |
6808 | 181 |
^ Character value:ascii |
182 |
]. |
|
183 |
||
184 |
(ascii bitAnd:2r11111100) == 2r11111000 ifTrue:[ |
|
185 |
"/ 111110xx 10xxxxxx 10xxxxxx 10xxxxxx 10xxxxxx - 26 bits |
|
186 |
c2 := fetchNext value. |
|
187 |
c3 := fetchNext value. |
|
188 |
c4 := fetchNext value. |
|
189 |
c5 := fetchNext value. |
|
190 |
ascii := c1 asciiValue bitAnd:16r03. |
|
191 |
ascii := (ascii bitShift:6) bitOr:(c2 asciiValue bitAnd:16r3F). |
|
192 |
ascii := (ascii bitShift:6) bitOr:(c3 asciiValue bitAnd:16r3F). |
|
193 |
ascii := (ascii bitShift:6) bitOr:(c4 asciiValue bitAnd:16r3F). |
|
194 |
ascii := (ascii bitShift:6) bitOr:(c5 asciiValue bitAnd:16r3F). |
|
6811
2fec3f732ac6
catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents:
6808
diff
changeset
|
195 |
ascii <= 16r1FFFFF ifTrue:[ |
2fec3f732ac6
catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents:
6808
diff
changeset
|
196 |
self error:'invalid encoding'. |
2fec3f732ac6
catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents:
6808
diff
changeset
|
197 |
]. |
6808 | 198 |
^ Character value:ascii |
199 |
]. |
|
200 |
||
201 |
(ascii bitAnd:2r11111110) == 2r11111100 ifTrue:[ |
|
202 |
"/ 1111110x ... 10xxxxxx - any number of bits |
|
203 |
ascii := c1 asciiValue bitAnd:16r01. |
|
204 |
||
205 |
c2 := aStream peek. |
|
206 |
[ (c2 asciiValue bitAnd:2r11000000) == 2r10000000 ] whileTrue:[ |
|
207 |
ascii := (ascii bitShift:6) bitOr:(c2 asciiValue bitAnd:16r3F). |
|
208 |
aStream next. |
|
209 |
]. |
|
6811
2fec3f732ac6
catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents:
6808
diff
changeset
|
210 |
ascii <= 16r3FFFFFF ifTrue:[ |
2fec3f732ac6
catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents:
6808
diff
changeset
|
211 |
self error:'invalid encoding'. |
2fec3f732ac6
catch invalid utf8 encodings
Claus Gittinger <cg@exept.de>
parents:
6808
diff
changeset
|
212 |
]. |
6808 | 213 |
^ Character value:ascii |
214 |
]. |
|
215 |
||
216 |
self error:'bad encoding'. |
|
217 |
^ c1 |
|
218 |
||
219 |
" |
|
220 |
Character utf8DecodeFrom:'a' readStream |
|
221 |
Character utf8DecodeFrom:#[195 188] asString readStream |
|
222 |
" |
|
223 |
||
224 |
"test: |
|
225 |
||
226 |
|utf8Encoding original readBack| |
|
227 |
||
228 |
1 to:16rFFFF do:[:ascii | |
|
229 |
original := Character value:ascii. |
|
230 |
utf8Encoding := original asString utf8Encoded. |
|
231 |
readBack := Character utf8DecodeFrom:(utf8Encoding readStream). |
|
232 |
readBack asciiValue = ascii ifFalse:[ |
|
233 |
self halt |
|
234 |
] |
|
235 |
] |
|
236 |
" |
|
237 |
! |
|
238 |
||
1 | 239 |
value:anInteger |
240 |
"return a character with asciivalue anInteger" |
|
241 |
||
242 |
%{ /* NOCONTEXT */ |
|
243 |
||
54 | 244 |
INT ascii; |
1 | 245 |
|
249 | 246 |
if (__isSmallInteger(anInteger)) { |
7951 | 247 |
ascii = __intVal(anInteger); |
248 |
if ((ascii & ~0xFF) == 0 /* (ascii >= 0) && (ascii <= 255) */) |
|
249 |
RETURN ( __MKCHARACTER(ascii) ); |
|
1 | 250 |
} |
819 | 251 |
%}. |
1 | 252 |
(anInteger between:0 and:16rFF) ifTrue:[ |
7951 | 253 |
^ CharacterTable at:(anInteger + 1) |
1 | 254 |
]. |
7951 | 255 |
(anInteger between:16r100 and:16r3FFFFFFF) ifTrue:[ |
256 |
^ super basicNew setAsciiValue:anInteger |
|
1 | 257 |
]. |
357 | 258 |
" |
7951 | 259 |
a characters ascii-code must be 0..16r3FFFFFFF. |
260 |
(i.e. only characters with up-to 30 bits are allowed.) |
|
357 | 261 |
" |
1 | 262 |
self error:'invalid ascii code for character' |
699 | 263 |
! ! |
264 |
||
7300 | 265 |
!Character class methodsFor:'Compatibility-ANSI'! |
5945 | 266 |
|
267 |
codePoint:asciiValue |
|
268 |
^ self value:asciiValue |
|
269 |
! ! |
|
270 |
||
2214 | 271 |
!Character class methodsFor:'accessing untypeable characters'! |
272 |
||
273 |
endOfInput |
|
274 |
"Answer the Character representing ctrl-d ." |
|
275 |
||
276 |
^self value: 4 |
|
277 |
! |
|
278 |
||
279 |
leftParenthesis |
|
280 |
"Answer the Character representing a left parenthesis." |
|
281 |
||
282 |
^self value: 40 |
|
283 |
! |
|
284 |
||
285 |
period |
|
286 |
"Answer the Character representing a carriage period." |
|
287 |
||
288 |
^self value: 46 |
|
289 |
! |
|
290 |
||
291 |
poundSign |
|
292 |
"Answer the Character representing a pound sign." |
|
293 |
||
294 |
^self value: 35 |
|
295 |
! |
|
296 |
||
297 |
rightParenthesis |
|
298 |
"Answer the Character representing a right parenthesis." |
|
299 |
||
300 |
^self value: 41 |
|
301 |
! ! |
|
302 |
||
2124 | 303 |
!Character class methodsFor:'constants'! |
699 | 304 |
|
305 |
backspace |
|
306 |
"return the backspace character" |
|
307 |
||
308 |
^ Character value:8 |
|
309 |
! |
|
310 |
||
311 |
bell |
|
312 |
"return the bell character" |
|
313 |
||
314 |
^ Character value:7 |
|
315 |
! |
|
316 |
||
317 |
cr |
|
318 |
"return the lineEnd character |
|
319 |
- actually (in unix) this is a newline character" |
|
320 |
||
321 |
^ Character value:10 |
|
322 |
! |
|
323 |
||
324 |
del |
|
325 |
"return the delete character" |
|
326 |
||
327 |
^ Character value:16r7F |
|
328 |
! |
|
329 |
||
330 |
doubleQuote |
|
331 |
"return the double-quote character" |
|
332 |
||
333 |
^ Character value:34 |
|
334 |
! |
|
335 |
||
336 |
esc |
|
337 |
"return the escape character" |
|
338 |
||
339 |
^ Character value:27 |
|
340 |
! |
|
341 |
||
7688 | 342 |
euro |
7689 | 343 |
"The Euro currency sign (notice: not all fonts support it). |
344 |
Thats the ascii-character code; the Unicode encoding is 0x8364" |
|
7688 | 345 |
|
7976 | 346 |
^ Character value: 16r20AC |
7689 | 347 |
|
348 |
" |
|
7976 | 349 |
Transcript showCR:Character euro asString |
350 |
" |
|
351 |
" |
|
7689 | 352 |
0 to:255 do:[:i | |
353 |
Transcript |
|
354 |
show:'| '; show:((i printStringRadix:16) leftPaddedTo:2); |
|
355 |
show:' | '; show:(i printStringPaddedTo:3); |
|
356 |
show:' | '; show:(Character value:i); |
|
357 |
cr. |
|
358 |
] |
|
359 |
" |
|
7688 | 360 |
! |
361 |
||
699 | 362 |
excla |
363 |
"return the exclamation-mark character" |
|
364 |
^ $!! |
|
1 | 365 |
! |
366 |
||
699 | 367 |
ff |
368 |
"return the form-feed character" |
|
369 |
||
370 |
^ Character value:12 |
|
371 |
! |
|
372 |
||
373 |
lf |
|
374 |
"return the newline/linefeed character" |
|
375 |
||
376 |
^ Character value:10 |
|
377 |
! |
|
1 | 378 |
|
4340
523ef8410fad
added #linefeed - squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4337
diff
changeset
|
379 |
linefeed |
523ef8410fad
added #linefeed - squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4337
diff
changeset
|
380 |
"squeak compatibility: return the newline/linefeed character" |
523ef8410fad
added #linefeed - squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4337
diff
changeset
|
381 |
|
523ef8410fad
added #linefeed - squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4337
diff
changeset
|
382 |
^ Character value:10 |
523ef8410fad
added #linefeed - squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4337
diff
changeset
|
383 |
! |
523ef8410fad
added #linefeed - squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4337
diff
changeset
|
384 |
|
699 | 385 |
maxValue |
386 |
"return the maximum asciiValue a character can have" |
|
387 |
||
388 |
^ 16rFFFF |
|
389 |
! |
|
390 |
||
391 |
newPage |
|
392 |
"return the form-feed character" |
|
393 |
||
394 |
^ Character value:12 |
|
395 |
! |
|
396 |
||
397 |
nl |
|
398 |
"return the newline character" |
|
68 | 399 |
|
699 | 400 |
^ Character value:10 |
401 |
! |
|
402 |
||
6324 | 403 |
null |
404 |
^ Character value:0 |
|
405 |
! |
|
406 |
||
699 | 407 |
quote |
408 |
"return the single-quote character" |
|
409 |
||
410 |
^ Character value:39 |
|
411 |
! |
|
412 |
||
413 |
return |
|
414 |
"return the return character. |
|
415 |
In ST/X, this is different from cr - for Unix reasons." |
|
416 |
||
417 |
^ Character value:13 |
|
418 |
! |
|
419 |
||
420 |
space |
|
421 |
"return the blank character" |
|
422 |
||
423 |
^ Character value:32 |
|
424 |
! |
|
425 |
||
426 |
tab |
|
427 |
"return the tabulator character" |
|
428 |
||
429 |
^ Character value:9 |
|
1 | 430 |
! ! |
431 |
||
2124 | 432 |
!Character class methodsFor:'primitive input'! |
1 | 433 |
|
434 |
fromUser |
|
357 | 435 |
"return a character from the keyboard (C's standard input stream) |
1 | 436 |
- this should only be used for emergency evaluators and the like." |
437 |
||
438 |
%{ /* NOCONTEXT */ |
|
439 |
int c; |
|
440 |
||
441 |
c = getchar(); |
|
442 |
if (c < 0) { |
|
5433 | 443 |
RETURN (nil); |
1 | 444 |
} |
1133 | 445 |
RETURN ( __MKCHARACTER(c & 0xFF) ); |
5433 | 446 |
%}. |
447 |
^ Stdin next |
|
1 | 448 |
! ! |
449 |
||
2124 | 450 |
!Character class methodsFor:'queries'! |
3 | 451 |
|
4337
07fad5b7af9b
added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4037
diff
changeset
|
452 |
allCharacters |
07fad5b7af9b
added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4037
diff
changeset
|
453 |
"added for squeak compatibility: return a collection of all chars" |
07fad5b7af9b
added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4037
diff
changeset
|
454 |
|
07fad5b7af9b
added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4037
diff
changeset
|
455 |
^ CharacterTable |
07fad5b7af9b
added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4037
diff
changeset
|
456 |
|
07fad5b7af9b
added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4037
diff
changeset
|
457 |
" |
07fad5b7af9b
added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4037
diff
changeset
|
458 |
Character allCharacters |
07fad5b7af9b
added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4037
diff
changeset
|
459 |
" |
07fad5b7af9b
added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4037
diff
changeset
|
460 |
! |
07fad5b7af9b
added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4037
diff
changeset
|
461 |
|
4655
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
462 |
hasSharedInstances |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
463 |
"return true if this class has shared instances, that is, instances |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
464 |
with the same value are identical. |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
465 |
False is returned here, only redefined in classes which have unified |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
466 |
instances (or should be treated so)." |
2672
dc3662188b2c
added #hasImmediateInstances for VW compatibility
Claus Gittinger <cg@exept.de>
parents:
2561
diff
changeset
|
467 |
|
4655
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
468 |
^ true |
2672
dc3662188b2c
added #hasImmediateInstances for VW compatibility
Claus Gittinger <cg@exept.de>
parents:
2561
diff
changeset
|
469 |
|
4655
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
470 |
|
2672
dc3662188b2c
added #hasImmediateInstances for VW compatibility
Claus Gittinger <cg@exept.de>
parents:
2561
diff
changeset
|
471 |
! |
dc3662188b2c
added #hasImmediateInstances for VW compatibility
Claus Gittinger <cg@exept.de>
parents:
2561
diff
changeset
|
472 |
|
3 | 473 |
isBuiltInClass |
1271 | 474 |
"return true if this class is known by the run-time-system. |
475 |
Here, true is returned for myself, false for subclasses." |
|
3 | 476 |
|
477 |
^ self == Character |
|
1271 | 478 |
|
479 |
"Modified: 23.4.1996 / 15:56:39 / cg" |
|
4337
07fad5b7af9b
added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4037
diff
changeset
|
480 |
! |
07fad5b7af9b
added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4037
diff
changeset
|
481 |
|
07fad5b7af9b
added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4037
diff
changeset
|
482 |
separators |
07fad5b7af9b
added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4037
diff
changeset
|
483 |
"added for squeak compatibility: return a collection of separator chars" |
07fad5b7af9b
added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4037
diff
changeset
|
484 |
|
07fad5b7af9b
added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4037
diff
changeset
|
485 |
^ Array |
7019 | 486 |
with:Character space |
487 |
with:Character return |
|
488 |
"/ with:Character cr |
|
489 |
with:Character tab |
|
490 |
with:Character lf |
|
491 |
with:Character ff |
|
4337
07fad5b7af9b
added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4037
diff
changeset
|
492 |
|
07fad5b7af9b
added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4037
diff
changeset
|
493 |
" |
07fad5b7af9b
added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4037
diff
changeset
|
494 |
Character separators |
07fad5b7af9b
added #allCharacters & #separators for Squeak compatibility
Claus Gittinger <cg@exept.de>
parents:
4037
diff
changeset
|
495 |
" |
3 | 496 |
! ! |
497 |
||
7261 | 498 |
!Character methodsFor:'Compatibility-ANSI'! |
5945 | 499 |
|
500 |
codePoint |
|
7897 | 501 |
"return the codePoint of myself. |
502 |
Traditionally, this was named 'asciiValue'; |
|
503 |
however, characters are not limited to 8bit characters." |
|
504 |
||
505 |
^ asciivalue |
|
5945 | 506 |
! ! |
5847 | 507 |
|
7261 | 508 |
!Character methodsFor:'Compatibility-Dolphin'! |
6324 | 509 |
|
7351
1f805a32d551
comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents:
7300
diff
changeset
|
510 |
isAlphaNumeric |
1f805a32d551
comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents:
7300
diff
changeset
|
511 |
"return true, if I am a letter or a digit |
1f805a32d551
comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents:
7300
diff
changeset
|
512 |
Please use isLetterOrDigit for compatibility reasons (which is ANSI)." |
1f805a32d551
comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents:
7300
diff
changeset
|
513 |
|
1f805a32d551
comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents:
7300
diff
changeset
|
514 |
^ self isLetterOrDigit |
1f805a32d551
comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents:
7300
diff
changeset
|
515 |
! |
1f805a32d551
comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents:
7300
diff
changeset
|
516 |
|
1f805a32d551
comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents:
7300
diff
changeset
|
517 |
isAlphabetic |
1f805a32d551
comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents:
7300
diff
changeset
|
518 |
"return true, if I am a letter. |
1f805a32d551
comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents:
7300
diff
changeset
|
519 |
Please use isLetter for compatibility reasons (which is ANSI)." |
1f805a32d551
comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents:
7300
diff
changeset
|
520 |
|
1f805a32d551
comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents:
7300
diff
changeset
|
521 |
^ self isLetter |
1f805a32d551
comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents:
7300
diff
changeset
|
522 |
! |
1f805a32d551
comments in isAlphaNumeric and isAlphabetic
Claus Gittinger <cg@exept.de>
parents:
7300
diff
changeset
|
523 |
|
6324 | 524 |
isControl |
7353 | 525 |
"return true if I am a control character (i.e. ascii value < 32)" |
526 |
||
527 |
^ self isControlCharacter |
|
6324 | 528 |
! |
529 |
||
530 |
isHexDigit |
|
7354 | 531 |
"return true if I am a valid hexadecimal digit" |
532 |
||
6324 | 533 |
^ '0123456789abcdefABCDEF' includes:self |
7354 | 534 |
|
535 |
" |
|
536 |
$a isHexDigit |
|
537 |
" |
|
6324 | 538 |
! |
539 |
||
540 |
isPunctuation |
|
7897 | 541 |
"the code below is not unicode aware" |
542 |
||
6327 | 543 |
^ (asciivalue between:16r21 and:16r40) |
544 |
or:[ (asciivalue between:16r5B and:16r60) |
|
545 |
or:[ (asciivalue between:123 and:126) |
|
546 |
or:[ (asciivalue between:161 and:191) |
|
547 |
or:[ (asciivalue == 215 ) |
|
548 |
or:[ (asciivalue == 247 ) ]]]]] |
|
6324 | 549 |
! ! |
550 |
||
1 | 551 |
!Character methodsFor:'accessing'! |
552 |
||
553 |
asciiValue |
|
54 | 554 |
"return the asciivalue of myself. |
7897 | 555 |
The name 'asciiValue' is a historic leftover: |
556 |
characters are not limited to 8bit characters. |
|
557 |
So the actual value returned is a codePoint (i.e. full potential for 31bit encoding). |
|
558 |
PP has removed this method with 4.1 and providing asInteger instead." |
|
1 | 559 |
|
7897 | 560 |
^ asciivalue |
1491
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
561 |
|
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
562 |
"Modified: 27.6.1996 / 12:34:34 / cg" |
1 | 563 |
! |
564 |
||
565 |
instVarAt:index put:anObject |
|
54 | 566 |
"catch instvar access - asciivalue may not be changed" |
1 | 567 |
|
568 |
self error:'Characters may not be modified' |
|
2214 | 569 |
! |
570 |
||
571 |
newDigitValue |
|
572 |
"Answer 0-9 if the receiver is $0-$9, 10-35 if it is $A-$Z, and < 0 otherwise. |
|
573 |
This is used to parse literal numbers of radix 2-36." |
|
574 |
||
575 |
| code value | |
|
576 |
code := self asInteger. |
|
577 |
value := code - $0 asInteger. |
|
578 |
value <= 9 ifTrue: [^value]. |
|
579 |
value := code - $A asInteger. |
|
580 |
(value >= 0 and: [value < 26]) |
|
581 |
ifTrue: [^value + 10]. |
|
582 |
^-1 |
|
1 | 583 |
! ! |
584 |
||
699 | 585 |
!Character methodsFor:'arithmetic'! |
1 | 586 |
|
587 |
+ aMagnitude |
|
588 |
"Return the Character that is <aMagnitude> higher than the receiver. |
|
589 |
Wrap if the resulting value is not a legal Character value. (JS)" |
|
590 |
||
6531
8798c9a1d1e6
arithmetic wraps at 16bit bounds (unicode)
Claus Gittinger <cg@exept.de>
parents:
6527
diff
changeset
|
591 |
^ Character value:(asciivalue + aMagnitude asInteger \\ 16rFFFF) |
1491
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
592 |
|
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
593 |
" |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
594 |
$A + 5 |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
595 |
" |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
596 |
|
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
597 |
"Modified: 27.6.1996 / 12:34:51 / cg" |
1 | 598 |
! |
599 |
||
600 |
- aMagnitude |
|
601 |
"Return the Character that is <aMagnitude> lower than the receiver. |
|
68 | 602 |
Wrap if the resulting value is not a legal Character value. (JS) |
1491
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
603 |
claus: |
6531
8798c9a1d1e6
arithmetic wraps at 16bit bounds (unicode)
Claus Gittinger <cg@exept.de>
parents:
6527
diff
changeset
|
604 |
modified to return the difference as integer, if the argument |
8798c9a1d1e6
arithmetic wraps at 16bit bounds (unicode)
Claus Gittinger <cg@exept.de>
parents:
6527
diff
changeset
|
605 |
is another character. If the argument is a number, a character is |
8798c9a1d1e6
arithmetic wraps at 16bit bounds (unicode)
Claus Gittinger <cg@exept.de>
parents:
6527
diff
changeset
|
606 |
returned." |
1 | 607 |
|
68 | 608 |
aMagnitude isCharacter ifTrue:[ |
6531
8798c9a1d1e6
arithmetic wraps at 16bit bounds (unicode)
Claus Gittinger <cg@exept.de>
parents:
6527
diff
changeset
|
609 |
^ self asciiValue - aMagnitude asciiValue |
68 | 610 |
]. |
6531
8798c9a1d1e6
arithmetic wraps at 16bit bounds (unicode)
Claus Gittinger <cg@exept.de>
parents:
6527
diff
changeset
|
611 |
^ Character value:(asciivalue - aMagnitude asInteger \\ 16rFFFF) |
68 | 612 |
|
613 |
" |
|
614 |
$z - $a |
|
615 |
$d - 3 |
|
616 |
" |
|
1491
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
617 |
|
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
618 |
"Modified: 27.6.1996 / 12:35:34 / cg" |
1 | 619 |
! |
620 |
||
621 |
// aMagnitude |
|
622 |
"Return the Character who's value is the receiver divided by <aMagnitude>. |
|
623 |
Wrap if the resulting value is not a legal Character value. (JS)" |
|
624 |
||
6531
8798c9a1d1e6
arithmetic wraps at 16bit bounds (unicode)
Claus Gittinger <cg@exept.de>
parents:
6527
diff
changeset
|
625 |
^ Character value:(asciivalue // aMagnitude asInteger \\ 16rFFFF) |
1 | 626 |
! |
627 |
||
628 |
\\ aMagnitude |
|
629 |
"Return the Character who's value is the receiver modulo <aMagnitude>. |
|
630 |
Wrap if the resulting value is not a legal Character value. (JS)" |
|
631 |
||
6531
8798c9a1d1e6
arithmetic wraps at 16bit bounds (unicode)
Claus Gittinger <cg@exept.de>
parents:
6527
diff
changeset
|
632 |
^ Character value:(asciivalue \\ aMagnitude asInteger \\ 16rFFFF) |
1 | 633 |
! ! |
634 |
||
699 | 635 |
!Character methodsFor:'binary storage'! |
636 |
||
637 |
hasSpecialBinaryRepresentation |
|
638 |
"return true, if the receiver has a special binary representation" |
|
639 |
||
640 |
^ true |
|
641 |
! |
|
642 |
||
643 |
storeBinaryOn:stream manager:manager |
|
644 |
"store a binary representation of the receiver on stream; |
|
645 |
redefined, since single-byte characters are stored more compact |
|
646 |
with a special type-code followed by the asciiValue." |
|
647 |
||
648 |
(asciivalue < 256) ifTrue:[ |
|
4655
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
649 |
stream nextPut:manager codeForCharacter; nextPut:asciivalue. |
699 | 650 |
] ifFalse:[ |
4655
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
651 |
stream nextPut:manager codeForTwoByteCharacter. |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
652 |
stream nextPutShort:asciivalue MSB:true |
699 | 653 |
] |
654 |
! ! |
|
655 |
||
656 |
!Character methodsFor:'comparing'! |
|
657 |
||
658 |
< aCharacter |
|
7799 | 659 |
"return true, if the arguments asciiValue is greater than the receiver's" |
699 | 660 |
|
5315
04052c5626bf
allow compare against integers (asciiValue compare).
Claus Gittinger <cg@exept.de>
parents:
4728
diff
changeset
|
661 |
^ (asciivalue < aCharacter asInteger) |
699 | 662 |
! |
663 |
||
664 |
<= aCharacter |
|
7799 | 665 |
"return true, if the arguments asciiValue is greater or equal to the receiver's" |
699 | 666 |
|
5315
04052c5626bf
allow compare against integers (asciiValue compare).
Claus Gittinger <cg@exept.de>
parents:
4728
diff
changeset
|
667 |
^ (asciivalue <= aCharacter asInteger) |
699 | 668 |
! |
669 |
||
670 |
= aCharacter |
|
671 |
"return true, if the argument, aCharacter is the same character |
|
672 |
Redefined to take care of 16bit characters." |
|
673 |
||
674 |
self == aCharacter ifTrue:[^ true]. |
|
675 |
aCharacter isCharacter ifFalse:[^ false]. |
|
676 |
^ (asciivalue = aCharacter asciiValue) |
|
677 |
! |
|
678 |
||
679 |
> aCharacter |
|
7799 | 680 |
"return true, if the arguments asciiValue is less than the receiver's" |
699 | 681 |
|
5315
04052c5626bf
allow compare against integers (asciiValue compare).
Claus Gittinger <cg@exept.de>
parents:
4728
diff
changeset
|
682 |
^ (asciivalue > aCharacter asInteger) |
699 | 683 |
! |
684 |
||
685 |
>= aCharacter |
|
7799 | 686 |
"return true, if the arguments asciiValue is less or equal to the receiver's" |
699 | 687 |
|
5315
04052c5626bf
allow compare against integers (asciiValue compare).
Claus Gittinger <cg@exept.de>
parents:
4728
diff
changeset
|
688 |
^ (asciivalue >= aCharacter asInteger) |
699 | 689 |
! |
690 |
||
5540 | 691 |
hash |
692 |
"return an integer useful for hashing" |
|
693 |
||
694 |
^ asciivalue |
|
695 |
! |
|
696 |
||
699 | 697 |
identityHash |
698 |
"return an integer useful for hashing on identity" |
|
699 |
||
700 |
asciivalue <= 255 ifTrue:[ |
|
701 |
^ 4096 + asciivalue |
|
702 |
]. |
|
703 |
^ super identityHash |
|
704 |
! |
|
705 |
||
706 |
sameAs:aCharacter |
|
707 |
"return true, if the argument, aCharacter is the same character, |
|
708 |
ignoring case differences." |
|
709 |
||
710 |
self == aCharacter ifTrue:[^ true]. |
|
711 |
^ self asLowercase = aCharacter asLowercase |
|
712 |
! |
|
713 |
||
714 |
~= aCharacter |
|
715 |
"return true, if the argument, aCharacter is not the same character |
|
716 |
Redefined to take care of 16bit characters." |
|
717 |
||
718 |
self == aCharacter ifTrue:[^ false]. |
|
719 |
aCharacter isCharacter ifFalse:[^ true]. |
|
720 |
^ (asciivalue ~~ aCharacter asciiValue) |
|
721 |
! ! |
|
722 |
||
723 |
!Character methodsFor:'converting'! |
|
724 |
||
725 |
asCharacter |
|
726 |
"usually sent to integers, but redefined here to allow integers |
|
727 |
and characters to be used commonly without a need for a test." |
|
728 |
||
729 |
^ self |
|
730 |
||
731 |
" |
|
732 |
32 asCharacter |
|
733 |
" |
|
734 |
! |
|
735 |
||
736 |
asInteger |
|
737 |
"return an Integer with my ascii-value. |
|
738 |
OWST4.2 compatibility (sigh)" |
|
739 |
||
740 |
^ asciivalue |
|
741 |
! |
|
742 |
||
743 |
asLowercase |
|
744 |
"return a character with same letter as the receiver, |
|
745 |
but lowercase (the receiver if its lowercase or nonLetter)" |
|
746 |
||
1491
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
747 |
|code "{Class: SmallInteger }"| |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
748 |
|
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
749 |
"/ the old code: |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
750 |
"/ self isUppercase ifFalse:[^ self]. |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
751 |
"/ ^ Character value:(asciivalue + 32) |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
752 |
|
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
753 |
code := asciivalue. |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
754 |
|
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
755 |
"/ ISO Latin-1 |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
756 |
((code >= $A asciiValue) and:[code <= $Z asciiValue]) ifTrue:[ |
7985 | 757 |
^ Character value:(code + ($a asciiValue - $A asciiValue)) |
1491
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
758 |
]. |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
759 |
code < 16r00C0 ifTrue:[^ self]. |
7985 | 760 |
code <= 16r00FF ifTrue:[ |
761 |
code >= 16r00DF ifTrue:[^ self]. |
|
762 |
code == 16r00D7 ifTrue:[^ self]. |
|
763 |
^ Character value:(code + 16r20) |
|
764 |
]. |
|
7987 | 765 |
code <= 16r021F ifTrue:[ |
7985 | 766 |
code even ifTrue:[ ^ Character value:(code + 1) ]. |
767 |
^ self. |
|
1491
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
768 |
]. |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
769 |
|
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
770 |
"/ mhmh - in which encoding is this character. |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
771 |
"/ here, assume Unicode |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
772 |
|
2124 | 773 |
'Character [info]: Unicode support is under construction' infoPrintCR. |
1491
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
774 |
^ self |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
775 |
|
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
776 |
" |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
777 |
$A asLowercase |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
778 |
$1 asLowercase |
7985 | 779 |
(Character value:16r10d) asLowercase |
780 |
(Character value:16r10d) asUppercase |
|
1491
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
781 |
" |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
782 |
|
2124 | 783 |
"Modified: 10.1.1997 / 15:35:44 / cg" |
699 | 784 |
! |
785 |
||
786 |
asString |
|
787 |
"return a string of len 1 with myself as contents" |
|
788 |
||
789 |
%{ /* NOCONTEXT */ |
|
790 |
char buffer[2]; |
|
791 |
OBJ s; |
|
995
b018368b3a94
asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents:
819
diff
changeset
|
792 |
unsigned val; |
699 | 793 |
|
995
b018368b3a94
asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents:
819
diff
changeset
|
794 |
val = __intVal(_characterVal(self)); |
b018368b3a94
asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents:
819
diff
changeset
|
795 |
if (val <= 0xFF) { |
5407 | 796 |
buffer[0] = (char) val; |
797 |
buffer[1] = '\0'; |
|
798 |
s = __MKSTRING_L(buffer, 1 COMMA_SND); |
|
799 |
if (s != nil) { |
|
800 |
RETURN (s); |
|
801 |
} |
|
699 | 802 |
} |
803 |
%}. |
|
7951 | 804 |
asciivalue > 16rFF ifTrue:[ |
805 |
asciivalue > 16rFFFF ifTrue:[ |
|
806 |
^ (FourByteString new:1) at:1 put:self; yourself |
|
807 |
]. |
|
5407 | 808 |
^ (TwoByteString new:1) at:1 put:self; yourself |
995
b018368b3a94
asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents:
819
diff
changeset
|
809 |
]. |
b018368b3a94
asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents:
819
diff
changeset
|
810 |
|
5407 | 811 |
^ (String new:1) at:1 put:self; yourself. |
699 | 812 |
! |
813 |
||
814 |
asSymbol |
|
815 |
"return a unique symbol which prints like I print" |
|
816 |
||
817 |
^ Symbol internCharacter:self |
|
818 |
! |
|
819 |
||
6029 | 820 |
asUnicodeString |
821 |
"return a unicode string of len 1 with myself as contents" |
|
822 |
||
7951 | 823 |
asciivalue > 16rFFFF ifTrue:[ |
824 |
^ (Unicode32String new:1) at:1 put:self; yourself. |
|
825 |
]. |
|
826 |
^ (Unicode16String new:1) at:1 put:self; yourself. |
|
6029 | 827 |
! |
828 |
||
699 | 829 |
asUppercase |
830 |
"return a character with same letter as the receiver, |
|
831 |
but uppercase (the receiver if its uppercase or nonLetter)" |
|
832 |
||
1491
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
833 |
|code "{Class: SmallInteger }"| |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
834 |
|
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
835 |
"/ the old code: |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
836 |
"/ self isLowercase ifFalse:[^ self]. |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
837 |
"/ ^ Character value:(asciivalue - 32) |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
838 |
|
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
839 |
code := asciivalue. |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
840 |
|
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
841 |
"/ ISO Latin-1 |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
842 |
((code >= $a asciiValue) and:[code <= $z asciiValue]) ifTrue:[ |
7985 | 843 |
^ Character value:(code + ($A asciiValue - $a asciiValue)) |
1491
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
844 |
]. |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
845 |
code < 16r00E0 ifTrue:[^ self]. |
7985 | 846 |
code <= 16r00FF ifTrue:[ |
847 |
code == 16r00F7 ifTrue:[^ self]. "/ division |
|
1491
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
848 |
"/ code == 16r00FF ifTrue:[^ Character value:16r0178]. "/ y diaresis (no uppercase equivalent in ISO-latin 1 |
7985 | 849 |
code == 16r00FF ifTrue:[^ self]. |
1491
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
850 |
|
7985 | 851 |
^ Character value:(code - 16r20) |
852 |
]. |
|
7987 | 853 |
code <= 16r021F ifTrue:[ |
7985 | 854 |
code even ifTrue:[ ^ self ]. |
855 |
^ Character value:(code - 1). |
|
1491
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
856 |
]. |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
857 |
|
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
858 |
"/ mhmh - in which encoding is this character. |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
859 |
"/ here, assume Unicode |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
860 |
|
2124 | 861 |
'Character [info]: Unicode support is under construction' infoPrintCR. |
1491
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
862 |
^ self |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
863 |
|
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
864 |
" |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
865 |
$A asLowercase |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
866 |
$1 asLowercase |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
867 |
" |
a42ae3fbb756
fixed asLowercase / asUppercase for national characters
Claus Gittinger <cg@exept.de>
parents:
1295
diff
changeset
|
868 |
|
2124 | 869 |
"Modified: 10.1.1997 / 15:35:51 / cg" |
699 | 870 |
! |
871 |
||
3058
187372cf3eb8
added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents:
2908
diff
changeset
|
872 |
decodeAsLiteralArray |
187372cf3eb8
added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents:
2908
diff
changeset
|
873 |
"given a literalEncoding in the receiver, |
187372cf3eb8
added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents:
2908
diff
changeset
|
874 |
create & return the corresponding object. |
187372cf3eb8
added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents:
2908
diff
changeset
|
875 |
The inverse operation to #literalArrayEncoding." |
187372cf3eb8
added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents:
2908
diff
changeset
|
876 |
|
187372cf3eb8
added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents:
2908
diff
changeset
|
877 |
^ self |
187372cf3eb8
added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents:
2908
diff
changeset
|
878 |
|
187372cf3eb8
added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents:
2908
diff
changeset
|
879 |
"Created: / 27.10.1997 / 14:40:16 / cg" |
187372cf3eb8
added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents:
2908
diff
changeset
|
880 |
! |
187372cf3eb8
added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents:
2908
diff
changeset
|
881 |
|
699 | 882 |
digitValue |
883 |
"return my digitValue for any base" |
|
884 |
||
885 |
|ascii "{ Class: SmallInteger }" | |
|
886 |
||
887 |
ascii := asciivalue. |
|
888 |
(ascii between:($0 asciiValue) and:($9 asciiValue)) ifTrue:[ |
|
889 |
^ ascii - $0 asciiValue |
|
890 |
]. |
|
891 |
(ascii between:($a asciiValue) and:($z asciiValue)) ifTrue:[ |
|
892 |
^ ascii + (10 - $a asciiValue) |
|
893 |
]. |
|
894 |
(ascii between:($A asciiValue) and:($Z asciiValue)) ifTrue:[ |
|
895 |
^ ascii + (10 - $A asciiValue) |
|
896 |
]. |
|
897 |
||
898 |
"remove error below for X3J20 conformance ... " |
|
899 |
self error:'bad character'. |
|
900 |
" " |
|
901 |
^ -1 |
|
902 |
! |
|
903 |
||
3058
187372cf3eb8
added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents:
2908
diff
changeset
|
904 |
literalArrayEncoding |
187372cf3eb8
added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents:
2908
diff
changeset
|
905 |
"encode myself as an array literal, from which a copy of the receiver |
187372cf3eb8
added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents:
2908
diff
changeset
|
906 |
can be reconstructed with #decodeAsLiteralArray." |
187372cf3eb8
added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents:
2908
diff
changeset
|
907 |
|
187372cf3eb8
added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents:
2908
diff
changeset
|
908 |
^ self |
187372cf3eb8
added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents:
2908
diff
changeset
|
909 |
|
187372cf3eb8
added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents:
2908
diff
changeset
|
910 |
"Created: / 27.10.1997 / 14:40:37 / cg" |
187372cf3eb8
added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents:
2908
diff
changeset
|
911 |
! |
187372cf3eb8
added #encodeAsLiteralArray & #decodeAsLiteralArray
Claus Gittinger <cg@exept.de>
parents:
2908
diff
changeset
|
912 |
|
699 | 913 |
to:aMagnitude |
914 |
"Return an Interval over the characters from the receiver to <aMagnitude>. |
|
915 |
Wrap <aMagnitude> if it is not a legal Character value. (JS)" |
|
916 |
||
917 |
^ Interval from:self to:(aMagnitude \\ 256) |
|
5847 | 918 |
! |
919 |
||
920 |
utf8Encoded |
|
921 |
"convert a character from ISO8859-1 to its UTF-8 encoding. |
|
922 |
this returns a String" |
|
923 |
||
924 |
|s| |
|
925 |
||
7897 | 926 |
asciivalue <= 16r7F ifTrue:[ |
927 |
^ self asString. |
|
928 |
]. |
|
929 |
||
5847 | 930 |
s := (String new:6) writeStream. |
931 |
self utf8EncodedOn:s. |
|
932 |
^ s contents |
|
933 |
! |
|
934 |
||
935 |
utf8EncodedOn:aStream |
|
6808 | 936 |
"append my UTF-8 representation to the argument, aStream. |
937 |
Up to 31 bits can be encoded in up to 6 bytes. |
|
938 |
However, currently, characters are limited to 16 bits." |
|
5847 | 939 |
|
940 |
|b1 b2 b3 b4 b5 v| |
|
941 |
||
942 |
asciivalue <= 16r7F ifTrue:[ |
|
943 |
aStream nextPut:self. |
|
944 |
^ self. |
|
945 |
]. |
|
946 |
||
6808 | 947 |
b1 := Character value:((asciivalue bitAnd:16r3F) bitOr:2r10000000). |
5847 | 948 |
v := asciivalue bitShift:-6. |
949 |
v <= 16r1F ifTrue:[ |
|
6808 | 950 |
aStream nextPut:(Character value:(v bitOr:2r11000000)). |
5847 | 951 |
aStream nextPut:b1. |
952 |
^ self. |
|
953 |
]. |
|
6808 | 954 |
b2 := Character value:((v bitAnd:16r3F) bitOr:2r10000000). |
5847 | 955 |
v := v bitShift:-6. |
956 |
v <= 16r0F ifTrue:[ |
|
6808 | 957 |
aStream nextPut:(Character value:(v bitOr:2r11100000)). |
7897 | 958 |
aStream nextPut:b2; nextPut:b1. |
5847 | 959 |
^ self. |
960 |
]. |
|
6808 | 961 |
b3 := Character value:((v bitAnd:16r3F) bitOr:2r10000000). |
5847 | 962 |
v := v bitShift:-6. |
963 |
v <= 16r07 ifTrue:[ |
|
6808 | 964 |
aStream nextPut:(Character value:(v bitOr:2r11110000)). |
7897 | 965 |
aStream nextPut:b3; nextPut:b2; nextPut:b1. |
5847 | 966 |
^ self. |
967 |
]. |
|
6808 | 968 |
b4 := Character value:((v bitAnd:16r3F) bitOr:2r10000000). |
5847 | 969 |
v := v bitShift:-6. |
970 |
v <= 16r03 ifTrue:[ |
|
6808 | 971 |
aStream nextPut:(Character value:(v bitOr:2r11111000)). |
7897 | 972 |
aStream nextPut:b4; nextPut:b3; nextPut:b2; nextPut:b1. |
5847 | 973 |
^ self. |
974 |
]. |
|
6808 | 975 |
b5 := Character value:((v bitAnd:16r3F) bitOr:2r10000000). |
5847 | 976 |
v := v bitShift:-6. |
977 |
v <= 16r01 ifTrue:[ |
|
6808 | 978 |
aStream nextPut:(Character value:(v bitOr:2r11111100)). |
7897 | 979 |
aStream nextPut:b5; nextPut:b4; nextPut:b3; nextPut:b2; nextPut:b1. |
5847 | 980 |
^ self. |
981 |
]. |
|
6808 | 982 |
"/ cannot happen - we only support up to 16 bit characters |
983 |
self error:'ascii value > 31bit in utf8Encode'. |
|
5847 | 984 |
|
985 |
" |
|
6527 | 986 |
$a utf8Encoded |
987 |
$ü utf8Encoded asByteArray |
|
988 |
(Character value:16r1fff) utf8Encoded asByteArray |
|
6808 | 989 |
(Character value:16rffff) utf8Encoded asByteArray |
990 |
(Character value:16r800) utf8Encoded asByteArray |
|
5847 | 991 |
" |
699 | 992 |
! ! |
993 |
||
994 |
!Character methodsFor:'copying'! |
|
995 |
||
996 |
copy |
|
997 |
"return a copy of myself |
|
998 |
reimplemented since characters are unique" |
|
999 |
||
1000 |
^ self |
|
1001 |
! |
|
1002 |
||
2214 | 1003 |
copyUpToLast: char |
1004 |
||
1005 |
^self |
|
1006 |
! |
|
1007 |
||
699 | 1008 |
deepCopyUsing:aDictionary |
1009 |
"return a deep copy of myself |
|
4728 | 1010 |
reimplemented since characters are immutable" |
699 | 1011 |
|
1012 |
^ self |
|
1013 |
! |
|
1014 |
||
1015 |
shallowCopy |
|
1016 |
"return a shallow copy of myself |
|
4728 | 1017 |
reimplemented since characters are immutable" |
699 | 1018 |
|
1019 |
^ self |
|
1020 |
! |
|
1021 |
||
1022 |
simpleDeepCopy |
|
1023 |
"return a deep copy of myself |
|
4728 | 1024 |
reimplemented since characters are immutable" |
699 | 1025 |
|
1026 |
^ self |
|
1027 |
! ! |
|
1028 |
||
5471
a57eeb01c5ab
General encoding method (#encodeOn:with:)
Stefan Vogel <sv@exept.de>
parents:
5452
diff
changeset
|
1029 |
!Character methodsFor:'encoding'! |
a57eeb01c5ab
General encoding method (#encodeOn:with:)
Stefan Vogel <sv@exept.de>
parents:
5452
diff
changeset
|
1030 |
|
a57eeb01c5ab
General encoding method (#encodeOn:with:)
Stefan Vogel <sv@exept.de>
parents:
5452
diff
changeset
|
1031 |
encodeOn:anEncoder with:aParameter |
a57eeb01c5ab
General encoding method (#encodeOn:with:)
Stefan Vogel <sv@exept.de>
parents:
5452
diff
changeset
|
1032 |
|
a57eeb01c5ab
General encoding method (#encodeOn:with:)
Stefan Vogel <sv@exept.de>
parents:
5452
diff
changeset
|
1033 |
anEncoder encodeCharacter:self with:aParameter |
a57eeb01c5ab
General encoding method (#encodeOn:with:)
Stefan Vogel <sv@exept.de>
parents:
5452
diff
changeset
|
1034 |
|
a57eeb01c5ab
General encoding method (#encodeOn:with:)
Stefan Vogel <sv@exept.de>
parents:
5452
diff
changeset
|
1035 |
|
6508 | 1036 |
! |
1037 |
||
1038 |
rot13 |
|
1039 |
"Usenet: from `rotate alphabet 13 places'] |
|
1040 |
The simple Caesar-cypher encryption that replaces each English |
|
1041 |
letter with the one 13 places forward or back along the alphabet, |
|
1042 |
so that 'The butler did it!!' becomes 'Gur ohgyre qvq vg!!' |
|
1043 |
Most Usenet news reading and posting programs include a rot13 feature. |
|
1044 |
It is used to enclose the text in a sealed wrapper that the reader must choose |
|
1045 |
to open -- e.g., for posting things that might offend some readers, or spoilers. |
|
1046 |
A major advantage of rot13 over rot(N) for other N is that it |
|
1047 |
is self-inverse, so the same code can be used for encoding and decoding." |
|
1048 |
||
7715
0e69a830f5d8
use #and: - not #& you lazy bone, you
Claus Gittinger <cg@exept.de>
parents:
7689
diff
changeset
|
1049 |
(((self >= $a) and:[ self < $n]) or:[(self >= $A) and:[ self < $N]]) |
6508 | 1050 |
ifTrue: [ ^ Character value:(self asciiValue + 13) ]. |
7715
0e69a830f5d8
use #and: - not #& you lazy bone, you
Claus Gittinger <cg@exept.de>
parents:
7689
diff
changeset
|
1051 |
(((self > $m) and:[ self <= $z]) or:[(self > $M) and:[ self <= $Z]]) |
6508 | 1052 |
ifTrue: [ ^ Character value:(self asciiValue - 13) ]. |
1053 |
^ self |
|
1054 |
||
1055 |
" |
|
7715
0e69a830f5d8
use #and: - not #& you lazy bone, you
Claus Gittinger <cg@exept.de>
parents:
7689
diff
changeset
|
1056 |
$h rot13 |
0e69a830f5d8
use #and: - not #& you lazy bone, you
Claus Gittinger <cg@exept.de>
parents:
7689
diff
changeset
|
1057 |
$h rot13 rot13 |
0e69a830f5d8
use #and: - not #& you lazy bone, you
Claus Gittinger <cg@exept.de>
parents:
7689
diff
changeset
|
1058 |
'The butler did it!!' rot13 -> 'Gur ohgyre qvq vg!!' |
0e69a830f5d8
use #and: - not #& you lazy bone, you
Claus Gittinger <cg@exept.de>
parents:
7689
diff
changeset
|
1059 |
'The butler did it!!' rot13 rot13 -> 'The butler did it!!' |
6508 | 1060 |
" |
5471
a57eeb01c5ab
General encoding method (#encodeOn:with:)
Stefan Vogel <sv@exept.de>
parents:
5452
diff
changeset
|
1061 |
! ! |
a57eeb01c5ab
General encoding method (#encodeOn:with:)
Stefan Vogel <sv@exept.de>
parents:
5452
diff
changeset
|
1062 |
|
699 | 1063 |
!Character methodsFor:'national testing'! |
1064 |
||
1065 |
isNationalAlphaNumeric |
|
2214 | 1066 |
"return true, if the receiver is a letter or digit in the |
699 | 1067 |
current language (Language variable)" |
1068 |
||
7988
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1069 |
self isNationalLetter ifTrue:[^ true]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1070 |
^ self isDigit |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1071 |
! |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1072 |
|
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1073 |
isNationalDigit |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1074 |
"return true, if the receiver is a digit. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1075 |
This assumes unicode encoding. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1076 |
WARNING: this method is not complete." |
699 | 1077 |
|
7988
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1078 |
|codePoint "{ Class SmallInteger }"| |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1079 |
|
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1080 |
codePoint := asciivalue. |
2214 | 1081 |
|
7988
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1082 |
codePoint <= 16rFF ifTrue:[ "/ u00xx - unicode latin1 page |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1083 |
(codePoint between:($0 asciiValue) and:($9 asciiValue)) ifTrue:[^ true]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1084 |
^ false |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1085 |
]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1086 |
^ false. |
699 | 1087 |
! |
1088 |
||
1089 |
isNationalLetter |
|
7988
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1090 |
"return true, if the receiver is a letter. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1091 |
This assumes unicode encoding. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1092 |
WARNING: this method is not complete." |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1093 |
|
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1094 |
|codePoint "{ Class SmallInteger }"| |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1095 |
|
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1096 |
codePoint := asciivalue. |
699 | 1097 |
|
7988
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1098 |
codePoint <= 16rFF ifTrue:[ "/ u00xx - unicode latin1 page |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1099 |
(codePoint between:($a asciiValue) and:($z asciiValue)) ifTrue:[^ true]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1100 |
(codePoint between:($A asciiValue) and:($Z asciiValue)) ifTrue:[^ true]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1101 |
codePoint < 16rC0 ifTrue:[^ false]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1102 |
codePoint == 16rD7 ifTrue:[^ false]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1103 |
codePoint == 16rF7 ifTrue:[^ false]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1104 |
^ true |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1105 |
]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1106 |
codePoint <= 16r1FF ifTrue:[ "/ u0100..u017F - unicode european latin (Latin ext-A) page |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1107 |
^ true "/ u0180..u01FF - unicode european latin (Latin ext-B) page |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1108 |
]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1109 |
codePoint <= 16r2FF ifTrue:[ "/ u02xx - unicode latin page |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1110 |
codePoint <= 16r2B8 ifTrue:[ |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1111 |
^ true |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1112 |
]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1113 |
^ false |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1114 |
]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1115 |
codePoint <= 16r3FF ifTrue:[ "/ u03xx - unicode accent & greek page |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1116 |
codePoint <= 16r385 ifTrue:[ ^ false ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1117 |
codePoint == 16r387 ifTrue:[ ^ false ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1118 |
^ true |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1119 |
]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1120 |
codePoint <= 16r4FF ifTrue:[ "/ u04xx - unicode cyrillic |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1121 |
^ true |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1122 |
]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1123 |
codePoint <= 16r5FF ifTrue:[ "/ u05xx - |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1124 |
codePoint <= 16r50F ifTrue:[ "/ u05xx - unicode cyrillic supplementary |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1125 |
^ true |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1126 |
]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1127 |
codePoint <= 16r530 ifTrue:[ ^ false ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1128 |
codePoint <= 16r556 ifTrue:[ "/ u05xx - unicode armenian |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1129 |
^ true |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1130 |
]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1131 |
codePoint <= 16r550 ifTrue:[ ^ false ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1132 |
codePoint <= 16r587 ifTrue:[ |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1133 |
^ true |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1134 |
]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1135 |
codePoint <= 16r5CF ifTrue:[ ^ false ]."/ u05xx - unicode hebrew |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1136 |
codePoint <= 16r5F2 ifTrue:[ |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1137 |
^ true |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1138 |
]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1139 |
^ false. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1140 |
]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1141 |
codePoint <= 16r6FF ifTrue:[ "/ u06xx - arabic |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1142 |
codePoint <= 16r620 ifTrue:[ ^ false ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1143 |
codePoint <= 16r64A ifTrue:[ ^ true ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1144 |
codePoint <= 16r66D ifTrue:[ ^ false ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1145 |
codePoint <= 16r6D3 ifTrue:[ ^ true ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1146 |
codePoint == 16r6D5 ifTrue:[ ^ true ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1147 |
codePoint == 16r6EE ifTrue:[ ^ true ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1148 |
codePoint == 16r6EF ifTrue:[ ^ true ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1149 |
^ false. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1150 |
]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1151 |
codePoint <= 16r7FF ifTrue:[ "/ u06xx - syriac / thaana |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1152 |
codePoint < 16r710 ifTrue:[ ^ false ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1153 |
codePoint < 16r74F ifTrue:[ ^ true ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1154 |
codePoint < 16r780 ifTrue:[ ^ false ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1155 |
codePoint < 16r7B0 ifTrue:[ ^ true ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1156 |
^ false. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1157 |
]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1158 |
codePoint <= 16rDFF ifTrue:[ |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1159 |
"/ self halt:'unimplemented character range'. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1160 |
^ false. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1161 |
]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1162 |
codePoint <= 16rEFF ifTrue:[ "/ u0Exx - thai / lao |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1163 |
codePoint < 16rE50 ifTrue:[ ^ true]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1164 |
codePoint < 16rE80 ifTrue:[ ^ false]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1165 |
codePoint < 16rED0 ifTrue:[ ^ true]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1166 |
^ false. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1167 |
]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1168 |
"/ self halt:'unimplemented character range'. |
2214 | 1169 |
^ false. |
1170 |
||
1171 |
"Modified: 21.1.1997 / 11:13:02 / cg" |
|
699 | 1172 |
! ! |
1173 |
||
1174 |
!Character methodsFor:'printing & storing'! |
|
1175 |
||
1176 |
displayString |
|
1177 |
"return a string used when the receiver is to be displayed |
|
1178 |
in an inspector kind-of-thing" |
|
1179 |
||
2908 | 1180 |
^ (self storeString , ' "16r' , (asciivalue printStringRadix:16) , '"') |
699 | 1181 |
! |
1182 |
||
1183 |
isLiteral |
|
4655
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1184 |
"return true, if the receiver can be used as a literal constant in ST syntax |
699 | 1185 |
(i.e. can be used in constant arrays)" |
1186 |
||
1187 |
^ true |
|
1188 |
! |
|
1189 |
||
1190 |
||
5746
cf5e42cb72ef
do not overwrite the standard printing conventions
Claus Gittinger <cg@exept.de>
parents:
5566
diff
changeset
|
1191 |
"print myself on stdout. |
cf5e42cb72ef
do not overwrite the standard printing conventions
Claus Gittinger <cg@exept.de>
parents:
5566
diff
changeset
|
1192 |
This method does NOT (by purpose) use the stream classes and |
cf5e42cb72ef
do not overwrite the standard printing conventions
Claus Gittinger <cg@exept.de>
parents:
5566
diff
changeset
|
1193 |
will therefore work even in case of emergency (but only, if Stdout is nil)." |
699 | 1194 |
|
1195 |
%{ /* NOCONTEXT */ |
|
1196 |
||
5746
cf5e42cb72ef
do not overwrite the standard printing conventions
Claus Gittinger <cg@exept.de>
parents:
5566
diff
changeset
|
1197 |
if (@global(Stdout) == nil) { |
cf5e42cb72ef
do not overwrite the standard printing conventions
Claus Gittinger <cg@exept.de>
parents:
5566
diff
changeset
|
1198 |
putchar(__intVal(__INST(asciivalue))); |
cf5e42cb72ef
do not overwrite the standard printing conventions
Claus Gittinger <cg@exept.de>
parents:
5566
diff
changeset
|
1199 |
RETURN(self); |
cf5e42cb72ef
do not overwrite the standard printing conventions
Claus Gittinger <cg@exept.de>
parents:
5566
diff
changeset
|
1200 |
} |
5452
71fd110c347a
allow print, printCR during early initialization
Claus Gittinger <cg@exept.de>
parents:
5433
diff
changeset
|
1201 |
%}. |
71fd110c347a
allow print, printCR during early initialization
Claus Gittinger <cg@exept.de>
parents:
5433
diff
changeset
|
1202 |
super print |
699 | 1203 |
! |
1204 |
||
1205 |
printOn:aStream |
|
1206 |
"print myself on aStream" |
|
1207 |
||
1208 |
aStream nextPut:self |
|
1209 |
! |
|
1210 |
||
1211 |
printString |
|
1212 |
"return a string to print me" |
|
1213 |
||
1214 |
^ self asString |
|
1215 |
! |
|
1216 |
||
2214 | 1217 |
reconstructOn:aStream |
1218 |
aStream nextPut:$$; nextPut:self |
|
1219 |
||
1220 |
"Created: 6.2.1996 / 09:46:12 / stefan" |
|
1221 |
"Modified: 6.2.1996 / 10:57:35 / stefan" |
|
1222 |
! |
|
1223 |
||
699 | 1224 |
storeOn:aStream |
1225 |
"store myself on aStream" |
|
1226 |
||
1227 |
|special| |
|
1228 |
||
1229 |
(asciivalue between:33 and:127) ifFalse:[ |
|
4655
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1230 |
(self == Character space) ifTrue:[ |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1231 |
special := 'space' |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1232 |
] ifFalse:[ |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1233 |
(self == Character cr) ifTrue:[ |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1234 |
special := 'cr'. |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1235 |
] ifFalse:[ |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1236 |
(self == Character tab) ifTrue:[ |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1237 |
special := 'tab'. |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1238 |
] ifFalse:[ |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1239 |
(self == Character esc) ifTrue:[ |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1240 |
special := 'esc'. |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1241 |
] |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1242 |
] |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1243 |
] |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1244 |
]. |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1245 |
special notNil ifTrue:[ |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1246 |
aStream nextPutAll:'(Character '; nextPutAll:special; nextPut:$). |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1247 |
^ self |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1248 |
]. |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1249 |
aStream nextPutAll:'(Character value:16r'. |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1250 |
asciivalue printOn:aStream base:16. |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1251 |
aStream nextPut:$) |
699 | 1252 |
] ifTrue:[ |
4655
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1253 |
aStream nextPut:$$; nextPut:self |
699 | 1254 |
] |
995
b018368b3a94
asString to 16-bit char should return a twoByteString
Claus Gittinger <cg@exept.de>
parents:
819
diff
changeset
|
1255 |
|
3190
81ffb25d1d86
Use #printOn: instead of #printString
Stefan Vogel <sv@exept.de>
parents:
3072
diff
changeset
|
1256 |
"Modified: / 23.2.1996 / 23:27:32 / cg" |
81ffb25d1d86
Use #printOn: instead of #printString
Stefan Vogel <sv@exept.de>
parents:
3072
diff
changeset
|
1257 |
"Modified: / 20.1.1998 / 14:10:46 / stefan" |
699 | 1258 |
! ! |
1259 |
||
7257 | 1260 |
!Character methodsFor:'private-accessing'! |
699 | 1261 |
|
1262 |
setAsciiValue:anInteger |
|
7897 | 1263 |
"very private - set the asciivalue / codePoint. |
699 | 1264 |
- use this only for characters with codes > 16rFF. |
7897 | 1265 |
DANGER alert: |
1266 |
funny things happen, if this is applied to |
|
1267 |
one of the shared characters with codePoints 0..255." |
|
699 | 1268 |
|
1269 |
asciivalue := anInteger |
|
1270 |
! ! |
|
1271 |
||
1 | 1272 |
!Character methodsFor:'testing'! |
1273 |
||
54 | 1274 |
isCharacter |
1275 |
"return true, if the receiver is some kind of character" |
|
1276 |
||
1277 |
^ true |
|
1278 |
! |
|
1279 |
||
3667 | 1280 |
isControlCharacter |
1281 |
"return true if I am a control character (i.e. ascii value < 32)" |
|
1282 |
||
1283 |
%{ /* NOCONTEXT */ |
|
1284 |
#ifdef NON_ASCII /* i.e. EBCDIC ;-) */ |
|
1285 |
not yet implemented - fails when compiled |
|
1286 |
#else |
|
1287 |
REGISTER int val; |
|
1288 |
||
1289 |
val = __intVal(__INST(asciivalue)); |
|
3881 | 1290 |
if (val < ' ') { |
5423
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1291 |
RETURN ( true ); |
3667 | 1292 |
} |
1293 |
#endif |
|
5423
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1294 |
RETURN (false); |
3667 | 1295 |
%}. |
5423
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1296 |
^ asciivalue < 16r20. |
3667 | 1297 |
|
1298 |
" |
|
1299 |
(Character value:1) isControlCharacter |
|
1300 |
$a isControlCharacter |
|
1301 |
" |
|
1302 |
! |
|
1303 |
||
1 | 1304 |
isDigit |
1305 |
"return true, if I am a digit (i.e. $0 .. $9)" |
|
1306 |
||
7980 | 1307 |
%{ /* NOCONTEXT */ |
6527 | 1308 |
|
1309 |
REGISTER int val; |
|
1310 |
||
1311 |
val = __intVal(__INST(asciivalue)); |
|
1312 |
if ((unsigned)(val - '0') <= ('9' - '0')) { |
|
1313 |
RETURN ( true ); |
|
1314 |
} |
|
1315 |
RETURN ( false ); |
|
1316 |
%}. |
|
1317 |
^ asciivalue between:$0 asciiValue and:$9 asciiValue |
|
1 | 1318 |
! |
1319 |
||
1320 |
isDigitRadix:r |
|
1321 |
"return true, if I am a digit of a base r number" |
|
1322 |
||
1323 |
(asciivalue < $0 asciiValue) ifTrue:[^ false]. |
|
1324 |
(r > 10) ifTrue:[ |
|
6527 | 1325 |
(asciivalue <= $9 asciiValue) ifTrue:[ |
1326 |
^ true |
|
1327 |
]. |
|
1328 |
((asciivalue - $a asciiValue) between:0 and:(r - 11)) ifTrue:[ |
|
1329 |
^ true |
|
1330 |
]. |
|
1331 |
^ (asciivalue - $A asciiValue) between:0 and:(r - 11) |
|
1 | 1332 |
]. |
1333 |
(asciivalue - $0 asciiValue) < r ifTrue:[^ true]. |
|
1334 |
^ false |
|
1335 |
! |
|
1336 |
||
699 | 1337 |
isEndOfLineCharacter |
1338 |
"return true if I am a line delimitting character" |
|
1 | 1339 |
|
1340 |
%{ /* NOCONTEXT */ |
|
1341 |
||
1342 |
REGISTER int val; |
|
1343 |
||
1133 | 1344 |
val = __intVal(__INST(asciivalue)); |
699 | 1345 |
if ((val == '\n') |
1346 |
|| (val == '\r') |
|
1347 |
|| (val == '\f')) { |
|
5423
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1348 |
RETURN ( true ); |
54 | 1349 |
} |
5423
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1350 |
RETURN (false); |
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1351 |
%}. |
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1352 |
^ asciivalue == 16r0A |
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1353 |
or:[asciivalue == 16r0D |
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1354 |
or:[asciivalue == 16r0C]] |
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1355 |
|
1 | 1356 |
! |
1357 |
||
5473
c48d8c45c740
isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents:
5471
diff
changeset
|
1358 |
isImmediate |
c48d8c45c740
isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents:
5471
diff
changeset
|
1359 |
"return true if I am an immediate object |
c48d8c45c740
isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents:
5471
diff
changeset
|
1360 |
i.e. I am represented in the pointer itself and |
c48d8c45c740
isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents:
5471
diff
changeset
|
1361 |
no real object header/storage is used me. |
c48d8c45c740
isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents:
5471
diff
changeset
|
1362 |
For VW compatibility, shared characters (i.e. in the range 0..255) |
c48d8c45c740
isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents:
5471
diff
changeset
|
1363 |
also return true here" |
c48d8c45c740
isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents:
5471
diff
changeset
|
1364 |
|
c48d8c45c740
isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents:
5471
diff
changeset
|
1365 |
^ asciivalue <= 255 |
c48d8c45c740
isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents:
5471
diff
changeset
|
1366 |
|
c48d8c45c740
isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents:
5471
diff
changeset
|
1367 |
! |
c48d8c45c740
isImmediate returns true for shared characters
Claus Gittinger <cg@exept.de>
parents:
5471
diff
changeset
|
1368 |
|
1 | 1369 |
isLetter |
7979
7515722ccfb1
isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents:
7976
diff
changeset
|
1370 |
"return true, if I am a letter in the 'a'..'z' range. |
7515722ccfb1
isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents:
7976
diff
changeset
|
1371 |
Use isNationalLetter, if you are interested in those." |
1 | 1372 |
|
7980 | 1373 |
%{ /* NOCONTEXT */ |
1 | 1374 |
|
1375 |
REGISTER int val; |
|
1376 |
||
1133 | 1377 |
val = __intVal(__INST(asciivalue)); |
6527 | 1378 |
if ((unsigned)(val - 'a') <= ('z' - 'a')) { |
1379 |
RETURN ( true ); |
|
1380 |
} |
|
1381 |
if ((unsigned)(val - 'A') <= ('Z' - 'A')) { |
|
1382 |
RETURN ( true ); |
|
1383 |
} |
|
1384 |
RETURN ( false ); |
|
5423
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1385 |
%}. |
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1386 |
^ (asciivalue between:($a asciiValue) and:($z asciiValue)) |
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1387 |
or:[(asciivalue between:($A asciiValue) and:($Z asciiValue))] |
1 | 1388 |
! |
1389 |
||
154 | 1390 |
isLetterOrDigit |
7979
7515722ccfb1
isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents:
7976
diff
changeset
|
1391 |
"return true, if I am a letter (a..z or A..Z) or a digit (0..9) |
7515722ccfb1
isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents:
7976
diff
changeset
|
1392 |
Use isNationalAlphaNumeric, if you are interested in those." |
1 | 1393 |
|
1394 |
%{ /* NOCONTEXT */ |
|
1395 |
||
1396 |
REGISTER int val; |
|
1397 |
||
1133 | 1398 |
val = __intVal(__INST(asciivalue)); |
6527 | 1399 |
if ((unsigned)(val - 'a') <= ('z' - 'a')) { |
5423
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1400 |
RETURN ( true ); |
1 | 1401 |
} |
6527 | 1402 |
if ((unsigned)(val - 'A') <= ('Z' - 'A')) { |
5423
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1403 |
RETURN ( true ); |
1 | 1404 |
} |
6527 | 1405 |
if ((unsigned)(val - '0') <= ('9' - '0')) { |
5423
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1406 |
RETURN ( true ); |
1 | 1407 |
} |
1408 |
RETURN ( false ); |
|
5423
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1409 |
%}. |
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1410 |
^ self isLetter or:[self isDigit] |
1 | 1411 |
! |
1412 |
||
699 | 1413 |
isLowercase |
7979
7515722ccfb1
isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents:
7976
diff
changeset
|
1414 |
"return true, if I am a lower-case letter. |
7515722ccfb1
isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents:
7976
diff
changeset
|
1415 |
This one does care for national characters." |
699 | 1416 |
|
1417 |
%{ /* NOCONTEXT */ |
|
1418 |
||
1419 |
REGISTER int val; |
|
154 | 1420 |
|
1133 | 1421 |
val = __intVal(__INST(asciivalue)); |
6527 | 1422 |
if ((unsigned)(val - 'a') <= ('z' - 'a')) { |
1423 |
RETURN ( true ); |
|
1424 |
} |
|
7979
7515722ccfb1
isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents:
7976
diff
changeset
|
1425 |
|
699 | 1426 |
/* iso8859 puts national lower case characters at e0 .. ff */ |
1427 |
if ((val >= 0xE0) && (val <= 0xFF)) { |
|
7979
7515722ccfb1
isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents:
7976
diff
changeset
|
1428 |
if (val != 0xE7) { |
7515722ccfb1
isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents:
7976
diff
changeset
|
1429 |
RETURN(true); |
7515722ccfb1
isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents:
7976
diff
changeset
|
1430 |
} |
5423
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1431 |
RETURN(true); |
699 | 1432 |
} |
7979
7515722ccfb1
isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents:
7976
diff
changeset
|
1433 |
|
7515722ccfb1
isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents:
7976
diff
changeset
|
1434 |
if (val <= 0xFF) { |
7515722ccfb1
isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents:
7976
diff
changeset
|
1435 |
RETURN (false); |
7515722ccfb1
isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents:
7976
diff
changeset
|
1436 |
} |
5423
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1437 |
%}. |
7988
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1438 |
asciivalue <= 16r01FF ifTrue:[ |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1439 |
asciivalue <= 16r0137 ifTrue:[ ^ asciivalue odd ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1440 |
asciivalue <= 16r0137 ifTrue:[ ^ asciivalue odd ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1441 |
asciivalue == 16r0138 ifTrue:[ ^ true ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1442 |
asciivalue <= 16r0148 ifTrue:[ ^ asciivalue even ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1443 |
asciivalue == 16r0149 ifTrue:[ ^ true ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1444 |
asciivalue <= 16r0178 ifTrue:[ ^ asciivalue odd ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1445 |
asciivalue <= 16r017E ifTrue:[ ^ asciivalue even ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1446 |
asciivalue == 16r017F ifTrue:[ ^ true ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1447 |
asciivalue < 16r01CD ifTrue:[ |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1448 |
asciivalue == 16r01C6 ifTrue:[ ^ true ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1449 |
asciivalue == 16r01C9 ifTrue:[ ^ true ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1450 |
asciivalue == 16r01CC ifTrue:[ ^ true ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1451 |
^ false "WRONG !!!!!! " |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1452 |
]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1453 |
asciivalue <= 16r01DC ifTrue:[ ^ asciivalue even ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1454 |
asciivalue == 16r01DD ifTrue:[ ^ true ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1455 |
asciivalue <= 16r01EF ifTrue:[ ^ asciivalue odd ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1456 |
asciivalue == 16r01F0 ifTrue:[ ^ true ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1457 |
asciivalue == 16r01F1 ifTrue:[ ^ false ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1458 |
asciivalue == 16r01F2 ifTrue:[ ^ false ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1459 |
asciivalue == 16r01F3 ifTrue:[ ^ true ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1460 |
asciivalue < 16r01F4 ifTrue:[ self halt. ^ false ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1461 |
asciivalue <= 16r01F6 ifTrue:[ ^ asciivalue odd ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1462 |
asciivalue == 16r01F7 ifTrue:[ ^ false ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1463 |
asciivalue < 16r01F8 ifTrue:[ self halt. ^ false ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1464 |
asciivalue <= 16r01FF ifTrue:[ ^ asciivalue odd ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1465 |
^ false. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1466 |
]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1467 |
asciivalue <= 16r02FF ifTrue:[ |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1468 |
asciivalue <= 16r0233 ifTrue:[ ^ asciivalue odd ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1469 |
asciivalue <= 16r0236 ifTrue:[ ^ true ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1470 |
asciivalue < 16r0250 ifTrue:[ ^ false ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1471 |
asciivalue < 16r02B0 ifTrue:[ ^ true ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1472 |
^ false |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1473 |
]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1474 |
asciivalue <= 16r03FF ifTrue:[ |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1475 |
asciivalue <= 16r03AB ifTrue:[ ^ false ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1476 |
asciivalue <= 16r03D1 ifTrue:[ ^ true ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1477 |
asciivalue == 16r03D6 ifTrue:[ ^ true ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1478 |
asciivalue <= 16r03D7 ifTrue:[ ^ false ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1479 |
asciivalue <= 16r03EF ifTrue:[ ^ asciivalue odd ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1480 |
asciivalue <= 16r03F2 ifTrue:[ ^ true ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1481 |
^ false |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1482 |
]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1483 |
asciivalue <= 16r04FF ifTrue:[ |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1484 |
asciivalue <= 16r042F ifTrue:[ ^ false ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1485 |
asciivalue <= 16r045F ifTrue:[ ^ true ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1486 |
asciivalue <= 16r0481 ifTrue:[ ^ asciivalue odd ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1487 |
asciivalue < 16r048A ifTrue:[ ^ false ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1488 |
asciivalue < 16r04BF ifTrue:[ ^ asciivalue odd ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1489 |
asciivalue == 16r04C0 ifTrue:[ ^ false ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1490 |
^ asciivalue odd |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1491 |
]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1492 |
asciivalue <= 16r05FF ifTrue:[ |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1493 |
asciivalue <= 16r050F ifTrue:[ ^ asciivalue odd ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1494 |
asciivalue < 16r0561 ifTrue:[ ^ false ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1495 |
asciivalue <= 16r0587 ifTrue:[ ^ true ]. |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1496 |
^ false |
7985 | 1497 |
]. |
1498 |
||
7979
7515722ccfb1
isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents:
7976
diff
changeset
|
1499 |
"/ TODO: care for unicode lowercase characters. |
7515722ccfb1
isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents:
7976
diff
changeset
|
1500 |
^ false |
154 | 1501 |
! |
1502 |
||
699 | 1503 |
isPrintable |
1504 |
"return true, if the receiver is a useful printable character |
|
1505 |
(see fileBrowsers showFile:-method on how it can be used)" |
|
1 | 1506 |
|
699 | 1507 |
(asciivalue between:32 and:127) ifTrue:[^ true]. |
6398 | 1508 |
asciivalue == 12 ifTrue:[^ true]. "/ FF |
1509 |
asciivalue == 13 ifTrue:[^ true]. "/ CR |
|
1510 |
asciivalue == 9 ifTrue:[^ true]. "/ TAB |
|
1511 |
asciivalue == 10 ifTrue:[^ true]. "/ NL |
|
2840 | 1512 |
|
1513 |
(asciivalue between:16rA0 and:16rBF) ifTrue:[^ true]. "/ ISO8859 |
|
699 | 1514 |
^ self isNationalLetter |
2840 | 1515 |
|
1516 |
"Modified: 7.8.1997 / 17:05:24 / cg" |
|
1 | 1517 |
! |
1518 |
||
1519 |
isSeparator |
|
1520 |
"return true if I am a space, cr, tab, nl, or newPage" |
|
1521 |
||
1522 |
%{ /* NOCONTEXT */ |
|
1523 |
||
1524 |
REGISTER int val; |
|
1525 |
||
1133 | 1526 |
val = __intVal(__INST(asciivalue)); |
328 | 1527 |
#ifndef NON_ASCII /* i.e. EBCDIC ;-) */ |
1528 |
if (val <= ' ') |
|
1529 |
#endif |
|
5423
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1530 |
if ((val == ' ') |
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1531 |
|| (val == '\n') |
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1532 |
|| (val == '\t') |
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1533 |
|| (val == '\r') |
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1534 |
|| (val == '\f')) { |
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1535 |
RETURN ( true ); |
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1536 |
} |
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1537 |
RETURN (false); |
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1538 |
%}. |
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1539 |
^ (asciivalue == 16r20) |
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1540 |
or:[asciivalue == 16r0D |
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1541 |
or:[asciivalue == 16r0A |
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1542 |
or:[asciivalue == 16r09 |
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1543 |
or:[asciivalue == 16r0C]]]] |
e33decc83182
non-primitive fallBack code added
Claus Gittinger <cg@exept.de>
parents:
5407
diff
changeset
|
1544 |
|
1 | 1545 |
! |
1546 |
||
699 | 1547 |
isUppercase |
7979
7515722ccfb1
isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents:
7976
diff
changeset
|
1548 |
"return true, if I am an upper-case letter. |
7515722ccfb1
isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents:
7976
diff
changeset
|
1549 |
This one does care for national characters." |
1 | 1550 |
|
1551 |
%{ /* NOCONTEXT */ |
|
7988
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1552 |
#define TRUE_IF_ODD(x) ((x & 1) ? true : false) |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1553 |
#define TRUE_IF_EVEN(x) ((x & 1) ? false : true) |
1 | 1554 |
|
1555 |
REGISTER int val; |
|
1556 |
||
1133 | 1557 |
val = __intVal(__INST(asciivalue)); |
7979
7515722ccfb1
isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents:
7976
diff
changeset
|
1558 |
|
7988
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1559 |
switch (val >> 8) { |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1560 |
case 0x00: |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1561 |
if ((unsigned)(val - 'A') <= ('Z' - 'A')) { |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1562 |
RETURN ( true ); |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1563 |
} |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1564 |
/* iso8859 puts national upper case characters at c0 .. df */ |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1565 |
if ((val >= 0xC0) && (val <= 0xDF)) { |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1566 |
if (val != 0xD7) { |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1567 |
RETURN(true); |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1568 |
} |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1569 |
} |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1570 |
RETURN (false); |
7979
7515722ccfb1
isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents:
7976
diff
changeset
|
1571 |
|
7988
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1572 |
case 0x01: |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1573 |
if (val <= 0x0137) { RETURN (TRUE_IF_EVEN(val)); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1574 |
if (val <= 0x0148) { RETURN (TRUE_IF_ODD(val)); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1575 |
if (val <= 0x0178) { RETURN (TRUE_IF_EVEN(val)); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1576 |
if (val <= 0x017E) { RETURN (TRUE_IF_ODD(val)); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1577 |
if (val < 0x01CD) { |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1578 |
if (val == 0x01C4) { RETURN (true); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1579 |
if (val == 0x01C5) { RETURN (true); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1580 |
if (val == 0x01C7) { RETURN (true); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1581 |
if (val == 0x01C8) { RETURN (true); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1582 |
if (val == 0x01CA) { RETURN (true); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1583 |
if (val == 0x01CB) { RETURN (true); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1584 |
RETURN (false); /* WRONG !!! */ |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1585 |
} |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1586 |
if (val <= 0x01DC) { RETURN (TRUE_IF_ODD(val)); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1587 |
if (val <= 0x01EF) { RETURN (TRUE_IF_EVEN(val)); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1588 |
if (val == 0x01F0) { RETURN (false); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1589 |
if (val == 0x01F1) { RETURN (true); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1590 |
if (val == 0x01F2) { RETURN (false); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1591 |
if (val == 0x01F3) { RETURN (false); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1592 |
if (val < 0x01F4) { RETURN (false); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1593 |
if (val <= 0x01F7) { RETURN (TRUE_IF_ODD(val)); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1594 |
RETURN (TRUE_IF_EVEN(val)); |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1595 |
|
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1596 |
case 0x02: |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1597 |
if (val <= 0x0233) { RETURN (TRUE_IF_EVEN(val)); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1598 |
RETURN (false); |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1599 |
|
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1600 |
case 0x03: |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1601 |
if (val < 0x0386) { RETURN (false); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1602 |
if (val == 0x0387) { RETURN (false); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1603 |
if (val <= 0x03AB) { RETURN (true); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1604 |
if (val <= 0x03D1) { RETURN (false); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1605 |
if (val <= 0x03D4) { RETURN (true); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1606 |
if (val <= 0x03D7) { RETURN (false); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1607 |
if (val <= 0x03EF) { RETURN (TRUE_IF_EVEN(val)); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1608 |
RETURN (false); |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1609 |
|
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1610 |
case 0x04: |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1611 |
if (val <= 0x042F) { RETURN (true); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1612 |
if (val <= 0x045F) { RETURN (false); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1613 |
if (val <= 0x0481) { RETURN (TRUE_IF_EVEN(val)); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1614 |
if (val < 0x048A) { RETURN (false); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1615 |
if (val == 0x04C0) { RETURN (false); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1616 |
RETURN (TRUE_IF_EVEN(val)); |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1617 |
|
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1618 |
case 0x05: |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1619 |
if (val <= 0x050F) { RETURN (TRUE_IF_EVEN(val)); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1620 |
if (val < 0x0531) { RETURN (false); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1621 |
if (val < 0x0556) { RETURN (true); } |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1622 |
RETURN (false); |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1623 |
|
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1624 |
case 0x06: |
7979
7515722ccfb1
isUppercase / isLowercase fix for division character.
Claus Gittinger <cg@exept.de>
parents:
7976
diff
changeset
|
1625 |
} |
7988
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1626 |
RETURN (false); |
7985 | 1627 |
|
7988
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1628 |
#undef TRUE_IF_ODD(x) |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1629 |
#undef TRUE_IF_EVEN(x) |
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1630 |
%} |
1 | 1631 |
! |
1632 |
||
699 | 1633 |
isVowel |
1634 |
"return true, if I am a vowel (lower- or uppercase)" |
|
333 | 1635 |
|
6066 | 1636 |
"/ I know the code is ugly; |
1637 |
"/ better code is: |
|
1638 |
"/ 'aeiou' includes:self asLowercase |
|
1639 |
"/ or: |
|
1640 |
"/ 'aeiouAEIOU' includes:self |
|
1641 |
"/ |
|
1642 |
"/ until I have a smart compiler, I use the shorter (codewise): |
|
1643 |
||
699 | 1644 |
(self == $a) ifTrue:[^ true]. |
1645 |
(self == $e) ifTrue:[^ true]. |
|
1646 |
(self == $i) ifTrue:[^ true]. |
|
1647 |
(self == $o) ifTrue:[^ true]. |
|
1648 |
(self == $u) ifTrue:[^ true]. |
|
1649 |
(self == $A) ifTrue:[^ true]. |
|
1650 |
(self == $E) ifTrue:[^ true]. |
|
1651 |
(self == $I) ifTrue:[^ true]. |
|
1652 |
(self == $O) ifTrue:[^ true]. |
|
1653 |
(self == $U) ifTrue:[^ true]. |
|
1654 |
^ false |
|
1 | 1655 |
! ! |
699 | 1656 |
|
4655
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1657 |
!Character methodsFor:'tracing'! |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1658 |
|
4682 | 1659 |
traceInto:aRequestor level:level from:referrer |
4655
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1660 |
"double dispatch into tracer, passing my type implicitely in the selector" |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1661 |
|
4682 | 1662 |
^ aRequestor traceCharacter:self level:level from:referrer |
4655
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1663 |
|
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1664 |
! ! |
b9405ca0bb4e
added #hasSharedInstances & tracing support
Claus Gittinger <cg@exept.de>
parents:
4340
diff
changeset
|
1665 |
|
2124 | 1666 |
!Character class methodsFor:'documentation'! |
699 | 1667 |
|
1668 |
version |
|
7988
cb1c920e67eb
isUppercase / isLowercase unicode changes
Claus Gittinger <cg@exept.de>
parents:
7987
diff
changeset
|
1669 |
^ '$Header: /cvs/stx/stx/libbasic/Character.st,v 1.99 2004-02-23 13:27:27 cg Exp $' |
699 | 1670 |
! ! |