CharacterArray.st
author Claus Gittinger <cg@exept.de>
Fri, 04 Jul 1997 11:23:39 +0200
changeset 2748 3002ca130a92
parent 2747 0a95a87bcefa
child 2749 62552ae3b004
permissions -rw-r--r--
checkin from browser
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
     1
"
af7aeb79b25e Initial revision
claus
parents:
diff changeset
     2
 COPYRIGHT (c) 1994 by Claus Gittinger
154
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
     3
	      All Rights Reserved
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
     4
af7aeb79b25e Initial revision
claus
parents:
diff changeset
     5
 This software is furnished under a license and may be used
af7aeb79b25e Initial revision
claus
parents:
diff changeset
     6
 only in accordance with the terms of that license and with the
af7aeb79b25e Initial revision
claus
parents:
diff changeset
     7
 inclusion of the above copyright notice.   This software may not
af7aeb79b25e Initial revision
claus
parents:
diff changeset
     8
 be provided or otherwise made available to, or used by, any
af7aeb79b25e Initial revision
claus
parents:
diff changeset
     9
 other person.  No title to or ownership of the software is
af7aeb79b25e Initial revision
claus
parents:
diff changeset
    10
 hereby transferred.
af7aeb79b25e Initial revision
claus
parents:
diff changeset
    11
"
af7aeb79b25e Initial revision
claus
parents:
diff changeset
    12
255
2b2c5c0facab *** empty log message ***
claus
parents: 247
diff changeset
    13
ByteArray subclass:#CharacterArray
735
362ce9e28d89 expandTabs now handles multiline strings
Claus Gittinger <cg@exept.de>
parents: 678
diff changeset
    14
	instanceVariableNames:''
2734
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    15
	classVariableNames:'PreviousMatch DecoderTables EncoderTables DecodingFailedSignal
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    16
		EncodingFailedSignal'
735
362ce9e28d89 expandTabs now handles multiline strings
Claus Gittinger <cg@exept.de>
parents: 678
diff changeset
    17
	poolDictionaries:''
362ce9e28d89 expandTabs now handles multiline strings
Claus Gittinger <cg@exept.de>
parents: 678
diff changeset
    18
	category:'Collections-Text'
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
    19
!
af7aeb79b25e Initial revision
claus
parents:
diff changeset
    20
1849
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
    21
!CharacterArray class methodsFor:'documentation'!
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
    22
82
0147b4f725ae *** empty log message ***
claus
parents: 77
diff changeset
    23
copyright
0147b4f725ae *** empty log message ***
claus
parents: 77
diff changeset
    24
"
0147b4f725ae *** empty log message ***
claus
parents: 77
diff changeset
    25
 COPYRIGHT (c) 1994 by Claus Gittinger
154
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
    26
	      All Rights Reserved
82
0147b4f725ae *** empty log message ***
claus
parents: 77
diff changeset
    27
0147b4f725ae *** empty log message ***
claus
parents: 77
diff changeset
    28
 This software is furnished under a license and may be used
0147b4f725ae *** empty log message ***
claus
parents: 77
diff changeset
    29
 only in accordance with the terms of that license and with the
0147b4f725ae *** empty log message ***
claus
parents: 77
diff changeset
    30
 inclusion of the above copyright notice.   This software may not
0147b4f725ae *** empty log message ***
claus
parents: 77
diff changeset
    31
 be provided or otherwise made available to, or used by, any
0147b4f725ae *** empty log message ***
claus
parents: 77
diff changeset
    32
 other person.  No title to or ownership of the software is
0147b4f725ae *** empty log message ***
claus
parents: 77
diff changeset
    33
 hereby transferred.
0147b4f725ae *** empty log message ***
claus
parents: 77
diff changeset
    34
"
0147b4f725ae *** empty log message ***
claus
parents: 77
diff changeset
    35
!
0147b4f725ae *** empty log message ***
claus
parents: 77
diff changeset
    36
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
    37
documentation
af7aeb79b25e Initial revision
claus
parents:
diff changeset
    38
"
255
2b2c5c0facab *** empty log message ***
claus
parents: 247
diff changeset
    39
    CharacterArray is a superclass for all kinds of Strings (i.e.
68
59faa75185ba *** empty log message ***
claus
parents: 64
diff changeset
    40
    (singleByte-)Strings, TwoByteStrings and whatever comes in the future.
154
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
    41
2537
9398b135cec4 comment
Claus Gittinger <cg@exept.de>
parents: 2520
diff changeset
    42
    This class is abstract, meaning that there are no instances of it. 
9398b135cec4 comment
Claus Gittinger <cg@exept.de>
parents: 2520
diff changeset
    43
    All this class does is provide common protocol for concrete subclasses.
1290
15ba3221b89b documentation
Claus Gittinger <cg@exept.de>
parents: 1272
diff changeset
    44
15ba3221b89b documentation
Claus Gittinger <cg@exept.de>
parents: 1272
diff changeset
    45
    [author:]
15ba3221b89b documentation
Claus Gittinger <cg@exept.de>
parents: 1272
diff changeset
    46
        Claus Gittinger
15ba3221b89b documentation
Claus Gittinger <cg@exept.de>
parents: 1272
diff changeset
    47
15ba3221b89b documentation
Claus Gittinger <cg@exept.de>
parents: 1272
diff changeset
    48
    [see also:]
15ba3221b89b documentation
Claus Gittinger <cg@exept.de>
parents: 1272
diff changeset
    49
        String TwoByteString
15ba3221b89b documentation
Claus Gittinger <cg@exept.de>
parents: 1272
diff changeset
    50
        StringCollection
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
    51
"
af7aeb79b25e Initial revision
claus
parents:
diff changeset
    52
! !
af7aeb79b25e Initial revision
claus
parents:
diff changeset
    53
2728
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
    54
!CharacterArray class methodsFor:'initialization'!
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
    55
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
    56
initialize
2730
1fbfddfa1d3f allow proceeding after a decoder error
Claus Gittinger <cg@exept.de>
parents: 2728
diff changeset
    57
    DecodingFailedSignal := Signal new mayProceed:true.
2728
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
    58
    DecodingFailedSignal notifierString:'decoding failure'.
2734
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    59
    EncodingFailedSignal := QuerySignal new mayProceed:true.
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    60
    EncodingFailedSignal notifierString:'encoding failure'.
2728
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
    61
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
    62
    "
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
    63
     CharacterArray initialize
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
    64
    "
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
    65
2734
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    66
    "Modified: 30.6.1997 / 19:18:16 / cg"
2728
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
    67
! !
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
    68
1849
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
    69
!CharacterArray class methodsFor:'instance creation'!
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
    70
af7aeb79b25e Initial revision
claus
parents:
diff changeset
    71
basicNew
af7aeb79b25e Initial revision
claus
parents:
diff changeset
    72
    "return a new empty string"
af7aeb79b25e Initial revision
claus
parents:
diff changeset
    73
af7aeb79b25e Initial revision
claus
parents:
diff changeset
    74
    ^ self basicNew:0
af7aeb79b25e Initial revision
claus
parents:
diff changeset
    75
!
af7aeb79b25e Initial revision
claus
parents:
diff changeset
    76
2734
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    77
fromBytes:aByteCollection
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    78
    "return an instance of the receiver class,
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    79
     taking untranslated bytes from the argument, aByteCollection.
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    80
     Only useful, when reading twoByteStrings from external sources."
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    81
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    82
    |mySize nBytes newString dstIdx|
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    83
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    84
    nBytes := aByteCollection size.
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    85
    mySize := self basicNew bitsPerCharacter.
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    86
    mySize == 16 ifTrue:[
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    87
        newString := self basicNew:(nBytes // 2).
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    88
        dstIdx := 1.
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    89
        aByteCollection pairWiseDo:[:hi :lo |
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    90
            newString at:dstIdx put:(Character value:(hi bitShift:8)+lo).
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    91
            dstIdx := dstIdx + 1
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    92
        ].
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    93
        ^ newString.
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    94
    ].
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    95
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    96
    ^ (self basicNew:nBytes) replaceFrom:1 with:aByteCollection
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    97
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    98
    "
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
    99
     TwoByteString fromBytes:#[16r21 16r21]
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   100
    "
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   101
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   102
    "Modified: 30.6.1997 / 20:08:37 / cg"
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   103
!
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   104
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
   105
fromString:aString
af7aeb79b25e Initial revision
claus
parents:
diff changeset
   106
    "return a copy of the argument, aString"
af7aeb79b25e Initial revision
claus
parents:
diff changeset
   107
68
59faa75185ba *** empty log message ***
claus
parents: 64
diff changeset
   108
    ^ (self basicNew:(aString size)) replaceFrom:1 with:aString
59faa75185ba *** empty log message ***
claus
parents: 64
diff changeset
   109
59faa75185ba *** empty log message ***
claus
parents: 64
diff changeset
   110
    "TwoByteString fromString:'hello'"
581
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   111
!
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   112
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   113
fromStringCollection:aCollectionOfStrings
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   114
    "return new string formed by concatenating a copy of the argument, aString"
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   115
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   116
    ^ self fromStringCollection:aCollectionOfStrings separatedBy:''
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   117
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   118
    "
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   119
     String fromStringCollection:#('hello' 'world' 'how' 'about' 'this')
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   120
    "
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   121
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   122
    "Created: 20.11.1995 / 15:26:59 / cg"
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   123
!
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   124
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   125
fromStringCollection:aCollectionOfStrings separatedBy:aSeparatorString
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   126
    "return new string formed by concatenating a copy of the argument, aString"
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   127
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   128
    |newString first|
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   129
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   130
    newString := ''.
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   131
    first := true.
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   132
    aCollectionOfStrings do:[:s | 
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   133
	first ifFalse:[
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   134
	    newString := newString , aSeparatorString
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   135
	] ifTrue:[
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   136
	    first := false
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   137
	].
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   138
	newString := newString , s
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   139
    ].
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   140
    ^ newString
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   141
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   142
    "
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   143
     String fromStringCollection:#('hello' 'world' 'how' 'about' 'this') separatedBy:' '
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   144
    "
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   145
8a991a4cb738 *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 528
diff changeset
   146
    "Created: 20.11.1995 / 15:32:17 / cg"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
   147
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
   148
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
   149
new
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
   150
    "return a new empty string"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
   151
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
   152
    ^ self basicNew:0
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
   153
! !
af7aeb79b25e Initial revision
claus
parents:
diff changeset
   154
2728
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
   155
!CharacterArray class methodsFor:'Signal constants'!
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
   156
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
   157
decodingFailedSignal
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
   158
    "return the signal raise when decoding of a string is not possible
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
   159
     due to invalid characters contained in the source.
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
   160
     This may happen for example, if a non EUC coded 8-bit string
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
   161
     is attempted to be decoded into a JIS string."
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
   162
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
   163
    ^ DecodingFailedSignal
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
   164
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
   165
    "Modified: 28.6.1997 / 20:09:35 / cg"
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
   166
    "Created: 28.6.1997 / 20:09:55 / cg"
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
   167
! !
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
   168
1849
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   169
!CharacterArray class methodsFor:'cleanup'!
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   170
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   171
lowSpaceCleanup
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   172
    "cleanup in low-memory situations"
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   173
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   174
    DecoderTables := EncoderTables := nil
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   175
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   176
    "
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   177
     CharacterArray lowSpaceCleanup
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   178
    "
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   179
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   180
    "Created: 22.2.1996 / 16:30:30 / cg"
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   181
    "Modified: 22.2.1996 / 17:58:05 / cg"
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   182
! !
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   183
1849
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   184
!CharacterArray class methodsFor:'code tables'!
443
Claus Gittinger <cg@exept.de>
parents: 428
diff changeset
   185
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   186
classForEncoding:encodingSymbol
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   187
    "return a class capable of representing text encoded
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   188
     by encodingSymbol"
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   189
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   190
    encodingSymbol == #jis ifTrue:[^ JISEncodedString].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   191
    encodingSymbol == #big5 ifTrue:[^ BIG5EncodedString].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   192
    encodingSymbol == #gb ifTrue:[^ GBEncodedString].
2734
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   193
    encodingSymbol == #ksc ifTrue:[^ KSCEncodedString].
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   194
    encodingSymbol == #unicode ifTrue:[^ UnicodeString].
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   195
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   196
    "/
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   197
    "/ assume its an 8-bit code
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   198
    "/
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   199
    ^ String
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   200
2734
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   201
    "Modified: 30.6.1997 / 19:48:05 / cg"
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   202
!
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   203
443
Claus Gittinger <cg@exept.de>
parents: 428
diff changeset
   204
decoderTableFor:encoding
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   205
    "return a table to decode from an alien encoding into
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   206
     the internal (iso8859, ansi) format.
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   207
     This is an experimental interface - unfinished"
443
Claus Gittinger <cg@exept.de>
parents: 428
diff changeset
   208
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   209
    |table unrepresentableCharacterCode x80Table|
443
Claus Gittinger <cg@exept.de>
parents: 428
diff changeset
   210
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   211
    DecoderTables notNil ifTrue:[
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   212
        table := DecoderTables at:encoding ifAbsent:nil.
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   213
        table notNil ifTrue:[
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   214
            table == #identity ifTrue:[^ nil].
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   215
            ^ table
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   216
        ].
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   217
    ] ifFalse:[
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   218
        DecoderTables := IdentityDictionary new
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   219
    ].
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   220
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   221
    unrepresentableCharacterCode := 16rBF.
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   222
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   223
    "/
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   224
    "/ setup as identityTranslation
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   225
    "/
443
Claus Gittinger <cg@exept.de>
parents: 428
diff changeset
   226
    table := (0 to:255) asByteArray.
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   227
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   228
    "/
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   229
    "/ hp -> iso8859
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   230
    "/
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   231
    "/ hp uses a funny encoding of the A1 .. FF characters
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   232
    "/
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   233
    "/   
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   234
    encoding == #hp ifTrue:[
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   235
        unrepresentableCharacterCode := 16rA0.
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   236
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   237
        x80Table :=
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   238
            #(
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   239
               16r0080 16r0081 16r0082 16r0083 16r0084 16r0085 16r0086 16r0087  "/ 80
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   240
               16r0088 16r0089 16r008a 16r008b 16r008c 16r008d 16r008e 16r008f
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   241
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   242
               16r0090 16r0091 16r0092 16r0093 16r0094 16r0095 16r0096 16r0097  "/ 90
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   243
               16r0098 16r0099 16r009a 16r009b 16r009c 16r009d 16r009e 16r009f
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   244
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   245
               16r00a0 16r00c0 16r00c2 16r00c8 16r00c9 16r00cb 16r00ce 16r00cf  "/ a0
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   246
               16r00b4 16rFFFF 16rFFFF 16r00a8 16rFFFF 16r00d9 16r00da 16r00a3
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   247
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   248
               16rFFFF 16r00dd 16r00fd 16r00b0 16r00c7 16r00e7 16r00d1 16r00f1  "/ b0
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   249
               16r00a1 16r00bf 16r00a4 16r00a3 16r00a5 16r00a7 16rFFFF 16rFFFF
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   250
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   251
               16r00e2 16r00ea 16r00f4 16r00fb 16r00e1 16r00e9 16r00f3 16r00fa  "/ c0
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   252
               16r00e0 16r00e8 16r00f2 16r00f9 16r00e4 16r00eb 16r00f6 16r00fc
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   253
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   254
               16r00c5 16r00ee 16r00d8 16r00c6 16r00e5 16r00ed 16r00f8 16r00e6  "/ d0
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   255
               16r00c4 16r00ec 16r00d6 16r00dc 16r00c9 16r00ef 16r00df 16r00d4
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   256
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   257
               16r00c1 16r00c3 16r00e3 16r00d0 16r00f0 16r00cd 16r00cc 16r00d3  "/ e0
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   258
"/                                        S ?       s ?             Y ?
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   259
               16r00d2 16r00d5 16r00f5 16rFFFF 16rFFFF 16r00da 16rFFFF 16r00ff 
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   260
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   261
               16r00de 16r00fe 16r00b7 16r00b5 16r00b6 16r00be 16r00ad 16r00bc  "/ f0
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   262
               16r00bd 16r00aa 16r00ba 16r00ab 16rFFFF 16r00bb 16r00b1 16rFFFF
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   263
            ).
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   264
    ].
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   265
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   266
    "/
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   267
    "/ mac -> iso8859
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   268
    "/
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   269
    "/ You will loose the following characters:
1440
7a38f09abf92 no 8-bit chars in comments (7-bit lexes dont like that)
Claus Gittinger <cg@exept.de>
parents: 1435
diff changeset
   270
    "/ - all of them will be replaced by 8F
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   271
    "/
1272
b45ddd6d6ec3 assume printer can handle ISO-latin1 encoded strings (national characters)
Claus Gittinger <cg@exept.de>
parents: 1255
diff changeset
   272
    "/  A0 (dagger)
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   273
    "/  A5 (dot-dot)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   274
    "/  AA (tm - trademark)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   275
    "/  AD (not equal)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   276
    "/  B0 (infinity)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   277
    "/  B3 (less-double-equal <=)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   278
    "/  B4 (greater-equal >-)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   279
    "/  B6 (math lowercase delta)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   280
    "/  B7 (math sum)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   281
    "/  B8 (math uppercase pi)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   282
    "/  B9 (math lowercase pi)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   283
    "/  BA (math integral)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   284
    "/  BD (math omega)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   285
    "/  C3 (math union)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   286
    "/  C4 (ext latin f)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   287
    "/  C5 (math almost equal)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   288
    "/  C6 (math uppercase delta)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   289
    "/  C9 (dot-dot-dot)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   290
    "/  CE (OE ligature)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   291
    "/  CF (oe ligature)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   292
    "/  D1 (hyphen)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   293
    "/  D2 (opening top dquote)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   294
    "/  D3 (closing top dquote)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   295
    "/  D4 (opening top quote)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   296
    "/  D5 (closing top quote)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   297
    "/  D7 (geometric: )
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   298
    "/  D9 (uppercase Y diacrit )
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   299
    "/  DA (slash2 )
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   300
    "/  DC (single oldStyle opening quote (<) )
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   301
    "/  DD (single oldStyle closing quote (>) )
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   302
    "/  DE (? )
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   303
    "/  DF (? )
1272
b45ddd6d6ec3 assume printer can handle ISO-latin1 encoded strings (national characters)
Claus Gittinger <cg@exept.de>
parents: 1255
diff changeset
   304
    "/  E0 (double dagger )
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   305
    "/  E2 (opening bottom quote )
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   306
    "/  E3 (opening bottom dquote )
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   307
    "/  F0 (? )
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   308
    "/  F5 (latin l )
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   309
    "/  F6 (diacrit circumflex)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   310
    "/  F7 (diacrit tilde)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   311
    "/  F8 (diacrit top line)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   312
    "/  F9 (diacrit inverse circumflex)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   313
    "/  FA (diacrit dot)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   314
    "/  FB (diacrit ring)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   315
    "/  FC (diacrit cedille left)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   316
    "/  FD (diacrit dquote)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   317
    "/  FE (diacrit cedille right)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   318
    "/  FF (diacrit circumflex2 ?)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   319
    "/   
443
Claus Gittinger <cg@exept.de>
parents: 428
diff changeset
   320
    encoding == #mac ifTrue:[
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   321
        x80Table :=
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   322
            #( 16r00c4 16r00c5 16r00c7 16r00c9 16r00d1 16r00d6 16r00dc 16r00e1  "/ 80
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   323
               16r00e0 16r00e2 16r00e4 16r00e3 16r00e5 16r00e7 16r00e9 16r00e8
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   324
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   325
               16r00ea 16r00eb 16r00ed 16r00ec 16r00ee 16r00ef 16r00f1 16r00f3  "/ 90
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   326
               16r00f2 16r00f4 16r00f6 16r00f5 16r00fa 16r00f9 16r00fb 16r00fc
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   327
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   328
               16rFFFF 16r00b0 16r00a2 16r00a3 16r00a7 16rFFFF 16r00b6 16r00df  "/ a0
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   329
               16r00ae 16r00a9 16rFFFF 16r00b4 16r00a8 16rFFFF 16r00c6 16r00d8
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   330
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   331
               16rFFFF 16r00b1 16rFFFF 16rFFFF 16r00a5 16r00b5 16rFFFF 16rFFFF  "/ b0
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   332
               16rFFFF 16rFFFF 16rFFFF 16r00aa 16r00ba 16rFFFF 16r00e6 16r00f8
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   333
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   334
               16r00bf 16r00a1 16r00ac 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16r00ab  "/ c0
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   335
               16r00bb 16rFFFF 16r00a0 16r00c1 16r00c3 16r00d5 16rFFFF 16rFFFF
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   336
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   337
               16r00ad 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16r00f7 16rFFFF  "/ d0
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   338
               16rFFFF 16rFFFF 16rFFFF 16r00a4 16rFFFF 16rFFFF 16rFFFF 16rFFFF
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   339
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   340
               16rFFFF 16r00b7 16rFFFF 16rFFFF 16rFFFF 16r00c2 16r00ca 16r00c1  "/ e0
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   341
               16r00cb 16r00c8 16r00cd 16r00ce 16r00cf 16r00cc 16r00d3 16r00d4
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   342
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   343
               16rFFFF 16r00d2 16r00da 16r00db 16r00d9 16rFFFF 16rFFFF 16rFFFF  "/ f0
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   344
               16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   345
            ).
657
449935f15b9e *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 652
diff changeset
   346
    ].
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   347
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   348
    "/
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   349
    "/ next -> iso8859
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   350
    "/
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   351
    "/ You will loose the following characters:
1440
7a38f09abf92 no 8-bit chars in comments (7-bit lexes dont like that)
Claus Gittinger <cg@exept.de>
parents: 1435
diff changeset
   352
    "/ - all of them will be replaced by 8F
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   353
    "/
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   354
    "/  A4 (diacrit /)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   355
    "/  A9 (diacrit quote)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   356
    "/  AA (top opening dquote)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   357
    "/  AC (single oldStyle opening quote (<))
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   358
    "/  AD (single oldStyle closing quote (>))
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   359
    "/  AE (?)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   360
    "/  AF (?)
1272
b45ddd6d6ec3 assume printer can handle ISO-latin1 encoded strings (national characters)
Claus Gittinger <cg@exept.de>
parents: 1255
diff changeset
   361
    "/  B2 (dagger)
b45ddd6d6ec3 assume printer can handle ISO-latin1 encoded strings (national characters)
Claus Gittinger <cg@exept.de>
parents: 1255
diff changeset
   362
    "/  B3 (double dagger)
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   363
    "/  B7 (enter dot)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   364
    "/  B8 (opening bottom quote)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   365
    "/  B9 (opening bottom dquote)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   366
    "/  BA (closing top dquote)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   367
    "/  BC (dot-dot-dot)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   368
    "/  BD (per-mille)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   369
    "/  C0 (subscript 1)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   370
    "/  C1 (accent grave)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   371
    "/  C2 (accent degu)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   372
    "/  C3 (accent circonflex)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   373
    "/  C4 (accent tilde)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   374
    "/  C5 (accent line)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   375
    "/  C6 (accent circonflex reverse)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   376
    "/  C7 (accent dot)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   377
    "/  C8 (accent dot-dot)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   378
    "/  D0 (hline)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   379
    "/  E8 (engl. pound upper case L)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   380
    "/  EA (OE ligature)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   381
    "/  F5 (latin l)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   382
    "/  F8 (engl. pound lower case l)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   383
    "/  FE (?)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   384
    "/  FF (?)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   385
    "/
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   386
    encoding == #next ifTrue:[
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   387
        unrepresentableCharacterCode := 16rBF.
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   388
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   389
        x80Table :=
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   390
            #( 16r0080 16r00c0 16r00c1 16r00c2 16r00c3 16r00c4 16r00c5 16r00c7  "/ 80
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   391
               16r00c8 16r00c9 16r00ca 16r00cb 16r00cc 16r00cd 16r00ce 16r00cf
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   392
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   393
               16r00d0 16r00d1 16r00d2 16r00d3 16r00d4 16r00d5 16r00d6 16r00d9  "/ 90
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   394
               16r00da 16r00db 16r00dc 16r00dd 16r00de 16r00b5 16r00d7 16r00f7
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   395
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   396
               16r00a9 16r00a1 16r00a2 16r00a3 16rFFFF 16r00a5 16rFFFF 16r00a7  "/ a0
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   397
               16r00a4 16rFFFF 16rFFFF 16r00ab 16rFFFF 16rFFFF 16rFFFF 16rFFFF
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   398
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   399
               16r00ae 16r00ad 16rFFFF 16rFFFF 16r00b7 16r00a6 16r00b6 16rFFFF  "/ b0
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   400
               16rFFFF 16rFFFF 16rFFFF 16r00bb 16rFFFF 16rFFFF 16r00ac 16r00bf
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   401
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   402
               16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF  "/ c0
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   403
               16rFFFF 16r00b2 16r00b0 16r00b8 16r00b3 16rFFFF 16rFFFF 16rFFFF
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   404
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   405
               16rFFFF 16r00b1 16r00bc 16r00bd 16r00be 16r00e0 16r00e1 16r00e2  "/ d0
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   406
               16r00e3 16r00e4 16r00e5 16r00e7 16r00e8 16r00e9 16r00ea 16r00eb
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   407
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   408
               16r00ec 16r00c6 16r00ed 16r00aa 16r00ee 16r00ef 16r00f0 16r00f1  "/ e0
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   409
               16rFFFF 16r00d8 16rFFFF 16r00ba 16r00f2 16r00f3 16r00f4 16r00f5
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   410
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   411
               16r00f6 16r00e6 16r00f9 16r00fa 16r00fb 16rFFFF 16r00fc 16r00fd  "/ f0
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   412
               16rFFFF 16r00f8 16rFFFF 16r00df 16r00fe 16r00ff 16rFFFF 16rFFFF
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   413
            ).
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   414
    ].
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   415
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   416
    "/
1849
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   417
    "/ msdos (codePage 437 [=US]) -> iso8859
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   418
    "/
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   419
    "/
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   420
    "/ You will loose the following characters:
1440
7a38f09abf92 no 8-bit chars in comments (7-bit lexes dont like that)
Claus Gittinger <cg@exept.de>
parents: 1435
diff changeset
   421
    "/ - all of them will be replaced by 8F
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   422
    "/
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   423
    "/  9E      (pesetas)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   424
    "/  9F      (latin f)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   425
    "/  A9      (technical not)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   426
    "/  B0 - BF (block graphic)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   427
    "/  C0 - CF (block graphic)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   428
    "/  D0 - DF (block graphic)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   429
    "/  E0      (greek alpha)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   430
    "/  E2 - E5 (greek)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   431
    "/  E7 - EC (greek & math)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   432
    "/  EE - EF (greek & math)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   433
    "/  F0      (math)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   434
    "/  F2 - F5 (math & technical)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   435
    "/  F7      (math)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   436
    "/  F9      (center dot)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   437
    "/  FB      (math sqrt)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   438
    "/  FC      (super n)
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   439
    "/  FE      (block)
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   440
    "/
1849
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   441
    encoding == #msdos437 ifTrue:[
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   442
        unrepresentableCharacterCode := 16rBF.
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   443
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   444
        x80Table :=
1849
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   445
            #(
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   446
               "/ Ccedil  uuml  eacute  acirc   auml    agrave  aring   ccedil
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   447
               16r00c7 16r00fc 16r00e9 16r00e2 16r00e4 16r00e0 16r00e5 16r00e7  "/ 80
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   448
               "/ ecirc   euml  egrave  iuml    icirc   igrave  Auml    Aring
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   449
               16r00ea 16r00eb 16r00e8 16r00ef 16r00ee 16r00ec 16r00c4 16r00c5
1849
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   450
               "/ Eacute  ae     AE     ocirc   ouml    ograve  ucirc   ugrave
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   451
               16r00c9 16r00e6 16r00c6 16r00f4 16r00f6 16r00f2 16r00fb 16r00f9  "/ 90
1849
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   452
               "/ yuml    Ouml   Uuml   cent    pound   Yen     psetas  latin-f
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   453
               16r00ff 16r00d6 16r00dc 16r00a2 16r00a3 16r00a5 16rFFFF 16rFFFF
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   454
1849
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   455
               "/ aacute iacute oacute  uacute  ntilde  Ntilde  asup    0sup 
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   456
               16r00e1 16r00ed 16r00f3 16r00fa 16r00f1 16r00d1 16r00aa 16r00b0  "/ a0
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   457
               "/ iquest        lognot  1/2     1/4     iexcla  <<      >>
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   458
               16r00bf 16rFFFF 16r00ac 16r00bd 16r00bc 16r00a1 16r00ab 16r00bb
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   459
1849
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   460
               "/ block graphics
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   461
               16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF  "/ b0
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   462
               16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   463
1849
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   464
               "/ block graphics
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   465
               16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF  "/ c0
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   466
               16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   467
1849
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   468
               "/ block graphics
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   469
               16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF  "/ d0
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   470
               16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   471
1849
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   472
               "/ alpha beta   ...                              mu
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   473
               16rFFFF 16r00df 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16r00b5 16rFFFF  "/ e0
1849
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   474
               16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16r00f8 16rFFFF
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   475
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   476
               "/       plusmin  gr-eq  lt-eq   integr  integr  div
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   477
               16rFFFF 16r00b1 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16r00f7 16rFFFF  "/ f0
1849
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   478
               "/ 0sup  dot      dot    sqr             sup2            space 
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   479
               16r00b0 16rFFFF 16r00b7 16rFFFF 16rFFFF 16r00b2 16rFFFF 16r00a0
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   480
            ).
443
Claus Gittinger <cg@exept.de>
parents: 428
diff changeset
   481
    ].
1849
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   482
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   483
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   484
    "/
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   485
    "/ msdos (codePage 850 [=latin1]) -> iso8859
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   486
    "/
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   487
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   488
    (encoding == #msdos850 
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   489
    or:[encoding == #msdos]) ifTrue:[
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   490
        unrepresentableCharacterCode := 16rBF.
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   491
        x80Table :=
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   492
            #(
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   493
               "/ Ccedil  uuml  eacute  acirc   auml    agrave  aring   ccedil
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   494
               16r00c7 16r00fc 16r00e9 16r00e2 16r00e4 16r00e0 16r00e5 16r00e7  "/ 80
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   495
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   496
               "/ ecirc   euml  egrave  iuml    icirc   igrave  Auml    Aring
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   497
               16r00ea 16r00eb 16r00e8 16r00ef 16r00ee 16r00ec 16r00c4 16r00c5
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   498
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   499
               "/ Eacute  ae     AE     ocirc   ouml    ograve  ucirc   ugrave
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   500
               16r00c9 16r00e6 16r00c6 16r00f4 16r00f6 16r00f2 16r00fb 16r00f9  "/ 90
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   501
               "/ yuml    Ouml   Uuml   oslash    pound Oslash  x      latin-f
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   502
               16r00ff 16r00d6 16r00dc 16r00f8 16r00a3 16r00d8 16rFFFF 16rFFFF
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   503
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   504
               "/ aacute iacute oacute  uacute  ntilde  Ntilde  asup    0sup 
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   505
               16r00e1 16r00ed 16r00f3 16r00fa 16r00f1 16r00d1 16r00aa 16r00da  "/ a0
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   506
               "/ iquest reg    lognot  1/2     1/4     iexcla  <<      >>
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   507
               16r00bf 16r00ae 16r00ac 16r00bd 16r00bc 16r00a1 16r00ab 16r00bb
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   508
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   509
               "/                                       Aacute  Acirc   Agrave
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   510
               16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16r00c1 16r00c2 16r00c0  "/ b0
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   511
               "/ cpyr                                  cent    Yen
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   512
               16r00a9 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16r00a2 16r00a5 16rFFFF
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   513
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   514
               "/                                               atilde  Atilde
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   515
               16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16r00e3 16r00c3  "/ c0
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   516
               "/                                               
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   517
               16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16rFFFF 
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   518
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   519
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   520
               "/               Ecirc   Euml    Egrave          Iacute  Icirc
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   521
               16rFFFF 16rFFFF 16r00ca 16r00cb 16r00c8 16rFFFF 16r00dc 16r00ce  "/ d0
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   522
               "/ Iuml                                          Igrave
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   523
               16r00cf 16rFFFF 16rFFFF 16rFFFF 16rFFFF 16r007c 16r00cc 16rFFFF  
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   524
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   525
               "/ Oacute  sz    Ocirc   Ograve  otilde  Otilde   mu
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   526
               16r00d3 16r00df 16r00d4 16r00d2 16rFFFF 16r00f5 16r00b5 16rFFFF  "/ e0
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   527
               "/       Uacute  Ucirc   Ugrav    yacute  Yacute             '
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   528
               16rFFFF 16r00da 16r00db 16r00d9 16r00fd 16r00dd 16rFFFF 16r0027
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   529
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   530
               "/       plusmin  =       3/4    para   sect      div            "/ f0
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   531
               16rFFFF 16r00b1 16r003d 16r00be 16r00b6 16r00a7 16r00f7 16rFFFF
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   532
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   533
               "/ sup0                          sup3   sup2
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   534
               16r00b0 16rFFFF 16rFFFF 16rFFFF 16r00b3 16r00b2 16rFFFF 16r00a0
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   535
            ).
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   536
    ].
443
Claus Gittinger <cg@exept.de>
parents: 428
diff changeset
   537
    "/ more encodings needed here ....
Claus Gittinger <cg@exept.de>
parents: 428
diff changeset
   538
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   539
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   540
    "/
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   541
    "/ x80Table should contain the translation for
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   542
    "/ 0x80..0xFF characters
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   543
    "/
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   544
    x80Table isNil ifTrue:[
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   545
        table := #identity.
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   546
    ] ifFalse:[
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   547
        x80Table keysAndValuesDo:[:idx :repl |
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   548
            |ch|
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   549
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   550
            repl == 16rFFFF ifTrue:[
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   551
                ch := unrepresentableCharacterCode
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   552
            ] ifFalse:[
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   553
                ch := repl
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   554
            ].
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   555
            table at:(16r80 + idx) put:ch
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   556
        ].
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   557
    ].
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   558
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   559
    DecoderTables at:encoding put:table.
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   560
    ^ table
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   561
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   562
    "
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   563
     EncoderTables := nil.
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   564
     DecoderTables := nil
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   565
    "
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   566
443
Claus Gittinger <cg@exept.de>
parents: 428
diff changeset
   567
    "Created: 20.10.1995 / 23:04:43 / cg"
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   568
    "Modified: 1.7.1997 / 17:24:49 / cg"
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   569
!
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   570
2734
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   571
defaultRomanCharacterValue
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   572
    "return the characterValue for unmappable roman characters
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   573
     (only applies to jis, gb, ksc etc.)"
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   574
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   575
    |romans|
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   576
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   577
    romans := self romanTable.
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   578
    romans isNil ifTrue:[^ 16r20].
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   579
    ^ romans at:1.              "/ space characters xlation
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   580
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   581
    "Created: 30.6.1997 / 18:13:24 / cg"
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   582
    "Modified: 30.6.1997 / 18:19:26 / cg"
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   583
!
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   584
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   585
encoderTableFor:encoding
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   586
    "return a table to encode from the internal (iso8859, ansi) format
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   587
     into an alien encoding.
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   588
     This is an experimental interface - unfinished"
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   589
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   590
    |table decoderTable unrepresentableCharacterCode 
1669
4951596746f7 comments & code cleanup
Claus Gittinger <cg@exept.de>
parents: 1613
diff changeset
   591
     unrepresentableISOCharacterCode|
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   592
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   593
    EncoderTables notNil ifTrue:[
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   594
        table := EncoderTables at:encoding ifAbsent:nil.
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   595
        table notNil ifTrue:[
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   596
            table == #identity ifTrue:[^ nil].
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   597
            ^ table
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   598
        ].
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   599
    ] ifFalse:[
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   600
        EncoderTables := IdentityDictionary new
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   601
    ].
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   602
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   603
    "/
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   604
    "/ iso8859 -> hp
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   605
    "/
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   606
    encoding == #hp ifTrue:[
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   607
        decoderTable := self decoderTableFor:encoding.
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   608
        unrepresentableCharacterCode := 16rFF.
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   609
    ].
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   610
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   611
    "/
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   612
    "/ iso8859 -> mac
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   613
    "/
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   614
    encoding == #mac ifTrue:[
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   615
        decoderTable := self decoderTableFor:encoding.
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   616
        unrepresentableCharacterCode := 16rFF.
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   617
    ].
1272
b45ddd6d6ec3 assume printer can handle ISO-latin1 encoded strings (national characters)
Claus Gittinger <cg@exept.de>
parents: 1255
diff changeset
   618
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   619
    "/
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   620
    "/ iso8859 -> msdos
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   621
    "/
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   622
    encoding == #msdos ifTrue:[
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   623
        decoderTable := self decoderTableFor:encoding.
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   624
        unrepresentableCharacterCode := 16rFF.
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   625
    ].
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   626
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   627
    "/ more encodings needed here ....
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   628
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   629
    decoderTable notNil ifTrue:[
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   630
        table := Array new:256 withAll:unrepresentableCharacterCode.
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   631
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   632
        0 to:16rFF do:[:code |
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   633
            |isoCode destIdex|
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   634
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   635
            isoCode := decoderTable at:(code+1).
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   636
            table at:isoCode+1 put:code.
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   637
        ]
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   638
    ].
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   639
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   640
    table isNil ifTrue:[
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   641
        "/
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   642
        "/ cannot put a nil into an Id-Dict;
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   643
        "/ use special #identity
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   644
        "/
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   645
        table := #identity.
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   646
    ].
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   647
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   648
    EncoderTables at:encoding put:table.
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   649
    ^ table
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   650
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   651
    "
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   652
     EncoderTables := nil.
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   653
     DecoderTables := nil.
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   654
     CharacterArray decoderTableFor:#mac
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   655
     CharacterArray encoderTableFor:#mac
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   656
    "
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   657
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   658
    "Created: 22.2.1996 / 16:17:58 / cg"
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   659
    "Modified: 1.7.1997 / 17:26:28 / cg"
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   660
!
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   661
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
   662
romanTable
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
   663
    ^ nil
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
   664
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
   665
    "Created: 30.6.1997 / 15:13:52 / cg"
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
   666
!
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
   667
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
   668
supportedEncodings
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   669
    "return an array containing symbolic names of supported encodings. 
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   670
     These are internally visible supported ones only"
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   671
2452
7262e3b2c20a fixed supportedExternalEncodings
Claus Gittinger <cg@exept.de>
parents: 2344
diff changeset
   672
    ^ self supportedExternalEncodings at:2.
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
   673
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
   674
    "Modified: 30.6.1997 / 14:38:25 / cg"
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   675
!
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   676
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   677
supportedExternalEncodings
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   678
    "return an array of two arrays containing the names of supported
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   679
     encodings which are supported for external resources (i.e. files).
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   680
     The first array contains user-readable strings (descriptions),
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   681
     the second contains the internally used symbolic names.
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   682
     More than one external name may be mapped onto the same symbolic."
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   683
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   684
    ^ #( 
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   685
         (
2250
4b7a93d96b3f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   686
          'iso8859      (ansi)'  
4b7a93d96b3f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   687
          'iso8859-1    (latin1)'  
4b7a93d96b3f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   688
          'iso8859-2    (latin2)'  
4b7a93d96b3f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   689
          'iso8859-3    (latin3)'  
4b7a93d96b3f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   690
          'iso8859-4    (latin4)'  
4b7a93d96b3f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   691
          'iso8859-5    (cyrillic)'  
4b7a93d96b3f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   692
          'iso8859-6    (arabic)'  
4b7a93d96b3f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   693
          'iso8859-7    (greek)'  
4b7a93d96b3f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   694
          'iso8859-8    (hebrew)'  
4b7a93d96b3f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   695
          'msdos US     (codepage 437)'  
1849
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   696
          'msdos Latin1 (codepage 850)'  
2250
4b7a93d96b3f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   697
          'macintosh    (8 bit)' 
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   698
          'next         (8 bit)' 
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   699
          'hp           (8 bit)' 
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
   700
          nil
2250
4b7a93d96b3f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   701
          'EUC          (extended unix code japanese)' 
4b7a93d96b3f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   702
          'JIS7         (jis 7bit escape codes japanese)'
2720
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
   703
          'SJIS         (shift jis 8bit codes japanese)'
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
   704
          nil
2250
4b7a93d96b3f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   705
          'GB           (mainland china)'
4b7a93d96b3f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   706
          'BIG5         (taiwan)'
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
   707
"/          'KSC          (korean)'
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   708
         )
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   709
         (
2452
7262e3b2c20a fixed supportedExternalEncodings
Claus Gittinger <cg@exept.de>
parents: 2344
diff changeset
   710
          #'iso8859'
7262e3b2c20a fixed supportedExternalEncodings
Claus Gittinger <cg@exept.de>
parents: 2344
diff changeset
   711
          #'iso8859-1'    "/ latin1
7262e3b2c20a fixed supportedExternalEncodings
Claus Gittinger <cg@exept.de>
parents: 2344
diff changeset
   712
          #'iso8859-2'    "/ latin2
7262e3b2c20a fixed supportedExternalEncodings
Claus Gittinger <cg@exept.de>
parents: 2344
diff changeset
   713
          #'iso8859-3'    "/ latin3
7262e3b2c20a fixed supportedExternalEncodings
Claus Gittinger <cg@exept.de>
parents: 2344
diff changeset
   714
          #'iso8859-4'    "/ latin4
7262e3b2c20a fixed supportedExternalEncodings
Claus Gittinger <cg@exept.de>
parents: 2344
diff changeset
   715
          #'iso8859-5'    "/ cyrillic
7262e3b2c20a fixed supportedExternalEncodings
Claus Gittinger <cg@exept.de>
parents: 2344
diff changeset
   716
          #'iso8859-6'    "/ arabic
7262e3b2c20a fixed supportedExternalEncodings
Claus Gittinger <cg@exept.de>
parents: 2344
diff changeset
   717
          #'iso8859-7'    "/ greek
7262e3b2c20a fixed supportedExternalEncodings
Claus Gittinger <cg@exept.de>
parents: 2344
diff changeset
   718
          #'iso8859-8'    "/ hebrew
7262e3b2c20a fixed supportedExternalEncodings
Claus Gittinger <cg@exept.de>
parents: 2344
diff changeset
   719
          #'msdos437'                   
7262e3b2c20a fixed supportedExternalEncodings
Claus Gittinger <cg@exept.de>
parents: 2344
diff changeset
   720
          #'msdos850'                   
7262e3b2c20a fixed supportedExternalEncodings
Claus Gittinger <cg@exept.de>
parents: 2344
diff changeset
   721
          #'mac'         
7262e3b2c20a fixed supportedExternalEncodings
Claus Gittinger <cg@exept.de>
parents: 2344
diff changeset
   722
          #'next'   
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   723
          #'hp'   
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
   724
          nil
2452
7262e3b2c20a fixed supportedExternalEncodings
Claus Gittinger <cg@exept.de>
parents: 2344
diff changeset
   725
          #'euc'   
7262e3b2c20a fixed supportedExternalEncodings
Claus Gittinger <cg@exept.de>
parents: 2344
diff changeset
   726
          #'jis7'   
2721
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
   727
          #'sjis'   
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
   728
          nil
2452
7262e3b2c20a fixed supportedExternalEncodings
Claus Gittinger <cg@exept.de>
parents: 2344
diff changeset
   729
          #'gb'   
7262e3b2c20a fixed supportedExternalEncodings
Claus Gittinger <cg@exept.de>
parents: 2344
diff changeset
   730
          #'big5'
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
   731
"/          #'ksc'           "/ korean
2250
4b7a93d96b3f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2124
diff changeset
   732
         )
1250
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   733
       )
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   734
0f628432aa93 fixed encoding/decoding for msdos & mac; added next decoder
Claus Gittinger <cg@exept.de>
parents: 1241
diff changeset
   735
    "Created: 22.4.1996 / 14:39:39 / cg"
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
   736
    "Modified: 1.7.1997 / 17:28:09 / cg"
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   737
! !
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   738
1849
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
   739
!CharacterArray class methodsFor:'encoding / decoding'!
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   740
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   741
decodeFromBIG5:aString
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   742
    "return a new string containing the characters from aString,
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   743
     which is interpreted as a BIG5 encoded singleByte string.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   744
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   745
     The result is a Big5EncodedString (you need a BIG5 font to display that ...)."
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   746
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   747
    |newString|
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   748
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   749
    newString := BIG5EncodedString new:aString size.
1204
977e83b2c166 oops - leftover prints
Claus Gittinger <cg@exept.de>
parents: 1203
diff changeset
   750
    ^ self decodeFromBIG5_or_GB:aString into:newString.
977e83b2c166 oops - leftover prints
Claus Gittinger <cg@exept.de>
parents: 1203
diff changeset
   751
977e83b2c166 oops - leftover prints
Claus Gittinger <cg@exept.de>
parents: 1203
diff changeset
   752
    "Modified: 17.4.1996 / 18:58:29 / cg"
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   753
!
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   754
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   755
decodeFromBIG5_or_GB:aString into:a16BitString
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   756
    "return a new string containing the characters from aString,
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   757
     which is interpreted as a BIG5 encoded singleByte string.
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   758
     The result is a Big5EncodedString (you need a BIG5 font to display that ...).
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   759
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   760
     This is a first-class candidate for a primitive"
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   761
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   762
    |sz     "{ Class: SmallInteger }"
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   763
     dstIdx "{ Class: SmallInteger }"
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   764
     srcIdx "{ Class: SmallInteger }"
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   765
     b1     "{ Class: SmallInteger }"
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   766
     b2     "{ Class: SmallInteger }"
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   767
     val    "{ Class: SmallInteger }"
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   768
     c|
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   769
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   770
    sz := aString size.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   771
    sz ~~ 0 ifTrue:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   772
        dstIdx := 1.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   773
        srcIdx := 1.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   774
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   775
        [true] whileTrue:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   776
            c := aString at:srcIdx.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   777
            "/
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   778
            "/ characters below 16rA1 are left untranslated
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
   779
            "/ (romans).
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
   780
            "/ Translation into roman-row is done at display time.
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   781
            "/
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   782
            b1 := c asciiValue.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   783
            b1 >= 16rA1 ifTrue:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   784
                srcIdx := srcIdx + 1.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   785
                srcIdx <= sz ifTrue:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   786
                    b2 := (aString at:srcIdx) asciiValue.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   787
                    val := (b1 bitShift:8) bitOr:b2.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   788
                    c := Character value:val.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   789
                ]
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   790
            ].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   791
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   792
            a16BitString at:dstIdx put:c.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   793
            dstIdx := dstIdx + 1.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   794
            srcIdx := srcIdx + 1.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   795
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   796
            srcIdx > sz ifTrue:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   797
                ^ a16BitString copyFrom:1 to:dstIdx-1.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   798
            ]
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   799
        ]
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   800
    ].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   801
    ^ a16BitString
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   802
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   803
    "Created: 17.4.1996 / 16:55:54 / cg"
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   804
    "Modified: 4.7.1997 / 11:01:38 / cg"
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   805
!
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   806
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   807
decodeFromEUC:aString
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   808
    "return a new string containing the characters from aString,
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   809
     which is interpreted as an EUC encoded singleByte string.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   810
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   811
     There are various JIS encodings around (New-JIS, Old-JIS and NEC-JIS);
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   812
     this one only understands New-JIS.
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   813
     The result is a JISEncodedString (you need a JIS font to display that ...).
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   814
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   815
     This is a first-class candidate for a primitive"
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   816
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   817
    |newString 
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   818
     sz     "{ Class: SmallInteger }"
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   819
     dstIdx "{ Class: SmallInteger }"
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   820
     start  "{ Class: SmallInteger }"
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   821
     b1     "{ Class: SmallInteger }"
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   822
     b2     "{ Class: SmallInteger }"
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   823
     val    "{ Class: SmallInteger }"
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
   824
     c c2|
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   825
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   826
    sz := aString size.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   827
    newString := JISEncodedString new:sz.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   828
    sz ~~ 0 ifTrue:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   829
        dstIdx := 1.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   830
        start := 1.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   831
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   832
        [true] whileTrue:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   833
            c := aString at:start.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   834
            b1 := c asciiValue.
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   835
            b1 < 161 ifTrue:[
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   836
                "/ characters below 16rA1 are left untranslated
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   837
                "/ (control character or roman).
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   838
                newString at:dstIdx put:c.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   839
            ] ifFalse:[
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   840
                start := start + 1.
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   841
                start <= sz ifTrue:[    
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   842
                    b1 := b1 - 128.
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   843
                    b2 := (c2 := aString at:start) asciiValue.
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   844
                    b2 := b2 - 128.
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   845
                    val := (b1 bitShift:8) bitOr:b2.
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   846
                    val <= 0 ifTrue:[
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   847
                        DecodingFailedSignal 
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   848
                            raiseWith:aString
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   849
                            errorString:'EUC decoding failed (not EUC encoded ?)'.
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   850
                        newString at:dstIdx put:c.
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   851
                        dstIdx := dstIdx + 1.
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   852
                        newString at:dstIdx put:c2.
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   853
                    ] ifFalse:[
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   854
                        newString at:dstIdx put:(Character value:val).
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   855
                    ].
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   856
                ] ifFalse:[
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   857
                    newString at:dstIdx put:c.
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   858
                ].
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   859
            ].
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   860
            dstIdx := dstIdx + 1.
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   861
            start := start + 1.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   862
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   863
            start > sz ifTrue:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   864
                ^ newString copyFrom:1 to:dstIdx-1.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   865
            ]
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   866
        ]
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   867
    ].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   868
    ^ newString
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   869
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   870
    "simple:
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   871
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
   872
         'hello' decodeFrom:#euc 
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   873
    "
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   874
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   875
    "Created: 17.4.1996 / 16:10:22 / cg"
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   876
    "Modified: 4.7.1997 / 11:06:05 / cg"
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   877
!
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   878
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   879
decodeFromGB:aString
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   880
    "return a new string containing the characters from aString,
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   881
     which is interpreted as a GB encoded singleByte string.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   882
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   883
     The result is a Big5EncodedString (you need a GB font to display that ...)."
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   884
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   885
    |newString|
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   886
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   887
    newString := GBEncodedString new:aString size.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   888
    ^ self decodeFromBIG5_or_GB:aString into:newString
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   889
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   890
    "Created: 17.4.1996 / 16:56:33 / cg"
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   891
!
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   892
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   893
decodeFromJIS7:aString
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   894
    "return a new string containing the aStrings characters,
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   895
     which are interpreted as a JIS7 encoded singleByte string.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   896
     There are various JIS encodings around (New-JIS, Old-JIS and NEC-JIS);
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   897
     this one understands New-JIS and treats Old-JIS just the same.
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   898
     The result is a JISEncodedString (you need a JIS font to display that ...).
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   899
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
   900
     This is a first-class candidate for a primitive"
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   901
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   902
    |newString 
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   903
     sz         "{ Class: SmallInteger }"
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   904
     dstIdx     "{ Class: SmallInteger }"
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   905
     start      "{ Class: SmallInteger }"
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   906
     stop       "{ Class: SmallInteger }"
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   907
     n1 n2 n3  
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   908
     b1         "{ Class: SmallInteger }"
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   909
     b2         "{ Class: SmallInteger }"
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   910
     val        "{ Class: SmallInteger }"
2734
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   911
     singleBytes katakana c|
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   912
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   913
    sz := aString size.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   914
    newString := JISEncodedString new:sz.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   915
    sz ~~ 0 ifTrue:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   916
        dstIdx := 1.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   917
        start := 1.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   918
        singleBytes := true.
2734
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   919
        katakana := false.
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   920
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   921
        [true] whileTrue:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   922
            "/
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   923
            "/ scan for next escape"
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   924
            "/
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   925
            stop := aString indexOf:(Character esc) startingAt:start.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   926
            stop == 0 ifTrue:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   927
                stop := sz + 1.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   928
            ] ifFalse:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   929
                (stop + 2) > sz ifTrue:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   930
                    stop := sz + 1.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   931
                ]
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   932
            ].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   933
            singleBytes ifTrue:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   934
                start to:(stop - 1) do:[:i |
2734
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   935
                    b1 := (c := aString at:i) asciiValue.
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   936
                    b1 >= 16r80 ifTrue:[
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   937
                        "/ mhmh - seems to be JIS8
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   938
                        "/ (has half-width katakane at A1..DF
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   939
                        "/ my romanTable also includes xlations for those ...
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   940
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   941
"/                        DecodingFailedSignal 
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   942
"/                            raiseWith:aString
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   943
"/                            errorString:'JIS7 decoding failed (seems to be JIS8 encoded ?)'.
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   944
"/                        (b1 >= 16rA1 and:[b1 <= 16rDF]) ifTrue:[
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   945
"/                            c := Character value:(self romanTable at:(b1 - 32))
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   946
"/                        ] ifFalse:[
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   947
"/                            c := Character value:(self romanTable at:(b1 - 32))
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   948
"/                        ]
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   949
                    ].    
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   950
                    newString at:dstIdx put:c.
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   951
                    dstIdx := dstIdx + 1.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   952
                ].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   953
            ] ifFalse:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   954
                start to:(stop - 2) by:2 do:[:i |
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   955
                    b1 := (aString at:i) asciiValue.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   956
                    b2 := (aString at:i+1) asciiValue.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   957
                    val := (b1 bitShift:8) bitOr:b2.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   958
                    newString at:dstIdx put:(Character value:val).
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   959
                    dstIdx := dstIdx + 1.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   960
                ]
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   961
            ].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   962
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   963
            stop > sz ifTrue:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   964
                ^ newString copyFrom:1 to:dstIdx - 1.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   965
            ].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   966
            start := stop.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   967
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   968
            "/
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   969
            "/ found an escape (at start) 
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   970
            "/ - check for KI (<ESC> '$' 'B') or OLD-JIS-KI (<ESC> '$' '@')
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   971
            "/ and KO(ASCII) (<ESC> '(' 'B') or KO(ROMAN) (<ESC> '(' 'J')
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   972
            "/
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   973
            n1 := aString at:start.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   974
            n2 := aString at:(start + 1).
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   975
            n3 := aString at:(start + 2).
2734
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   976
            katakana := false.
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   977
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   978
            (n2 == $$ and:[n3 == $B ]) ifTrue:[
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   979
                singleBytes := false.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   980
            ] ifFalse:[
2734
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   981
                (n2 == $$ and:[n3 == $@ ]) ifTrue:[
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   982
                    singleBytes := false.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   983
                ] ifFalse:[
2734
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   984
                    (n2 == $( and:[n3 == $B ]) ifTrue:[
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   985
                        singleBytes := true.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   986
                    ] ifFalse:[
2734
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   987
                        (n2 == $( and:[n3 == $J ]) ifTrue:[
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   988
                            singleBytes := true.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   989
                        ] ifFalse:[
2734
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   990
                            (n2 == $( and:[n3 == $I ]) ifTrue:[
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   991
                                singleBytes := true.
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   992
                                katakana := true.
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
   993
                            ] ifFalse:[
2734
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   994
                                singleBytes ifTrue:[
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   995
                                    newString at:dstIdx put:n1.
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   996
                                    newString at:(dstIdx + 1) put:n2.
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   997
                                    newString at:(dstIdx + 2) put:n3.
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   998
                                    dstIdx := dstIdx + 3.
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
   999
                                ] ifFalse:[
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
  1000
                                    DecodingFailedSignal 
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
  1001
                                            raiseWith:aString
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
  1002
                                            errorString:'JIS7 decoding failed (not JIS7 encoded ?)'.
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
  1003
                                    newString at:dstIdx put:n1.
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
  1004
                                    newString at:(dstIdx + 1) put:n2.
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
  1005
                                    newString at:(dstIdx + 2) put:n3.
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
  1006
                                    dstIdx := dstIdx + 3.
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
  1007
                                ]
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1008
                            ]
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1009
                        ]
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1010
                    ]
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1011
                ]
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1012
            ].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1013
            start := start + 3.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1014
            start > sz ifTrue:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1015
                ^ newString copyFrom:1 to:dstIdx-1.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1016
            ]
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1017
        ]
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1018
    ].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1019
    ^ newString
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1020
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1021
    "simple:
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1022
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1023
         'hello' decodeFromJIS7
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1024
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1025
     ending with a crippled escape:
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1026
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1027
         |s|
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1028
         s := 'hello' copyWith:Character esc.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1029
         s decodeFromJIS7
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1030
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1031
         |s|
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1032
         s := 'hello' copyWith:Character esc.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1033
         s := s copyWith:$A.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1034
         s decodeFromJIS7
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1035
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1036
         |s|
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1037
         s := 'hello' copyWith:Character esc.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1038
         s := s copyWith:$$.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1039
         s decodeFromJIS7
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1040
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1041
         |s|
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1042
         s := 'hello' copyWith:Character esc.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1043
         s := s copyWith:$$.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1044
         s := s copyWith:$A.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1045
         s decodeFromJIS7 
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1046
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1047
     ending with a KANJI-in,  but no more chars:
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1048
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1049
         |s|
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1050
         s := 'hello' copyWith:Character esc.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1051
         s := s copyWith:$$.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1052
         s := s copyWith:$B.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1053
         s decodeFromJIS7
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1054
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1055
     ending with a KANJI-in, followed by $3 (KO):
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1056
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1057
         |s|
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1058
         s := 'hello' copyWith:Character esc.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1059
         s := s copyWith:$$.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1060
         s := s copyWith:$B.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1061
         s := s , '$3'.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1062
         s decodeFromJIS7
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1063
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1064
     ending with a KANJI-in, followed by $3$l$OF| (KO RE HA NI):
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1065
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1066
         |s|
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1067
         s := 'hello' copyWith:Character esc.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1068
         s := s copyWith:$$.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1069
         s := s copyWith:$B.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1070
         s := s , '$3$l$OF|'.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1071
         s decodeFromJIS7
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1072
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1073
     a KO in between:
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1074
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1075
         |s|
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1076
         s := 'hello' copyWith:Character esc.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1077
         s := s copyWith:$$.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1078
         s := s copyWith:$B.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1079
         s := s , '$3'.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1080
         s := s copyWith:Character esc.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1081
         s := s copyWith:$(.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1082
         s := s copyWith:$B.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1083
         s := s , 'hello'.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1084
         s decodeFromJIS7
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1085
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1086
     I dont know what that means ;-):
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1087
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1088
         |s t l|
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1089
         s := 'kterm ' copyWith:Character esc.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1090
         s := s copyWith:$$.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1091
         s := s copyWith:$B.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1092
         s := s , '$N4A;zC<Kv%(%_%e%l!!<%?'.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1093
         s := s copyWith:Character esc.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1094
         s := s copyWith:$(.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1095
         s := s copyWith:$B.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1096
         s := s , ' kterm'.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1097
         t := s decodeFromJIS7.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1098
         l := Label new.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1099
         l label:t.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1100
         l font:(Font family:'k14' face:nil style:nil size:nil).
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1101
         l font:(Font family:'gothic' size:17).
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1102
         l font:(Font family:'mincho' size:23).
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1103
         l realize
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1104
    "
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1105
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1106
    "Created: 17.4.1996 / 16:11:57 / cg"
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1107
    "Modified: 4.7.1997 / 11:01:48 / cg"
2720
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1108
!
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1109
2728
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1110
decodeFromSJIS:aString
2720
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1111
    "return a new string containing the aStrings characters,
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1112
     which are interpreted as a Shift-JIS encoded singleByte string.
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1113
     The result is a JISEncodedString (you need a JIS font to display that ...).
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1114
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1115
     This is a first-class candidate for a primitive"
2720
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1116
2728
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1117
    |newString char1 char2
2720
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1118
     sz         "{ Class: SmallInteger }"
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1119
     dstIdx     "{ Class: SmallInteger }"
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1120
     srcIdx     "{ Class: SmallInteger }"
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1121
     b1         "{ Class: SmallInteger }"
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1122
     b2         "{ Class: SmallInteger }"
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1123
     val        "{ Class: SmallInteger }"
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1124
    |
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1125
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1126
    sz := aString size.
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1127
    newString := JISEncodedString new:sz.
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1128
    sz ~~ 0 ifTrue:[
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1129
        dstIdx := 1.
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1130
        srcIdx := 1.
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1131
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1132
        [srcIdx <= sz] whileTrue:[
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1133
            "/
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1134
            "/ scan for next character in 129..159 or 224..239
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1135
            "/
2728
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1136
            char1 := aString at:srcIdx.
2720
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1137
            srcIdx := srcIdx + 1.
2728
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1138
            b1 := char1 asciiValue.
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1139
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1140
            ((srcIdx <= sz) 
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1141
            and:[(b1 >= 129 and:[b1 <= 159])                 "/ SJIS1
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1142
                 or:[b1 >= 224 and:[b1 <= 239]]]) ifTrue:[
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1143
                char2 := aString at:srcIdx.
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1144
                srcIdx := srcIdx + 1.
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1145
                b2 := char2 asciiValue.
2742
3add77872fea care for halfWidth kanakana characters
Claus Gittinger <cg@exept.de>
parents: 2740
diff changeset
  1146
                (b2 >= 64 and:[b2 <= 252]) ifTrue:[          "/ SJIS2
2728
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1147
                    |adjust rowOffs cellOffs|
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1148
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1149
                    adjust := (b2 < 159) ifTrue:[1] ifFalse:[0].
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1150
                    rowOffs := b1 < 160 ifTrue:[112] ifFalse:[176].
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1151
                    adjust == 1 ifTrue:[
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1152
                        cellOffs := 31 + (b2 > 127 ifTrue:[1] ifFalse:[0]).
2720
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1153
                    ] ifFalse:[
2728
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1154
                        cellOffs := 126.
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1155
                    ].
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1156
                    b1 := ((b1 - rowOffs) bitShift:1) - adjust.
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1157
                    b2 := (b2 - cellOffs).
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1158
                    val := (b1 bitShift:8) + b2.
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1159
                    val <= 0 ifTrue:[
2730
1fbfddfa1d3f allow proceeding after a decoder error
Claus Gittinger <cg@exept.de>
parents: 2728
diff changeset
  1160
                        DecodingFailedSignal 
2728
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1161
                                raiseWith:aString
2730
1fbfddfa1d3f allow proceeding after a decoder error
Claus Gittinger <cg@exept.de>
parents: 2728
diff changeset
  1162
                                errorString:'SJIS decoding failed (not SJIS encoded ?)'.
1fbfddfa1d3f allow proceeding after a decoder error
Claus Gittinger <cg@exept.de>
parents: 2728
diff changeset
  1163
                        newString at:dstIdx put:char1.
1fbfddfa1d3f allow proceeding after a decoder error
Claus Gittinger <cg@exept.de>
parents: 2728
diff changeset
  1164
                        dstIdx := dstIdx + 1.
1fbfddfa1d3f allow proceeding after a decoder error
Claus Gittinger <cg@exept.de>
parents: 2728
diff changeset
  1165
                        newString at:dstIdx put:char2.
1fbfddfa1d3f allow proceeding after a decoder error
Claus Gittinger <cg@exept.de>
parents: 2728
diff changeset
  1166
                    ] ifFalse:[
1fbfddfa1d3f allow proceeding after a decoder error
Claus Gittinger <cg@exept.de>
parents: 2728
diff changeset
  1167
                        newString at:dstIdx put:(Character value:val).
1fbfddfa1d3f allow proceeding after a decoder error
Claus Gittinger <cg@exept.de>
parents: 2728
diff changeset
  1168
                    ]
2720
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1169
                ] ifFalse:[
2728
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1170
                    "/ mhmh - append untranslated
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1171
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1172
                    newString at:dstIdx put:char1.
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1173
                    dstIdx := dstIdx + 1.
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1174
                    newString at:dstIdx put:char2.
2720
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1175
                ]
2742
3add77872fea care for halfWidth kanakana characters
Claus Gittinger <cg@exept.de>
parents: 2740
diff changeset
  1176
            ] ifFalse:[    
3add77872fea care for halfWidth kanakana characters
Claus Gittinger <cg@exept.de>
parents: 2740
diff changeset
  1177
                (b1 >= 16rA1 "161" and:[b1 <= 16rDF "223"]) ifTrue:[     "/ HALFWIDTH KATAKANA
3add77872fea care for halfWidth kanakana characters
Claus Gittinger <cg@exept.de>
parents: 2740
diff changeset
  1178
                    "/ map half-width katakan to 8E:xx
2728
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1179
                    val := b1 - 128.
2742
3add77872fea care for halfWidth kanakana characters
Claus Gittinger <cg@exept.de>
parents: 2740
diff changeset
  1180
                    val := val + (16r8E bitShift:8).
2728
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1181
                    newString at:dstIdx put:(Character value:val).
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1182
                ] ifFalse:[    
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1183
                    "/ roman characters left untranslated
2728
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1184
                    newString at:dstIdx put:char1
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1185
                ]
2720
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1186
            ].
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1187
            dstIdx := dstIdx + 1.
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1188
        ].
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1189
        newString := newString copyTo:dstIdx - 1.
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1190
    ].
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1191
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1192
    ^ newString
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1193
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1194
    "simple:
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1195
2728
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1196
         'hello' decodeFrom:#sjis         
2720
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1197
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1198
         '../../doc/online/japanese/TOP.html' asFilename contents asString
2728
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1199
                decodeFrom:#sjis  
2720
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1200
    "
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  1201
2728
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  1202
    "Created: 28.6.1997 / 19:19:23 / cg"
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1203
    "Modified: 4.7.1997 / 11:01:22 / cg"
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1204
!
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1205
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1206
encodeIntoBIG5:aBIG5String
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1207
    "return a new string with aBIG5Strings characters as BIG5 encoded 16bit string,
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1208
     The argument must be a BIG5String.
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1209
     This is used translate 16bit BIG5 strings into the external BIG5 encoded
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1210
     representation.
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1211
     The resulting string is only useful to be stored on some external file,
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1212
     not for being displayed in an ST/X view.
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1213
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1214
     This is a first-class candidate for a primitive"
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1215
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1216
    |sz "{ Class: SmallInteger }"
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1217
     b  "{ Class: SmallInteger }"
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1218
     c out|
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1219
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1220
    sz := aBIG5String size.
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1221
    out := WriteStream on:(String new:(sz * 2)).
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1222
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1223
    1 to:sz do:[:index |
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1224
        c := aBIG5String at:index.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1225
        b := c asciiValue.
1203
a43528048cb8 code tables moved to BIG5/JISEncodedString classes
Claus Gittinger <cg@exept.de>
parents: 1202
diff changeset
  1226
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1227
        b > 255 ifTrue:[
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1228
            "/ big5 character
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1229
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1230
            out nextPut:(Character value:(b bitShift:-8)).
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1231
            out nextPut:(Character value:(b bitAnd:16rFF)).
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1232
        ] ifFalse:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1233
            "/
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1234
            "/ control or roman character
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1235
            "/
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1236
            out nextPut:c
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1237
        ].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1238
    ].
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1239
    ^ out contents
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1240
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1241
    "Created: 17.4.1996 / 17:17:50 / cg"
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1242
    "Modified: 4.7.1997 / 11:01:57 / cg"
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1243
!
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1244
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1245
encodeIntoEUC:aJISString
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1246
    "return a new string with aJISStrings characters as EUC encoded 8bit string.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1247
     The argument must be a JIS 16 bit character string.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1248
     The resulting string is only useful to be stored on some external file,
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1249
     not for being displayed in an ST/X view.
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1250
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1251
     This is a first-class candidate for a primitive"
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1252
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1253
    |sz "{ Class: SmallInteger }"
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1254
     b1 "{ Class: SmallInteger }"
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1255
     val "{ Class: SmallInteger }"
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1256
     c romans out|
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1257
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1258
    romans := JISEncodedString romanTable.
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1259
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1260
    sz := aJISString size.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1261
    sz == 0 ifTrue:[^ ''].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1262
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1263
    out := WriteStream on:(String new:(sz * 2)).
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1264
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1265
    1 to:sz do:[:srcIndex |
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1266
        b1 := (c := aJISString at:srcIndex) asciiValue.
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1267
        b1 < 161 ifTrue:[
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1268
            "/ a control or roman character    
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1269
            out nextPut:c.
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1270
        ] ifFalse:[
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1271
            "/
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1272
            "/ check for a roman character
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1273
            "/ the two numbers below are romanTable min and romanTable max
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1274
            "/
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1275
            (b1 between:16r2121 and:16r2573) ifTrue:[
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1276
                val := romans indexOf:b1.
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1277
                (val ~~ 0 and:[val <= 127]) ifTrue:[
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1278
                    out nextPut:(Character value:(val - 1 + 32))
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1279
                ] ifFalse:[
2747
0a95a87bcefa fixed euc encoding
Claus Gittinger <cg@exept.de>
parents: 2742
diff changeset
  1280
                    out nextPut:(Character value:(b1 bitShift:-8) + 128).
0a95a87bcefa fixed euc encoding
Claus Gittinger <cg@exept.de>
parents: 2742
diff changeset
  1281
                    out nextPut:(Character value:(b1 bitAnd:16rFF) + 128).
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1282
                ].
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1283
            ] ifFalse:[
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1284
                out nextPut:(Character value:(b1 bitShift:-8) + 128).
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1285
                out nextPut:(Character value:(b1 bitAnd:16rFF) + 128).
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1286
            ]
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1287
        ].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1288
    ].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1289
    ^ out contents
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1290
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1291
    "simple:
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1292
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1293
         ('hello' decodeFrom:#euc) encodeInto:#euc    
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1294
    "
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1295
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1296
    "Created: 17.4.1996 / 16:13:33 / cg"
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1297
    "Modified: 4.7.1997 / 11:03:43 / cg"
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1298
!
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1299
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1300
encodeIntoGB:aGBString
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1301
    "return a new string with aGBStrings characters as GB encoded 8bit string.
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1302
     The argument must be a GBString.
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1303
     This is used translate 16bit GB strings into the external GB encoded
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1304
     representation.
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1305
     The resulting string is only useful to be stored on some external file,
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1306
     not for being displayed in an ST/X view.
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1307
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1308
     This is a first-class candidate for a primitive"
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1309
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1310
    |sz "{ Class: SmallInteger }"
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1311
     b  "{ Class: SmallInteger }"
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1312
     c out|
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1313
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1314
    sz := aGBString size.
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1315
    out := WriteStream on:(String new:(sz * 2)).
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1316
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1317
    1 to:sz do:[:index |
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1318
        c := aGBString at:index.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1319
        b := c asciiValue.
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1320
        b > 255 ifTrue:[
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1321
            "/ big5 character
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1322
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1323
            out nextPut:(Character value:(b bitShift:-8)).
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1324
            out nextPut:(Character value:(b bitAnd:16rFF)).
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1325
        ] ifFalse:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1326
            "/
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1327
            "/ control or roman character
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1328
            "/
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1329
            out nextPut:c
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1330
        ].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1331
    ].
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1332
    ^ out contents
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1333
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1334
    "Created: 30.6.1997 / 15:48:35 / cg"
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1335
    "Modified: 4.7.1997 / 11:02:13 / cg"
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1336
!
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1337
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1338
encodeIntoJIS7:aJISString
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1339
    "return a new string with aJISStrings characters as JIS7 encoded 7bit string,
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1340
     The receiver must be a JIS encoded character string.
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1341
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1342
     The resulting string is only useful to be stored on some external file,
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1343
     not for being displayed in an ST/X view."
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1344
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1345
    |sz "{ Class:SmallInteger }"
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1346
     b1 "{ Class:SmallInteger }"
1669
4951596746f7 comments & code cleanup
Claus Gittinger <cg@exept.de>
parents: 1613
diff changeset
  1347
     val romans out inSingleByteMode c kanji roman|
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1348
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1349
    inSingleByteMode := true.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1350
    kanji := JISEncodedString jis7KanjiEscapeSequence.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1351
    roman := JISEncodedString jis7RomanEscapeSequence.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1352
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1353
    romans := JISEncodedString romanTable.
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1354
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1355
    sz := aJISString size.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1356
    sz == 0 ifTrue:[^ ''].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1357
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1358
    out := WriteStream on:(String new:(sz * 2)).
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1359
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1360
    1 to:sz do:[:srcIndex |
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1361
        c := aJISString at:srcIndex.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1362
        b1 := c asciiValue.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1363
        b1 < 33 ifTrue:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1364
            "/ a control character
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1365
            inSingleByteMode ifFalse:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1366
                out nextPutAll:roman.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1367
                inSingleByteMode := true
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1368
            ].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1369
            out nextPut:c.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1370
        ] ifFalse:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1371
            "/ check for a roman character
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1372
            "/ the two numbers below are romanTable min and romanTable max
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1373
            "/
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1374
            (b1 between:16r2121 and:16r2573) ifTrue:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1375
                val := romans indexOf:b1.
2734
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
  1376
                (val ~~ 0 and:[val <= 16r7F]) ifTrue:[
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1377
                    inSingleByteMode ifFalse:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1378
                        out nextPutAll:roman.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1379
                        inSingleByteMode := true
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1380
                    ].
2734
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
  1381
                    out nextPut:(Character value:(val - 1 + 32))
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1382
                ] ifFalse:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1383
                    inSingleByteMode ifTrue:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1384
                        out nextPutAll:kanji.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1385
                        inSingleByteMode := false
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1386
                    ].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1387
                    out nextPut:(Character value:(b1 bitShift:-8)).
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1388
                    out nextPut:(Character value:(b1 bitAnd:16rFF)).
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1389
                ].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1390
            ] ifFalse:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1391
                b1 <= 255 ifTrue:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1392
                    "/ mhmh - unrepresentable roman (national chars)
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1393
"/                    b1 >= 160 ifTrue:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1394
"/                        ('no rep for ' , b1 printString) printNL.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1395
"/                    ].
2734
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
  1396
                    EncodingFailedSignal
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
  1397
                        raiseWith:aJISString
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
  1398
                        errorString:'JIS7 encoding failed (contains 8-bit characters ?)'.
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1399
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1400
                    inSingleByteMode ifFalse:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1401
                        out nextPutAll:roman.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1402
                        inSingleByteMode := true
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1403
                    ].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1404
                    out nextPut:c
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1405
                ] ifFalse:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1406
                    inSingleByteMode ifTrue:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1407
                        out nextPutAll:kanji.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1408
                        inSingleByteMode := false
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1409
                    ].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1410
                    out nextPut:(Character value:(b1 bitShift:-8)).
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1411
                    out nextPut:(Character value:(b1 bitAnd:16rFF)).
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1412
                ]
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1413
            ]
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1414
        ].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1415
    ].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1416
    inSingleByteMode ifFalse:[
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1417
        out nextPutAll:roman.
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1418
    ].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1419
    ^ out contents
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1420
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1421
    "simple:
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1422
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1423
         'hello' decodeFromJIS7 encodeIntoJIS7 
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1424
    "
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1425
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  1426
    "Created: 17.4.1996 / 16:17:40 / cg"
2734
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
  1427
    "Modified: 30.6.1997 / 19:20:02 / cg"
2721
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1428
!
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1429
2731
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1430
encodeIntoSJIS:aJISString
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1431
    "return a new string with aJISStrings characters as SJIS encoded 8bit string.
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1432
     The receiver must be a JIS encoded character string.
2731
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1433
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1434
     The resulting string is only useful to be stored on some external file,
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1435
     not for being displayed in an ST/X view."
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1436
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1437
    |sz "{ Class: SmallInteger }"
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1438
     rval "{ Class: SmallInteger }"
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1439
     val  "{ Class: SmallInteger }"
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1440
     romans c out isSJIS|
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1441
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1442
    romans := JISEncodedString romanTable.
2731
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1443
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1444
    sz := aJISString size.
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1445
    sz == 0 ifTrue:[^ ''].
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1446
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1447
    out := WriteStream on:(String new:(sz * 2)).
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1448
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1449
    1 to:sz do:[:srcIndex |
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1450
        val := (c := aJISString at:srcIndex) asciiValue.
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1451
        val <= 128 ifTrue:[
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1452
            "/ a control or ascii character    
2731
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1453
            out nextPut:c.
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1454
        ] ifFalse:[
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1455
            (val > 150 and:[val < 224]) ifTrue:[
2731
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1456
                "/ ascii subset
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1457
                out nextPut:c.
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1458
            ] ifFalse:[
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1459
                "/ should not happen ...
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1460
                val <= 255 ifTrue:[
2731
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1461
                    out nextPut:c.
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1462
                ] ifFalse:[
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1463
                    isSJIS := true.
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1464
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1465
                    "/ check for HALFWIDTH KATAKANA
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1466
                    "/ 8E:xx
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1467
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1468
                    (val bitAnd:16rFF00) == 16r8E00 ifTrue:[
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1469
                        |b|
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1470
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1471
                        b := (val bitAnd:16rFF) + 128.
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1472
                        (b >= 16rA1 "161" and:[b <= 16rDF "223"]) ifTrue:[
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1473
                            out nextPut:(Character value:b).
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1474
                            isSJIS := false.
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1475
                        ].
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1476
                    ].
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1477
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1478
                    isSJIS ifTrue:[
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1479
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1480
                        "/ check for a roman character
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1481
                        "/ the two numbers below are romanTable min and romanTable max
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1482
                        "/
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1483
                        (val between:16r2121 and:16r2573) ifTrue:[
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1484
                            rval := romans indexOf:val.
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1485
                            rval ~~ 0 ifTrue:[
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1486
                                out nextPut:(Character value:(rval - 1 + 32)).
2731
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1487
                                isSJIS := false.
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1488
                            ].
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1489
                        ].
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1490
                    ].
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1491
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1492
                    isSJIS ifTrue:[
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1493
                        |b1 b2 rowOffset cellOffset|
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1494
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1495
                        b1 := (val bitShift:-8).
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1496
                        b2 := (val bitAnd:16rFF).
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1497
                        rowOffset := (b1 < 95) ifTrue:[112] ifFalse:[176].
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1498
                        cellOffset := b1 odd ifTrue:[(b2 > 95) ifTrue:[32] ifFalse:[31]]
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1499
                                             ifFalse:[126].
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1500
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1501
                        out nextPut:(Character value:(((b1 + 1) bitShift:-1) + rowOffset)).
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1502
                        out nextPut:(Character value:b2 + cellOffset).
2731
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1503
                    ]
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1504
                ]
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1505
            ]
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1506
        ].
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1507
    ].
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1508
    ^ out contents
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1509
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1510
    "simple:
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1511
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1512
         'hello' decodeFromEUC encodeIntoEUC    
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1513
    "
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1514
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1515
    "Created: 28.6.1997 / 21:13:27 / cg"
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  1516
    "Modified: 4.7.1997 / 11:04:32 / cg"
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1517
!
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1518
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1519
encodeRomans:aTwoByteString
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1520
    "return a new (usually 2-byte) string with roman characters
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1521
     encoded. The position of roman characters are encoding specific.
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1522
     (see romanTable of concrete 16-bit string classes).
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1523
     Used when displaying strings."
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1524
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1525
    |sz "{ Class: SmallInteger }"
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1526
     b  "{ Class: SmallInteger }"
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1527
     c romans newString|
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1528
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1529
    romans := self romanTable.
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1530
    romans isNil ifTrue:[^ aTwoByteString].
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1531
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1532
    sz := aTwoByteString size.
2734
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
  1533
    newString := self fromString:aTwoByteString.
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1534
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1535
    1 to:sz do:[:index |
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1536
        c := aTwoByteString at:index.
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1537
        b := c asciiValue.
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1538
2735
72469efcec1b checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2734
diff changeset
  1539
        (b > 32 and:[b <= 255]) ifTrue:[
2734
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
  1540
            (b-32+1) <= romans size ifTrue:[
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
  1541
                "/
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
  1542
                "/ a roman character
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
  1543
                "/ map to corresponding row
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
  1544
                "/
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
  1545
                newString at:index put:(Character value:(romans at:b - 32 + 1))
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
  1546
            ] ifFalse:[
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
  1547
                newString at:index put:(Character value:self defaultRomanCharacterValue)
ab167dc4c5db checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2733
diff changeset
  1548
            ]
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1549
        ].
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1550
    ].
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1551
    ^ newString
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1552
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1553
    "Created: 30.6.1997 / 15:43:56 / cg"
2735
72469efcec1b checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2734
diff changeset
  1554
    "Modified: 30.6.1997 / 21:16:20 / cg"
2731
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1555
!
0ba9228a6677 added encoding into SJIS
Claus Gittinger <cg@exept.de>
parents: 2730
diff changeset
  1556
2721
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1557
guessEncodingFrom:aString
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1558
    "try to guess some 8-bit strings encoding by
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1559
     searching for certain escape sequences.
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1560
     Returns a string or nil.
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1561
     This may not find the strings real encoding."
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1562
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1563
    |n "{Class: SmallInteger }"
2735
72469efcec1b checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2734
diff changeset
  1564
     binary idx s w ascii|
2721
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1565
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1566
    "/ look for JIS7 / EUC encoding ...
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1567
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1568
    (aString findString:(JISEncodedString jisISO2022EscapeSequence)) ~~ 0 ifTrue:[
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1569
        ^ #'jis_iso2020'
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  1570
    ].
2721
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1571
    (aString findString:(JISEncodedString jis7KanjiEscapeSequence)) ~~ 0 ifTrue:[
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1572
        ^ #jis7
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1573
    ].
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1574
    (aString findString:(JISEncodedString oldJis7KanjiEscapeSequence)) ~~ 0 ifTrue:[
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1575
        ^ #jis7
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1576
    ].
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1577
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1578
    "/ TODO:
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1579
2735
72469efcec1b checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2734
diff changeset
  1580
"/    "/ look for EUC
72469efcec1b checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2734
diff changeset
  1581
"/    idx := aString findFirst:[:char | |ascii|
72469efcec1b checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2734
diff changeset
  1582
"/                                        ((ascii := char asciiValue) >= 16rA1)     
72469efcec1b checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2734
diff changeset
  1583
"/                                        and:[ascii <= 16rFE]].
72469efcec1b checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2734
diff changeset
  1584
"/    idx ~~ 0 ifTrue:[
72469efcec1b checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2734
diff changeset
  1585
"/        ascii := (aString at:(idx + 1)) asciiValue.
72469efcec1b checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2734
diff changeset
  1586
"/        (ascii >= 16rA1 and:[ascii <= 16rFE]) ifTrue:[
72469efcec1b checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2734
diff changeset
  1587
"/            ^ #euc
72469efcec1b checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2734
diff changeset
  1588
"/        ]
72469efcec1b checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2734
diff changeset
  1589
"/    ].
72469efcec1b checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2734
diff changeset
  1590
2721
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1591
    "/ look for SJIS ...
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1592
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1593
    "/ look for UTS ...
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1594
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1595
    ^ nil
77fedec2357e checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2720
diff changeset
  1596
2735
72469efcec1b checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2734
diff changeset
  1597
    "Modified: 30.6.1997 / 21:17:22 / cg"
443
Claus Gittinger <cg@exept.de>
parents: 428
diff changeset
  1598
! !
Claus Gittinger <cg@exept.de>
parents: 428
diff changeset
  1599
1849
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
  1600
!CharacterArray class methodsFor:'pattern matching'!
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  1601
327
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  1602
matchScan:matchScanArray from:matchStart to:matchStop with:aString from:start to:stop ignoreCase:ignoreCase
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  1603
    "helper for match; return true if the characters from start to stop in
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  1604
     aString are matching the scan in matchScan from matchStart to matchStop.
328
claus
parents: 327
diff changeset
  1605
     The matchScan is as created by asMatchScanArray.
327
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  1606
328
claus
parents: 327
diff changeset
  1607
     This algorithm is not at all the most efficient; 
claus
parents: 327
diff changeset
  1608
     for heavy duty pattern matching, an interface (primitive) to the regex 
claus
parents: 327
diff changeset
  1609
     pattern matching package should be added."
327
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  1610
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  1611
    |matchEntry 
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  1612
     mStart "{ Class: SmallInteger }"
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  1613
     mStop  "{ Class: SmallInteger }"
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  1614
     sStart "{ Class: SmallInteger }"
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  1615
     sStop  "{ Class: SmallInteger }"
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  1616
     mSize  "{ Class: SmallInteger }"
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  1617
     sSize  "{ Class: SmallInteger }"
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  1618
     index  "{ Class: SmallInteger }"
2561
205ee33decf9 removed unused var
Claus Gittinger <cg@exept.de>
parents: 2554
diff changeset
  1619
     quickCheck matchLast nextMatchEntry
986
c3a9f590146d removed unused locals
Claus Gittinger <cg@exept.de>
parents: 915
diff changeset
  1620
     checkChar included|
327
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  1621
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  1622
    mStart := matchStart.
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  1623
    mStop := matchStop.
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  1624
    sStart := start.
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  1625
    sStop := stop.
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  1626
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  1627
    [true] whileTrue:[
2519
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1628
        mSize := mStop - mStart + 1.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1629
        sSize := sStop - sStart + 1.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1630
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1631
        "empty strings match"
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1632
        (mSize == 0) ifTrue:[^ (sSize == 0)].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1633
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1634
        matchEntry := matchScanArray at:mStart.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1635
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1636
        "/ the most common case first:
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1637
        (sSize ~~ 0 
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1638
        and:[(checkChar := (aString at:sStart)) == matchEntry]) ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1639
            "advance by one and continue"
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1640
            mStart := mStart + 1.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1641
            sStart := sStart + 1
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1642
        ] ifFalse:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1643
            (matchEntry == #any) ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1644
                "restString empty -> no match"
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1645
                (sSize == 0) ifTrue:[^ false].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1646
                "# matches single character"
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1647
                ((sSize == 1) and:[mSize == 1]) ifTrue:[^ true].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1648
                "advance by one and continue"
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1649
                mStart := mStart + 1.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1650
                sStart := sStart + 1
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1651
            ] ifFalse:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1652
                (matchEntry == #anyString) ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1653
                    "* alone matches anything"
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1654
                    (mSize == 1) ifTrue:[^ true].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1655
                    "restString empty & matchString not empty -> no match"
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1656
                    (sSize == 0) ifTrue:[^ false].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1657
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1658
                    "
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1659
                     try to avoid some of the recursion by checking last
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1660
                     character and continue with shortened strings if possible
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1661
                    "
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1662
                    quickCheck := false.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1663
                    (mStop >= mStart) ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1664
                        matchLast := matchScanArray at:mStop.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1665
                        (matchLast ~~ #anyString) ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1666
                            (matchLast == #any) ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1667
                                quickCheck := true
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1668
                            ] ifFalse:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1669
                                matchLast == (aString at:sStop) ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1670
                                    quickCheck := true
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1671
                                ] ifFalse:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1672
                                    matchLast isString ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1673
                                        quickCheck := matchLast includes:(aString at:sStop)
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1674
                                    ]
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1675
                                ]
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1676
                            ]
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1677
                        ]
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1678
                    ].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1679
                    quickCheck ifFalse:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1680
                        "/ no quick check; 
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1681
                        "/ look for the next character(s)
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1682
                        "/ and trye matching there
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1683
                        "/ (to avoid recursion)
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1684
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1685
                        mStart < mStop ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1686
                            nextMatchEntry := matchScanArray at:mStart+1.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1687
                            nextMatchEntry isCharacter ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1688
                                sStart <= sStop ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1689
                                    [true] whileTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1690
                                        ignoreCase ifFalse:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1691
                                            index := aString indexOf:nextMatchEntry startingAt:sStart
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1692
                                        ] ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1693
                                            index := aString findFirst:[:c | c asLowercase == nextMatchEntry asLowercase]
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1694
                                                           startingAt:sStart.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1695
                                        ].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1696
                                        index == 0 ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1697
                                            ^ false
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1698
                                        ].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1699
                                        (self matchScan:matchScanArray 
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1700
                                              from:(mStart + 1) 
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1701
                                              to:mStop 
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1702
                                              with:aString 
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1703
                                              from:index 
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1704
                                              to:stop 
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1705
                                              ignoreCase:ignoreCase 
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1706
                                        ) ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1707
                                            ^ true
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1708
                                        ].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1709
                                        sStart := index + 1.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1710
                                    ]
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1711
                                ]
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1712
                            ]
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1713
                        ].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1714
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1715
                        "
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1716
                         no quick check possible;
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1717
                         loop over all possible substrings
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1718
                        "
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1719
                        index := sStart.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1720
                        [index <= sStop] whileTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1721
                            (self matchScan:matchScanArray 
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1722
                                  from:(mStart + 1) 
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1723
                                  to:mStop 
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1724
                                  with:aString 
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1725
                                  from:index 
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1726
                                  to:stop 
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1727
                                  ignoreCase:ignoreCase 
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1728
                            ) ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1729
                                ^ true
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1730
                            ].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1731
                            index := index + 1
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1732
                        ].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1733
                        ^ false
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1734
                    ].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1735
                    "
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1736
                     quickCheck ok, advance from the right
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1737
                    "
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1738
                    mStop := mStop - 1.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1739
                    sStop := sStop - 1
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1740
                ] ifFalse:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1741
                    (matchEntry isString) ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1742
                        "testString empty -> no match"
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1743
                        (sSize == 0) ifTrue:[^ false].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1744
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1745
                        included := false.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1746
                        "/ checkChar := aString at:sStart.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1747
                        included := matchEntry includes:checkChar.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1748
                        included ifFalse:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1749
                            ignoreCase ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1750
                                checkChar isUppercase ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1751
                                    included := matchEntry includes:checkChar asLowercase.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1752
                                ] ifFalse:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1753
                                    included := matchEntry includes:checkChar asUppercase.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1754
                                ]
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1755
                            ].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1756
                        ].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1757
                        mStart := mStart + 1.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1758
                        mSize := mSize - 1.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1759
                        included ifFalse:[^ false].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1760
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1761
                        ((sSize == 1) and:[mSize == 0]) ifTrue:[^ true].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1762
                        "cut off 1st char and continue"
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1763
                        sStart := sStart + 1
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1764
                    ] ifFalse:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1765
                        "/ must be single character
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1766
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1767
                        "testString empty ?"
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1768
                        (sSize == 0) ifTrue:[^ false].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1769
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1770
                        "first characters equal ?"
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1771
                        "/ checkChar := aString at:sStart.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1772
                        ignoreCase ifFalse:[^ false].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1773
                        (checkChar asUppercase ~~ matchEntry asUppercase) ifTrue:[^ false].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1774
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1775
                        "advance and continue"
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1776
                        mStart := mStart + 1.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1777
                        sStart := sStart + 1
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1778
                    ]
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1779
                ]
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1780
            ]
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1781
        ]
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1782
    ].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1783
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1784
    "
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1785
     |scanArray s|
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1786
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1787
     scanArray := self matchScanArrayFrom:'*hello'.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1788
     s := 'foo bar hello world'.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1789
     CharacterArray
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1790
         matchScan:scanArray
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1791
         from:1 
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1792
         to:scanArray size
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1793
         with:s
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1794
         from:1
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1795
         to:s size
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1796
         ignoreCase:false
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1797
    "
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1798
    "
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1799
     |scanArray s|
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1800
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1801
     scanArray := self matchScanArrayFrom:'*hello*'.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1802
     s := 'foo bar hello world'.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1803
     CharacterArray
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1804
         matchScan:scanArray
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1805
         from:1 
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1806
         to:scanArray size
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1807
         with:s
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1808
         from:1
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1809
         to:s size
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1810
         ignoreCase:false
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1811
    "
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1812
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1813
    "Modified: 2.4.1997 / 16:35:48 / cg"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1814
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1815
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1816
matchScanArrayFrom:aString
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1817
    "scan a pattern string and decompose it into a scanArray.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1818
     This is processed faster (especially with character ranges), and
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1819
     can also be reused later. (if the same pattern is to be searched again)"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1820
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1821
    |coll 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1822
     idx "{ Class: SmallInteger }"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1823
     end c1 c2 matchSet previous|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1824
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1825
    coll := OrderedCollection new.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1826
    idx := 1. end := aString size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1827
    [idx <= end] whileTrue:[
2519
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1828
        |char this|
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1829
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1830
        char := aString at:idx.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1831
        char == $* ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1832
            previous ~~ #anyString ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1833
                this := #anyString
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1834
            ]
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1835
        ] ifFalse:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1836
            char == $# ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1837
                previous ~~ #anyString ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1838
                    this := #any
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1839
                ]
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1840
            ] ifFalse:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1841
                char == $[ ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1842
                    matchSet := IdentitySet new.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1843
                    idx := idx + 1.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1844
                    idx > end ifTrue:[^ nil].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1845
                    char := aString at:idx.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1846
                    c1 := nil.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1847
                    [char ~~ $]] whileTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1848
                        ((char == $-) and:[c1 notNil]) ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1849
                            idx := idx + 1.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1850
                            idx > end ifTrue:[^ nil].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1851
                            c2 := aString at:idx.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1852
                            c1 to:c2 do:[:c | matchSet add:c].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1853
                            c1 := nil.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1854
                            idx := idx + 1.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1855
                        ] ifFalse:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1856
                            (char ~~ $]) ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1857
                                matchSet add:char.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1858
                                c1 := char.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1859
                                idx := idx + 1
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1860
                            ]
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1861
                        ].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1862
                        idx > end ifTrue:[^ nil].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1863
                        char := aString at:idx
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1864
                    ].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1865
                    this := matchSet asString
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1866
                ] ifFalse:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1867
                    char == $\ ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1868
                        idx := idx + 1.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1869
                        idx > end ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1870
                            "/ mhmh - what should we do here ?
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1871
                            this := char
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1872
                        ] ifFalse:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1873
                            this := aString at:idx.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1874
                        ]
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1875
                    ] ifFalse:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1876
                        this := char
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1877
                    ]
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1878
                ]
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1879
            ]
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1880
        ].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1881
        this notNil ifTrue:[coll add:this. previous := this].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1882
        idx := idx + 1
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1883
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1884
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1885
    ^ coll asArray
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1886
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1887
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1888
     String matchScanArrayFrom:'*ute*'  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1889
     String matchScanArrayFrom:'**ute**'  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1890
     String matchScanArrayFrom:'*uter'   
2519
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1891
     String matchScanArrayFrom:'\*uter'   
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1892
     String matchScanArrayFrom:'[cC]#mpute[rR]'  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1893
     String matchScanArrayFrom:'[abcd]*'      
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1894
     String matchScanArrayFrom:'[a-k]*'      
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1895
     String matchScanArrayFrom:'*some*compl*ern*' 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1896
     String matchScanArrayFrom:'[a-'  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1897
     String matchScanArrayFrom:'[a-zA-Z]'  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1898
     String matchScanArrayFrom:'[a-z01234A-Z]'  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1899
    "
2519
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1900
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  1901
    "Modified: 2.4.1997 / 16:20:29 / cg"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1902
! !
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1903
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1904
!CharacterArray methodsFor:'Compatibility - ST/V'!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1905
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1906
byteAt:index put:aByte
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1907
    "store a byte at given index.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1908
     This is an ST/V compatibility method."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1909
1333
d3fdca956f0d allow storing of 0-bytes (be careful with printing then ...)
Claus Gittinger <cg@exept.de>
parents: 1312
diff changeset
  1910
"/    (aByte == 0) ifTrue:[
d3fdca956f0d allow storing of 0-bytes (be careful with printing then ...)
Claus Gittinger <cg@exept.de>
parents: 1312
diff changeset
  1911
"/        "store a space instead"
d3fdca956f0d allow storing of 0-bytes (be careful with printing then ...)
Claus Gittinger <cg@exept.de>
parents: 1312
diff changeset
  1912
"/        ^ super basicAt:index put:(Character space)
d3fdca956f0d allow storing of 0-bytes (be careful with printing then ...)
Claus Gittinger <cg@exept.de>
parents: 1312
diff changeset
  1913
"/    ].
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1914
    ^ super at:index put:(Character value:aByte)
1333
d3fdca956f0d allow storing of 0-bytes (be careful with printing then ...)
Claus Gittinger <cg@exept.de>
parents: 1312
diff changeset
  1915
d3fdca956f0d allow storing of 0-bytes (be careful with printing then ...)
Claus Gittinger <cg@exept.de>
parents: 1312
diff changeset
  1916
    "
d3fdca956f0d allow storing of 0-bytes (be careful with printing then ...)
Claus Gittinger <cg@exept.de>
parents: 1312
diff changeset
  1917
     'hello' copy at:1 put:$H asciiValue; yourself
d3fdca956f0d allow storing of 0-bytes (be careful with printing then ...)
Claus Gittinger <cg@exept.de>
parents: 1312
diff changeset
  1918
     'hello' copy byteAt:1 put:72; yourself 
d3fdca956f0d allow storing of 0-bytes (be careful with printing then ...)
Claus Gittinger <cg@exept.de>
parents: 1312
diff changeset
  1919
     'hello' copy byteAt:1 put:0; yourself 
d3fdca956f0d allow storing of 0-bytes (be careful with printing then ...)
Claus Gittinger <cg@exept.de>
parents: 1312
diff changeset
  1920
    "
d3fdca956f0d allow storing of 0-bytes (be careful with printing then ...)
Claus Gittinger <cg@exept.de>
parents: 1312
diff changeset
  1921
d3fdca956f0d allow storing of 0-bytes (be careful with printing then ...)
Claus Gittinger <cg@exept.de>
parents: 1312
diff changeset
  1922
    "Modified: 6.5.1996 / 10:35:26 / cg"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1923
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1924
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1925
replChar:oldChar with:newChar
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1926
    "return a copy of the receiver, with all oldChars replaced
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1927
     by newChar.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1928
     This is an ST/V compatibility method."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1929
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1930
    ^ self copy replaceAll:oldChar by:newChar
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1931
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1932
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1933
     '12345678901234567890' replChar:$0 with:$* 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1934
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1935
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1936
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1937
replChar:oldChar withString:newString
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1938
    "return a copy of the receiver, with all oldChars replaced
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1939
     by newString (i.e. slice in the newString in place of the oldChar).
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1940
     This is an ST/V compatibility method."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1941
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1942
    |tmpStream|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1943
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1944
    tmpStream := WriteStream on:(self class new).
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1945
    self do:[:element |
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1946
	element = oldChar ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1947
	    tmpStream nextPutAll:newString
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1948
	] ifFalse:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1949
	    tmpStream nextPut:element 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1950
	].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1951
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1952
    ^ tmpStream contents
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1953
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1954
   "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1955
     '12345678901234567890' replChar:$0 withString:'foo' 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1956
     'a string with spaces' replChar:$  withString:' foo '  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1957
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1958
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1959
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1960
trimBlanks
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1961
    "return a copy of the receiver without leading
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1962
     and trailing spaces.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1963
     This is an ST/V compatibility method."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1964
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1965
    ^ self withoutSpaces
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1966
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1967
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1968
     '    spaces at beginning' trimBlanks     
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1969
     'spaces at end    ' trimBlanks           
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1970
     '    spaces at beginning and end     ' trimBlanks    
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1971
     'no spaces' trimBlanks              
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1972
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1973
! !
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1974
1978
0d49a096cf06 category rename
Claus Gittinger <cg@exept.de>
parents: 1849
diff changeset
  1975
!CharacterArray methodsFor:'Compatibility - V''Age'!
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1976
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1977
addLineDelimiter
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1978
    "replace all '\'-characters by line delimiter (cr) - characters.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1979
     This has been added for VisualAge compatibility."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1980
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1981
    ^ self withCRs
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1982
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1983
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1984
bindWith:aString
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1985
    "return a copy of the receiver, where a '%1' escape is
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1986
     replaced by aString.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1987
     This has been added for VisualAge compatibility."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1988
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1989
    ^ self expandPlaceholdersWith:(Array with:aString)
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1990
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1991
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1992
     'do you like %1 ?' bindWith:'smalltalk'
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1993
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1994
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1995
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1996
bindWith:string1 with:string2
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1997
    "return a copy of the receiver, where a '%1' escape is
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1998
     replaced by string1 and '%2' is replaced by string2.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  1999
     This has been added for VisualAge compatibility."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2000
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2001
    ^ self expandPlaceholdersWith:(Array with:string1 with:string2)
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2002
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2003
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2004
     'do you prefer %1 or rather %2 ?'
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2005
	bindWith:'smalltalk' with:'c++'
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2006
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2007
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2008
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2009
bindWith:str1 with:str2 with:str3
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2010
    "return a copy of the receiver, where a '%1', '%2' and '%3' escapes
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2011
     are replaced by str1, str2 and str3 respectively.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2012
     This has been added for VisualAge compatibility."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2013
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2014
    ^ self expandPlaceholdersWith:(Array with:str1 with:str2 with:str3)
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2015
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2016
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2017
     'do you prefer %1 or rather %2 (not talking about %3) ?'
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2018
	bindWith:'smalltalk' with:'c++' with:'c'
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2019
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2020
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2021
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2022
bindWith:str1 with:str2 with:str3 with:str4
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2023
    "return a copy of the receiver, where a '%1', '%2', '%3' and '%4' escapes
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2024
     are replaced by str1, str2, str3 and str4 respectively.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2025
     This has been added for VisualAge compatibility."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2026
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2027
    ^ self expandPlaceholdersWith:(Array with:str1 with:str2 with:str3 with:str4)
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2028
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2029
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2030
     'do you prefer %1 or rather %2 (not talking about %3 or even %4) ?'
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2031
	bindWith:'smalltalk' with:'c++' with:'c' with:'assembler'
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2032
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2033
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2034
2344
b3788119533f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2250
diff changeset
  2035
bindWith:str1 with:str2 with:str3 with:str4 with:str5
b3788119533f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2250
diff changeset
  2036
    "return a copy of the receiver, where a '%1' .. '%5' escapes
b3788119533f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2250
diff changeset
  2037
     are replaced by str1 .. str5 respectively.
b3788119533f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2250
diff changeset
  2038
     This has been added for VisualAge compatibility."
b3788119533f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2250
diff changeset
  2039
b3788119533f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2250
diff changeset
  2040
    ^ self expandPlaceholdersWith:(Array with:str1 with:str2 with:str3 with:str4 with:str5)
b3788119533f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2250
diff changeset
  2041
b3788119533f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2250
diff changeset
  2042
    "Created: 31.1.1997 / 16:25:42 / cg"
b3788119533f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2250
diff changeset
  2043
!
b3788119533f checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2250
diff changeset
  2044
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2045
bindWithArguments:anArrayOfStrings
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2046
    "return a copy of the receiver, where a '%i' escape
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2047
     is replaced by the coresponding string from the argument array.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2048
     'i' may be between 1 and 9 (i.e. a maximum of 9 placeholders is allowed).
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2049
     This has been added for VisualAge compatibility."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2050
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2051
    ^ self expandPlaceholdersWith:anArrayOfStrings
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2052
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2053
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2054
     'do you prefer %1 or rather %2 (not talking about %3) ?'
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2055
	bindWithArguments:#('smalltalk' 'c++' 'c')
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2056
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2057
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2058
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2059
subStrings
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2060
    "return an array consisting of all words contained in the receiver.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2061
     Words are separated by whitespace.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2062
     This has been added for VisualAge compatibility."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2063
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2064
    ^ self asCollectionOfWords
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2065
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2066
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2067
     'hello world, this is smalltalk' subStrings
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2068
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2069
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2070
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2071
subStrings:separatorCharacter
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2072
    "return an array consisting of all words contained in the receiver.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2073
     Words are separated by separatorCharacter.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2074
     This has been added for VisualAge compatibility."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2075
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2076
    ^ self asCollectionOfSubstringsSeparatedBy:separatorCharacter
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2077
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2078
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2079
     'foo:bar:baz:smalltalk' subStrings:$:
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2080
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2081
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2082
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2083
trimSeparators
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2084
    "return a copy of the receiver without leading and trailing whiteSpace"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2085
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2086
    ^ self withoutSeparators
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2087
! !
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2088
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2089
!CharacterArray methodsFor:'character searching'!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2090
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2091
includesMatchCharacters
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2092
    "return true if the receiver includes any meta characters (i.e. $* or $#) 
2519
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2093
     for match operations; false if not.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2094
     Here, do not care for $\ escapes"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2095
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2096
    ^ self includesAny:'*#['
2519
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2097
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2098
    "
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2099
     '*foo' includesMatchCharacters   
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2100
     '\*foo' includesMatchCharacters  
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2101
     '\*foo' includesUnescapedMatchCharacters  
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2102
     '*foo' includesMatchCharacters   
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2103
     '\\*foo' includesMatchCharacters
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2104
     'foo*' includesMatchCharacters
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2105
     'foo\*' includesMatchCharacters
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2106
     'foo\' includesMatchCharacters
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2107
    "
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2108
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2109
    "Modified: 2.4.1997 / 18:12:34 / cg"
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2110
!
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2111
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2112
includesUnescapedMatchCharacters
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2113
    "return true if the receiver really includes any meta characters (i.e. $* or $#) 
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2114
     for match operations; false if not.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2115
     Here, care for $\ escapes"
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2116
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2117
    |idx sz|
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2118
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2119
    idx := 1.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2120
    sz := self size.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2121
    [true] whileTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2122
        idx := self indexOfAny:'*#[\' startingAt:idx.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2123
        idx == 0 ifTrue:[^ false].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2124
        (self at:idx) == $\ ifFalse:[^ true].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2125
        idx := idx + 2.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2126
        idx > sz ifTrue:[^ false].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2127
    ].    
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2128
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2129
    "
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2130
     '*foo' includesUnescapedMatchCharacters
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2131
     '\*foo' includesUnescapedMatchCharacters
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2132
     '*foo' includesUnescapedMatchCharacters
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2133
     '\\*foo' includesUnescapedMatchCharacters
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2134
     'foo*' includesUnescapedMatchCharacters
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2135
     'foo\*' includesUnescapedMatchCharacters
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2136
     'foo\' includesUnescapedMatchCharacters
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2137
    "
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2138
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2139
    "Modified: 2.4.1997 / 17:08:52 / cg"
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2140
    "Created: 2.4.1997 / 17:23:26 / cg"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2141
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2142
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2143
indexOfNonSeparatorStartingAt:startIndex
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2144
    "return the index of the next non-whitespace character"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2145
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2146
    |start  "{ Class: SmallInteger }"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2147
     mySize "{ Class: SmallInteger }"|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2148
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2149
    start := startIndex.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2150
    mySize := self size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2151
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2152
    start to:mySize do:[:index |
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2153
	(self at:index) isSeparator ifFalse:[^ index]
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2154
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2155
    ^ 0
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2156
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2157
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2158
     '    hello world' indexOfNonSeparatorStartingAt:1 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2159
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2160
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2161
     |s index1 index2|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2162
     s := '   foo    bar      baz'.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2163
     index1 := s indexOfNonSeparatorStartingAt:1.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2164
     index2 := s indexOfSeparatorStartingAt:index1.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2165
     s copyFrom:index1 to:index2 - 1
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2166
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2167
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2168
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2169
indexOfSeparator
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2170
    "return the index of the first whitespace character"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2171
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2172
    ^ self indexOfSeparatorStartingAt:1
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2173
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2174
    "'hello world' indexOfSeparator"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2175
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2176
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2177
indexOfSeparatorStartingAt:startIndex
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2178
    "return the index of the next whitespace character"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2179
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2180
    |start  "{ Class: SmallInteger }"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2181
     mySize "{ Class: SmallInteger }"|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2182
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2183
    start := startIndex.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2184
    mySize := self size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2185
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2186
    start to:mySize do:[:index |
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2187
	(self at:index) isSeparator ifTrue:[^ index]
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2188
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2189
    ^ 0
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2190
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2191
    "'hello world' indexOfSeparatorStartingAt:3"
2519
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2192
!
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2193
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2194
withMatchEscapes
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2195
    "return a copy of the receiver with all match characters escaped
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2196
     by $\ characters (to be usable as a match string).
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2197
     Return the receiver, if there are none."
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2198
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2199
    |in out c|
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2200
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2201
    in := self readStream.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2202
    out := '' writeStream.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2203
    [in atEnd] whileFalse:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2204
        c := in next.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2205
        ('*[#\' includes:c) ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2206
            out nextPut:$\.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2207
        ].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2208
        out nextPut:c.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2209
    ].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2210
    ^ out contents.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2211
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2212
    "
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2213
     '*foo' withMatchEscapes   
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2214
     '\*foo' withMatchEscapes  
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2215
     '*foo' withMatchEscapes   
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2216
     '\\*foo' withMatchEscapes 
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2217
     'foo*' withMatchEscapes   
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2218
     'foo\*' withMatchEscapes  
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2219
     'foo\' withMatchEscapes  
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2220
     'f*o*o' withMatchEscapes   
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2221
    "
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2222
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2223
    "Modified: 2.4.1997 / 18:13:04 / cg"
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2224
!
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2225
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2226
withoutMatchEscapes
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2227
    "return a copy of the receiver with all $\ removed or
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2228
     the receiver, if there are none."
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2229
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2230
    |in out c|
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2231
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2232
    in := self readStream.
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  2233
    out := self species new writeStream.
2519
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2234
    [in atEnd] whileFalse:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2235
        c := in next.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2236
        c == $\ ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2237
            in atEnd ifFalse:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2238
                c := in next.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2239
            ]
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2240
        ].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2241
        out nextPut:c.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2242
    ].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2243
    ^ out contents.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2244
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2245
    "
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2246
     '*foo' withoutMatchEscapes   
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2247
     '\*foo' withoutMatchEscapes  
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2248
     '*foo' withoutMatchEscapes   
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2249
     '\\*foo' withoutMatchEscapes 
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2250
     'foo*' withoutMatchEscapes   
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2251
     'foo\*' withoutMatchEscapes  
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2252
     'foo\' withoutMatchEscapes  
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2253
     'f\*o\*o' withoutMatchEscapes   
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2254
    "
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  2255
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  2256
    "Modified: 30.6.1997 / 13:40:23 / cg"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2257
! !
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2258
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2259
!CharacterArray methodsFor:'comparing'!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2260
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2261
< something
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2262
    "Compare the receiver with the argument and return true if the
1252
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2263
     receiver is less than the argument. Otherwise return false.
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2264
     This comparison is based on the elements ascii code - 
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2265
     i.e. upper/lowercase & national characters are NOT treated specially."
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2266
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2267
    ^ (something > self)
1252
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2268
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2269
    "Modified: 22.4.1996 / 15:54:54 / cg"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2270
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2271
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2272
<= something
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2273
    "Compare the receiver with the argument and return true if the
1252
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2274
     receiver is less than or equal to the argument. Otherwise return false.
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2275
     This comparison is based on the elements ascii code - 
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2276
     i.e. upper/lowercase & national characters are NOT treated specially."
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2277
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2278
    ^ (self > something) not
1252
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2279
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2280
    "Modified: 22.4.1996 / 15:54:56 / cg"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2281
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2282
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2283
= aString
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2284
    "Compare the receiver with the argument and return true if the
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2285
     receiver is equal to the argument. Otherwise return false.
1252
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2286
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2287
     This compare does NOT ignore case differences, 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2288
     therefore 'foo' = 'Foo' will return false.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2289
     Since this is incompatible to ST-80 (at least, V2.x) , this may change."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2290
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2291
    |mySize    "{ Class: SmallInteger }"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2292
     otherSize |
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2293
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2294
    aString species == self species ifFalse:[^ false].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2295
    mySize := self size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2296
    otherSize := aString size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2297
    mySize == otherSize ifFalse:[^ false].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2298
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2299
    1 to:mySize do:[:index |
1252
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2300
        (self at:index) = (aString at:index) ifFalse:[^ false].
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2301
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2302
    ^ true
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2303
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2304
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2305
     'foo' = 'Foo'  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2306
     'foo' = 'bar'  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2307
     'foo' = 'foo'   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2308
    "
1252
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2309
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2310
    "Modified: 22.4.1996 / 15:53:58 / cg"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2311
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2312
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2313
> aString
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2314
    "Compare the receiver with the argument and return true if the
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2315
     receiver is greater than the argument. Otherwise return false.
1252
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2316
     This comparison is based on the elements ascii code - 
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2317
     i.e. upper/lowercase & upper/lowercase & national characters are NOT treated specially."
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2318
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2319
    |mySize    "{ Class: SmallInteger }"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2320
     otherSize "{ Class: SmallInteger }" 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2321
     n         "{ Class: SmallInteger }" 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2322
     c1 c2|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2323
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2324
    mySize := self size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2325
    otherSize := aString size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2326
    n := mySize min:otherSize.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2327
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2328
    1 to:n do:[:index |
1252
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2329
        c1 := self at:index.
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2330
        c2 := aString at:index.
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2331
        c1 > c2 ifTrue:[^ true].
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2332
        c1 < c2 ifTrue:[^ false].
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2333
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2334
    ^ mySize > otherSize
1252
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2335
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2336
    "Modified: 22.4.1996 / 15:55:00 / cg"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2337
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2338
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2339
>= something
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2340
    "Compare the receiver with the argument and return true if the
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2341
     receiver is greater than or equal to the argument.
1252
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2342
     Otherwise return false.
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2343
     This comparison is based on the elements ascii code - 
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2344
     i.e. upper/lowercase & national characters are NOT treated specially."
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2345
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2346
    ^ (something > self) not
1252
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2347
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2348
    "Modified: 22.4.1996 / 15:55:03 / cg"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2349
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2350
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2351
compareWith:aString
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2352
    "Compare the receiver with the argument and return 1 if the receiver is
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2353
     greater, 0 if equal and -1 if less than the argument. 
1252
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2354
     This comparison is based on the elements ascii code - 
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2355
     i.e. upper/lowercase & national characters are NOT treated specially.
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2356
     'foo' compareWith: 'Foo' will return 1.
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2357
     while 'foo' sameAs:'Foo' will return true"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2358
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2359
    |mySize    "{ Class: SmallInteger }"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2360
     otherSize "{ Class: SmallInteger }" 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2361
     n         "{ Class: SmallInteger }" 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2362
     c1 c2|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2363
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2364
    mySize := self size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2365
    otherSize := aString size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2366
    n := mySize min:otherSize.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2367
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2368
    1 to:n do:[:index |
1252
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2369
        c1 := self at:index.
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2370
        c2 := aString at:index.
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2371
        c1 > c2 ifTrue:[^ 1].
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2372
        c1 < c2 ifTrue:[^ -1].
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2373
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2374
    mySize > otherSize ifTrue:[^ 1].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2375
    mySize < otherSize ifTrue:[^ -1].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2376
    ^ 0
1252
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2377
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2378
    "Modified: 22.4.1996 / 15:56:07 / cg"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2379
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2380
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2381
hash
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2382
    "return an integer useful as a hash-key"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2383
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2384
%{  /* NOCONTEXT */
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2385
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2386
    REGISTER int g, val;
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2387
    REGISTER unsigned char *cp, *cp0;
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2388
    int l;
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2389
1133
961f2b095c22 underline cleanup
Claus Gittinger <cg@exept.de>
parents: 1124
diff changeset
  2390
    cp = __stringVal(self);
961f2b095c22 underline cleanup
Claus Gittinger <cg@exept.de>
parents: 1124
diff changeset
  2391
    l = __stringSize(self);
961f2b095c22 underline cleanup
Claus Gittinger <cg@exept.de>
parents: 1124
diff changeset
  2392
    if (__qClass(self) != @global(String)) {
961f2b095c22 underline cleanup
Claus Gittinger <cg@exept.de>
parents: 1124
diff changeset
  2393
	int n = __OBJS2BYTES__(__intVal(__ClassInstPtr(__qClass(self))->c_ninstvars));
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2394
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2395
	cp += n;
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2396
	l -= n;
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2397
    }
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2398
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2399
    /*
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2400
     * this is the dragon-book algorithm with a funny start
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2401
     * value (to give short strings a number above 8192)
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2402
     */
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2403
    val = 12345;
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2404
    for (cp0 = cp, cp += l - 1; cp >= cp0; cp--) {
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2405
	val = (val << 5) + (*cp & 0x1F);
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2406
	if (g = (val & 0x3E000000))
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2407
	    val ^= g >> 25 /* 23 */ /* 25 */;
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2408
	val &= 0x3FFFFFFF;
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2409
    }
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2410
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2411
    if (l) {
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2412
	l |= 1; 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2413
	val = (val * l) & 0x3FFFFFFF;
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2414
    }
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2415
1133
961f2b095c22 underline cleanup
Claus Gittinger <cg@exept.de>
parents: 1124
diff changeset
  2416
    RETURN ( __MKSMALLINT(val) );
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2417
%}
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2418
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2419
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2420
sameAs:aString
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2421
    "Compare the receiver with the argument like =, but ignore
1252
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2422
     case differences. 
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2423
     Return true or false."
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2424
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2425
    |mySize "{ Class: SmallInteger }"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2426
     otherSize c1 c2|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2427
2581
d06e3bbfc294 faster #sameAs: if string is identical
Claus Gittinger <cg@exept.de>
parents: 2561
diff changeset
  2428
    self == aString ifTrue:[^ self].
d06e3bbfc294 faster #sameAs: if string is identical
Claus Gittinger <cg@exept.de>
parents: 2561
diff changeset
  2429
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2430
    mySize := self size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2431
    otherSize := aString size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2432
    mySize == otherSize ifFalse:[^ false].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2433
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2434
    1 to:mySize do:[:index |
1252
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2435
        c1 := self at:index.
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2436
        c2 := aString at:index.
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2437
        c1 == c2 ifFalse:[
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2438
            c1 asLowercase = c2 asLowercase ifFalse:[^ false].
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2439
        ]
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2440
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2441
    ^ true
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2442
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2443
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2444
     'foo' sameAs: 'Foo'   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2445
     'foo' sameAs: 'bar' 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2446
     'foo' sameAs: 'foo'   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2447
    "
1252
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2448
105280fd8d72 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1250
diff changeset
  2449
    "Modified: 22.4.1996 / 15:56:17 / cg"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2450
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2451
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2452
sameCharacters:aString
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2453
    "count & return the number of characters which are the same
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2454
     (ignoring case) in the receiver and the argument, aString."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2455
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2456
    |n "{ Class: SmallInteger }"
986
c3a9f590146d removed unused locals
Claus Gittinger <cg@exept.de>
parents: 915
diff changeset
  2457
     c1 c2 cnt|
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2458
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2459
    n := self size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2460
    n := n min:(aString size).
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2461
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2462
    cnt := 0.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2463
    1 to:n do:[:index |
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2464
	c1 := self at:index.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2465
	c2 := aString at:index.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2466
	((c1 == c2)
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2467
	or:[c1 asLowercase = c2 asLowercase]) ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2468
	    cnt := cnt + 1
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2469
	]
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2470
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2471
    ^ cnt
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2472
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2473
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2474
     'foobarbaz' sameCharacters: 'foo'   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2475
     'foobarbaz' sameCharacters: 'Foo'   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2476
     'foobarbaz' sameCharacters: 'baz'   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2477
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2478
! !
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2479
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2480
!CharacterArray methodsFor:'converting'!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2481
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2482
asArrayOfSubstrings
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2483
    "return an array of substrings from the receiver, interpreting
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2484
     separators (i.e. spaces & newlines) as word-delimiters.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2485
     This is a compatibility method - the actual work is done in
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2486
     asCollectionOfWords."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2487
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2488
    ^ self asCollectionOfWords asArray
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2489
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2490
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2491
     '1 one two three four 5 five' asArrayOfSubstrings  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2492
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2493
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2494
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2495
asCollectionOfLines
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2496
    "return a collection containing the lines (separated by cr) 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2497
     of the receiver. If multiple cr's occur in a row, the result will
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2498
     contain empty strings."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2499
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2500
    ^ self asCollectionOfSubstringsSeparatedBy:Character cr
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2501
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2502
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2503
     '1 one\2 two\3 three\4 four\5 five' withCRs asCollectionOfLines
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2504
     '1 one\\\\2 two\3 three' withCRs asCollectionOfLines  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2505
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2506
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2507
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2508
asCollectionOfSubstringsSeparatedBy:aCharacter
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2509
    "return a collection containing the lines (separated by aCharacter) 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2510
     of the receiver. If aCharacter occurs multiple times in a row, 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2511
     the result will contain empty strings."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2512
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2513
    |lines myClass
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2514
     numberOfLines "{ Class:SmallInteger }"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2515
     startIndex    "{ Class:SmallInteger }"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2516
     stopIndex     "{ Class:SmallInteger }" |
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2517
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2518
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2519
     count first, to avoid regrowing of the OC
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2520
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2521
    numberOfLines := (self occurrencesOf:aCharacter) + 1.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2522
    lines := OrderedCollection new:numberOfLines.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2523
    myClass := self species.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2524
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2525
    startIndex := 1.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2526
    1 to:numberOfLines do:[:lineNr |
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2527
	stopIndex := self indexOf:aCharacter startingAt:startIndex.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2528
	stopIndex == 0 ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2529
	    stopIndex := self size
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2530
	] ifFalse: [
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2531
	    stopIndex := stopIndex - 1.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2532
	].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2533
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2534
	(stopIndex < startIndex) ifTrue: [
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2535
	    lines add:(myClass new:0)
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2536
	] ifFalse: [
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2537
	    lines add:(self copyFrom:startIndex to:stopIndex)
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2538
	].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2539
	startIndex := stopIndex + 2
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2540
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2541
    ^ lines
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2542
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2543
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2544
     '1 one:2 two:3 three:4 four:5 five' withCRs asCollectionOfSubstringsSeparatedBy:$: 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2545
     '1 one 2 two 3 three 4 four 5 five' withCRs asCollectionOfSubstringsSeparatedBy:Character space
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2546
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2547
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2548
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2549
asCollectionOfSubstringsSeparatedByAny:aCollectionOfSeparators
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2550
    "return a collection containing the words (separated by any character
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2551
     from aCollectionOfSeparators) of the receiver.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2552
     This allows breaking up strings using any character as separator."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2553
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2554
    |words
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2555
     start  "{ Class:SmallInteger }" 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2556
     stop   "{ Class:SmallInteger }" 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2557
     mySize "{ Class:SmallInteger }"|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2558
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2559
    words := OrderedCollection new.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2560
    start := 1.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2561
    mySize := self size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2562
    [start <= mySize] whileTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2563
	"skip multiple separators"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2564
	[aCollectionOfSeparators includes:(self at:start)] whileTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2565
	    start := start + 1 .
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2566
	    start > mySize ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2567
		^ words
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2568
	    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2569
	].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2570
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2571
	stop := self indexOfAny:aCollectionOfSeparators startingAt:start.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2572
	stop == 0 ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2573
	    words add:(self copyFrom:start to:mySize).
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2574
	    ^ words
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2575
	].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2576
	words add:(self copyFrom:start to:(stop - 1)).
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2577
	start := stop
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2578
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2579
    ^ words
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2580
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2581
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2582
     'hello:world:isnt:this nice' asCollectionOfSubstringsSeparatedByAny:#($:)
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2583
     'hello:world:isnt:this nice' asCollectionOfSubstringsSeparatedByAny:':' 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2584
     'hello:world:isnt:this nice' asCollectionOfSubstringsSeparatedByAny:(Array with:$: with:Character space) 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2585
     'hello:world:isnt:this nice' asCollectionOfSubstringsSeparatedByAny:': ' 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2586
     'h1e2l3l4o' asCollectionOfSubstringsSeparatedByAny:($1 to: $9) 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2587
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2588
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2589
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2590
asCollectionOfWords
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2591
    "return a collection containing the words (separated by whitespace) 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2592
     of the receiver. Multiple occurences of whitespace characters will
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2593
     be treated like one - i.e. whitespace is skipped."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2594
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2595
    |words
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2596
     start  "{ Class:SmallInteger }" 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2597
     stop   "{ Class:SmallInteger }" 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2598
     mySize "{ Class:SmallInteger }"|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2599
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2600
    words := OrderedCollection new.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2601
    start := 1.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2602
    mySize := self size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2603
    [start <= mySize] whileTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2604
	start := self indexOfNonSeparatorStartingAt:start.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2605
	start == 0 ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2606
	    ^ words
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2607
	].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2608
	stop := self indexOfSeparatorStartingAt:start.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2609
	stop == 0 ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2610
	    words add:(self copyFrom:start to:mySize).
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2611
	    ^ words
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2612
	].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2613
	words add:(self copyFrom:start to:(stop - 1)).
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2614
	start := stop
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2615
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2616
    ^ words
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2617
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2618
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2619
     'hello world isnt this nice' asCollectionOfWords
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2620
     '    hello    world   isnt   this   nice  ' asCollectionOfWords
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2621
     'hello' asCollectionOfWords
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2622
     '' asCollectionOfWords
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2623
     '      ' asCollectionOfWords
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2624
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2625
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2626
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2627
asComposedText
1312
0003988bb066 commentary
Claus Gittinger <cg@exept.de>
parents: 1311
diff changeset
  2628
    "ST-80 compatibility 
0003988bb066 commentary
Claus Gittinger <cg@exept.de>
parents: 1311
diff changeset
  2629
     - ST/X does not (as today) support composedTexts."
0003988bb066 commentary
Claus Gittinger <cg@exept.de>
parents: 1311
diff changeset
  2630
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2631
    ^ ComposedText fromString:self
1312
0003988bb066 commentary
Claus Gittinger <cg@exept.de>
parents: 1311
diff changeset
  2632
0003988bb066 commentary
Claus Gittinger <cg@exept.de>
parents: 1311
diff changeset
  2633
    "Modified: 27.4.1996 / 13:30:30 / cg"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2634
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2635
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2636
asFilename
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2637
    "return a Filename with pathname taken from the receiver"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2638
1423
d9f177657164 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1422
diff changeset
  2639
    ^ Filename named:self string
d9f177657164 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1422
diff changeset
  2640
d9f177657164 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1422
diff changeset
  2641
    "Modified: 20.5.1996 / 09:38:15 / cg"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2642
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2643
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2644
asFloat
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2645
    "read a float number from the receiver.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2646
     Notice, that errors may occur during the read, so you better
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2647
     setup some signal handler when using this method."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2648
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2649
    ^ (Number readFromString:self) asFloat
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2650
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2651
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2652
     '0.123' asFloat 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2653
     '12345' asFloat
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2654
     '(1/5)' asFloat
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2655
     Object errorSignal handle:[:ex | ex returnWith:0] do:['foo' asFloat] 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2656
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2657
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2658
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2659
asInteger
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2660
    "read an integer from the receiver.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2661
     Notice, that errors may occur during the read, so you better
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2662
     setup some signal handler when using this method."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2663
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2664
    ^ Integer readFromString:self
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2665
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2666
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2667
     '12345678901234567890' asInteger
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2668
     '-1234' asInteger
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2669
     '0.123' asInteger   <- reader stops at ., returning 0 here
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2670
     '0.123' asNumber    <- returns what you expect
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2671
     Object errorSignal handle:[:ex | ex returnWith:0] do:['foo' asInteger] 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2672
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2673
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2674
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2675
asLowercase
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2676
    "return a copy of myself in lowercase letters"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2677
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2678
    |newStr
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2679
     mySize "{ Class: SmallInteger }" |
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2680
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2681
    mySize := self size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2682
    newStr := self species new:mySize.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2683
    1 to:mySize do:[:i |
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2684
	newStr at:i put:(self at:i) asLowercase
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2685
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2686
    ^ newStr
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2687
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2688
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2689
     'HelloWorld' asLowercase   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2690
     'HelloWorld' asLowercaseFirst   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2691
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2692
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2693
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2694
asLowercaseFirst
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2695
    "return a copy of myself where the first character is
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2696
     converted to lowercase."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2697
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2698
    |newString sz|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2699
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2700
    sz := self size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2701
    newString := self copyFrom:1 to:sz.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2702
    sz > 0 ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2703
	newString at:1 put:(newString at:1) asLowercase
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2704
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2705
    ^ newString
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2706
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2707
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2708
     'HelloWorld' asLowercase   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2709
     'HelloWorld' asLowercaseFirst   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2710
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2711
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2712
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2713
asNumber
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2714
    "read a number from the receiver.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2715
     Notice, that (in contrast to ST-80) errors may occur during the read, 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2716
     so you better setup some signal handler when using this method.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2717
     This may change if ANSI specifies it."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2718
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2719
"/ ST-80 behavior:
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2720
"/  ^ Number readFromString:self onError:0
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2721
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2722
    ^ Number readFromString:self
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2723
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2724
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2725
     '123'     asNumber
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2726
     '123.567' asNumber
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2727
     '(5/6)'   asNumber
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2728
     'foo'     asNumber
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2729
     Object errorSignal handle:[:ex | ex returnWith:0] do:['foo' asNumber] 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2730
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2731
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2732
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2733
asNumberFromFormatString:ignored
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2734
    "read a number from the receiver, ignoring any nonDigit characters.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2735
     This is typically used to convert from strings which include
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2736
     dollar-signs or millenium digits. However, this method also ignores
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2737
     the decimal point (if any) and therefore should be used with care."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2738
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2739
    |tempString|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2740
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2741
    tempString := self collect:[:char | char isDigit].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2742
    ^ Number readFromString:tempString onError:0
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2743
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2744
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2745
     'USD 123' asNumberFromFormatString:'foo'
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2746
     'DM 123'  asNumberFromFormatString:'foo'
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2747
     '123'     asNumberFromFormatString:'foo'
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2748
     '123.567' asNumberFromFormatString:'foo'
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2749
     '(5/6)'   asNumberFromFormatString:'foo'
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2750
     'foo'     asNumberFromFormatString:'foo'
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2751
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2752
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2753
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2754
asSingleByteString
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2755
    "return the receiver converted to a 'normal' string"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2756
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2757
    ^ String fromString:self
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2758
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2759
2732
79d06cdc7428 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2731
diff changeset
  2760
asSingleByteStringReplaceInvalidWith:replacementCharacter
79d06cdc7428 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2731
diff changeset
  2761
    "return the receiver converted to a 'normal' string,
79d06cdc7428 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2731
diff changeset
  2762
     with invalid characters replaced by replacementCharacter.
79d06cdc7428 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2731
diff changeset
  2763
     Can be used to convert from 16-bit strings to 8-bit strings
79d06cdc7428 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2731
diff changeset
  2764
     and replace characters above code-255 with some replacement."
79d06cdc7428 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2731
diff changeset
  2765
79d06cdc7428 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2731
diff changeset
  2766
    |newString|
79d06cdc7428 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2731
diff changeset
  2767
79d06cdc7428 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2731
diff changeset
  2768
    newString := String new:(self size).
79d06cdc7428 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2731
diff changeset
  2769
    1 to:self size do:[:idx |
79d06cdc7428 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2731
diff changeset
  2770
        |char|
79d06cdc7428 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2731
diff changeset
  2771
79d06cdc7428 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2731
diff changeset
  2772
        char := self at:idx.
79d06cdc7428 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2731
diff changeset
  2773
        char asciiValue <= 16rFF ifTrue:[
79d06cdc7428 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2731
diff changeset
  2774
            newString at:idx put:char
79d06cdc7428 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2731
diff changeset
  2775
        ] ifFalse:[
79d06cdc7428 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2731
diff changeset
  2776
            newString at:idx put:replacementCharacter
79d06cdc7428 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2731
diff changeset
  2777
        ].
79d06cdc7428 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2731
diff changeset
  2778
    ].
79d06cdc7428 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2731
diff changeset
  2779
    ^ newString
79d06cdc7428 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2731
diff changeset
  2780
79d06cdc7428 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2731
diff changeset
  2781
    "Created: 30.6.1997 / 13:02:14 / cg"
79d06cdc7428 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2731
diff changeset
  2782
!
79d06cdc7428 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2731
diff changeset
  2783
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2784
asString
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2785
    "return myself - I am a string"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2786
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2787
    ^ self
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2788
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2789
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2790
asStringCollection
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2791
    "return a collection of lines from myself."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2792
1393
7afc79a4cc11 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1388
diff changeset
  2793
    ^ StringCollection from:self "string"
7afc79a4cc11 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1388
diff changeset
  2794
7afc79a4cc11 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1388
diff changeset
  2795
    "Modified: 13.5.1996 / 20:36:59 / cg"
1375
f035bb7468f5 additions for new Text class
Claus Gittinger <cg@exept.de>
parents: 1333
diff changeset
  2796
!
f035bb7468f5 additions for new Text class
Claus Gittinger <cg@exept.de>
parents: 1333
diff changeset
  2797
1434
a317ba02d685 allow aText asSymbol
Claus Gittinger <cg@exept.de>
parents: 1429
diff changeset
  2798
asSymbol
a317ba02d685 allow aText asSymbol
Claus Gittinger <cg@exept.de>
parents: 1429
diff changeset
  2799
    "return a unique symbol with the name taken from my characters.
a317ba02d685 allow aText asSymbol
Claus Gittinger <cg@exept.de>
parents: 1429
diff changeset
  2800
     The receiver must be a singleByte-String; twoByteSymbols are
a317ba02d685 allow aText asSymbol
Claus Gittinger <cg@exept.de>
parents: 1429
diff changeset
  2801
     (currently) not allowed."
a317ba02d685 allow aText asSymbol
Claus Gittinger <cg@exept.de>
parents: 1429
diff changeset
  2802
a317ba02d685 allow aText asSymbol
Claus Gittinger <cg@exept.de>
parents: 1429
diff changeset
  2803
    ^ self string asSymbol
a317ba02d685 allow aText asSymbol
Claus Gittinger <cg@exept.de>
parents: 1429
diff changeset
  2804
a317ba02d685 allow aText asSymbol
Claus Gittinger <cg@exept.de>
parents: 1429
diff changeset
  2805
    "Created: 22.5.1996 / 15:56:11 / cg"
a317ba02d685 allow aText asSymbol
Claus Gittinger <cg@exept.de>
parents: 1429
diff changeset
  2806
!
a317ba02d685 allow aText asSymbol
Claus Gittinger <cg@exept.de>
parents: 1429
diff changeset
  2807
1435
3ee1fde2aa58 allow asSymbolIfInterned for Texts
Claus Gittinger <cg@exept.de>
parents: 1434
diff changeset
  2808
asSymbolIfInterned
3ee1fde2aa58 allow asSymbolIfInterned for Texts
Claus Gittinger <cg@exept.de>
parents: 1434
diff changeset
  2809
    "if a symbol with the receivers characters is already known, return it.
3ee1fde2aa58 allow asSymbolIfInterned for Texts
Claus Gittinger <cg@exept.de>
parents: 1434
diff changeset
  2810
     Otherwise, return nil. This can be used to query for an existing
3ee1fde2aa58 allow asSymbolIfInterned for Texts
Claus Gittinger <cg@exept.de>
parents: 1434
diff changeset
  2811
     symbol and is the same as
3ee1fde2aa58 allow asSymbolIfInterned for Texts
Claus Gittinger <cg@exept.de>
parents: 1434
diff changeset
  2812
        self knownAsSymbol ifTrue:[self asSymbol] ifFalse:[nil]
3ee1fde2aa58 allow asSymbolIfInterned for Texts
Claus Gittinger <cg@exept.de>
parents: 1434
diff changeset
  2813
     but slightly faster, since the symbol lookup operation is only
3ee1fde2aa58 allow asSymbolIfInterned for Texts
Claus Gittinger <cg@exept.de>
parents: 1434
diff changeset
  2814
     performed once.
3ee1fde2aa58 allow asSymbolIfInterned for Texts
Claus Gittinger <cg@exept.de>
parents: 1434
diff changeset
  2815
     The receiver must be a singleByte-String; twoByteSymbols are
3ee1fde2aa58 allow asSymbolIfInterned for Texts
Claus Gittinger <cg@exept.de>
parents: 1434
diff changeset
  2816
     (currently) not allowed."
3ee1fde2aa58 allow asSymbolIfInterned for Texts
Claus Gittinger <cg@exept.de>
parents: 1434
diff changeset
  2817
3ee1fde2aa58 allow asSymbolIfInterned for Texts
Claus Gittinger <cg@exept.de>
parents: 1434
diff changeset
  2818
    ^ self string asSymbolIfInterned
3ee1fde2aa58 allow asSymbolIfInterned for Texts
Claus Gittinger <cg@exept.de>
parents: 1434
diff changeset
  2819
3ee1fde2aa58 allow asSymbolIfInterned for Texts
Claus Gittinger <cg@exept.de>
parents: 1434
diff changeset
  2820
    "Created: 22.5.1996 / 16:37:04 / cg"
3ee1fde2aa58 allow asSymbolIfInterned for Texts
Claus Gittinger <cg@exept.de>
parents: 1434
diff changeset
  2821
!
3ee1fde2aa58 allow asSymbolIfInterned for Texts
Claus Gittinger <cg@exept.de>
parents: 1434
diff changeset
  2822
1375
f035bb7468f5 additions for new Text class
Claus Gittinger <cg@exept.de>
parents: 1333
diff changeset
  2823
asText
1381
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  2824
    "return a Text-object (collection of lines) from myself."
1375
f035bb7468f5 additions for new Text class
Claus Gittinger <cg@exept.de>
parents: 1333
diff changeset
  2825
f035bb7468f5 additions for new Text class
Claus Gittinger <cg@exept.de>
parents: 1333
diff changeset
  2826
    ^ Text fromString:self
f035bb7468f5 additions for new Text class
Claus Gittinger <cg@exept.de>
parents: 1333
diff changeset
  2827
1381
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  2828
    "Created: 12.5.1996 / 10:41:14 / cg"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2829
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2830
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2831
asTwoByteString
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2832
    "return the receiver converted to a two-byte string"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2833
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2834
    ^ TwoByteString fromString:self
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2835
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2836
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2837
asUppercase
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2838
    "return a copy of myself in uppercase letters"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2839
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2840
    |newStr
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2841
     mySize "{ Class: SmallInteger }" |
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2842
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2843
    mySize := self size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2844
    newStr := self species new:mySize.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2845
    1 to:mySize do:[:i |
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2846
	newStr at:i put:(self at:i) asUppercase
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2847
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2848
    ^ newStr
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2849
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2850
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2851
     'helloWorld' asUppercase      
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2852
     'helloWorld' asUppercaseFirst 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2853
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2854
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2855
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2856
asUppercaseFirst
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2857
    "return a copy of myself where the first character is
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2858
     converted to uppercase."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2859
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2860
    |newString sz|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2861
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2862
    sz := self size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2863
    newString := self copyFrom:1 to:sz.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2864
    sz > 0 ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2865
	newString at:1 put:(newString at:1) asUppercase
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2866
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2867
    ^ newString
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2868
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2869
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2870
     'helloWorld' asUppercase      
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2871
     'helloWorld' asUppercaseFirst 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2872
     'HelloWorld' asUppercaseFirst   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2873
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2874
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2875
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2876
string
1312
0003988bb066 commentary
Claus Gittinger <cg@exept.de>
parents: 1311
diff changeset
  2877
    "return the receiver - for ST-80 compatibility"
0003988bb066 commentary
Claus Gittinger <cg@exept.de>
parents: 1311
diff changeset
  2878
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2879
    ^ self
1312
0003988bb066 commentary
Claus Gittinger <cg@exept.de>
parents: 1311
diff changeset
  2880
0003988bb066 commentary
Claus Gittinger <cg@exept.de>
parents: 1311
diff changeset
  2881
    "Modified: 27.4.1996 / 13:29:30 / cg"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2882
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2883
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2884
tokensBasedOn:aCharacter
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2885
    "this is an ST-80 alias for the ST/X method
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2886
	asCollectionOfSubstringsSeparatedBy:"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2887
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2888
    ^ self asCollectionOfSubstringsSeparatedBy:aCharacter
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2889
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2890
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2891
     'hello:world:isnt:this nice' tokensBasedOn:$:
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2892
     'foo,bar,baz' tokensBasedOn:$,
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2893
     '/etc/passwd' asFilename readStream nextLine tokensBasedOn:$:
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2894
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2895
! !
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2896
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2897
!CharacterArray methodsFor:'copying'!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2898
1050
2f14bbd792a9 , now accepts a character-argument
Claus Gittinger <cg@exept.de>
parents: 1028
diff changeset
  2899
, aStringOrCharacter
2720
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  2900
    "redefined to allow characters and mixed strings to be appended.
1050
2f14bbd792a9 , now accepts a character-argument
Claus Gittinger <cg@exept.de>
parents: 1028
diff changeset
  2901
     This is nonStandard, but convenient"
2f14bbd792a9 , now accepts a character-argument
Claus Gittinger <cg@exept.de>
parents: 1028
diff changeset
  2902
2720
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  2903
    |myWidth otherWidth|
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  2904
1050
2f14bbd792a9 , now accepts a character-argument
Claus Gittinger <cg@exept.de>
parents: 1028
diff changeset
  2905
    aStringOrCharacter isCharacter ifTrue:[
2f14bbd792a9 , now accepts a character-argument
Claus Gittinger <cg@exept.de>
parents: 1028
diff changeset
  2906
        ^ self , aStringOrCharacter asString
2f14bbd792a9 , now accepts a character-argument
Claus Gittinger <cg@exept.de>
parents: 1028
diff changeset
  2907
    ].
1396
5b5386311dbd concatenating mixed texts & strings
Claus Gittinger <cg@exept.de>
parents: 1393
diff changeset
  2908
    aStringOrCharacter isText ifTrue:[
5b5386311dbd concatenating mixed texts & strings
Claus Gittinger <cg@exept.de>
parents: 1393
diff changeset
  2909
        ^ aStringOrCharacter concatenateFromString:self
5b5386311dbd concatenating mixed texts & strings
Claus Gittinger <cg@exept.de>
parents: 1393
diff changeset
  2910
    ].
2720
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  2911
    aStringOrCharacter isString ifTrue:[
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  2912
        (otherWidth := aStringOrCharacter bitsPerCharacter) ~~ (myWidth := self bitsPerCharacter) ifTrue:[
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  2913
            otherWidth > myWidth ifTrue:[
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  2914
                ^ (aStringOrCharacter species fromString:self) , aStringOrCharacter
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  2915
            ].
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  2916
            ^ self , (self species fromString:aStringOrCharacter)
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  2917
        ].
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  2918
    ].
1050
2f14bbd792a9 , now accepts a character-argument
Claus Gittinger <cg@exept.de>
parents: 1028
diff changeset
  2919
    ^ super , aStringOrCharacter
2f14bbd792a9 , now accepts a character-argument
Claus Gittinger <cg@exept.de>
parents: 1028
diff changeset
  2920
2f14bbd792a9 , now accepts a character-argument
Claus Gittinger <cg@exept.de>
parents: 1028
diff changeset
  2921
    "
2f14bbd792a9 , now accepts a character-argument
Claus Gittinger <cg@exept.de>
parents: 1028
diff changeset
  2922
     'hello' , $1    
2f14bbd792a9 , now accepts a character-argument
Claus Gittinger <cg@exept.de>
parents: 1028
diff changeset
  2923
     'hello' , '1'   
1411
a3b3ab3044c6 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1396
diff changeset
  2924
     'hello' , (' world' asText allBold) 
2720
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  2925
     'hello' , (JISEncodedString fromString:' world') 
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  2926
     (JISEncodedString fromString:'hello') , ' world'
1411
a3b3ab3044c6 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1396
diff changeset
  2927
1422
9a0b792f2953 showCr: -> showCR:
Claus Gittinger <cg@exept.de>
parents: 1411
diff changeset
  2928
     Transcript showCR:
1411
a3b3ab3044c6 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1396
diff changeset
  2929
         (Text string:'hello' emphasis:#italic) , (Text string:' world' emphasis:#bold) 
1050
2f14bbd792a9 , now accepts a character-argument
Claus Gittinger <cg@exept.de>
parents: 1028
diff changeset
  2930
    "
2f14bbd792a9 , now accepts a character-argument
Claus Gittinger <cg@exept.de>
parents: 1028
diff changeset
  2931
2720
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  2932
    "Modified: 28.6.1997 / 00:13:17 / cg"
1050
2f14bbd792a9 , now accepts a character-argument
Claus Gittinger <cg@exept.de>
parents: 1028
diff changeset
  2933
!
2f14bbd792a9 , now accepts a character-argument
Claus Gittinger <cg@exept.de>
parents: 1028
diff changeset
  2934
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2935
concatenate:string1 and:string2
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2936
    "return the concatenation of myself and the arguments, string1 and string2.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2937
     This is equivalent to self , string1 , string2
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2938
     - generated by compiler when such a construct is detected and the receiver
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2939
     is known to be a string."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2940
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2941
    ^ self , string1 , string2
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2942
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2943
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2944
concatenate:string1 and:string2 and:string3
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2945
    "return the concatenation of myself and the string arguments.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2946
     This is equivalent to self , string1 , string2 , string3
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2947
     - generated by compiler when such a construct is detected and the receiver
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2948
     is known to be a string."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2949
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2950
    ^ self , string1 , string2 , string3
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2951
! !
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2952
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2953
!CharacterArray methodsFor:'displaying'!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2954
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2955
displayOn:aGc x:x y:y
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2956
    "display the receiver in a graphicsContext - this method allows
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2957
     strings to be used like DisplayObjects."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2958
1381
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  2959
    ^ aGc displayString:self string x:x y:y.
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  2960
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  2961
    "Modified: 11.5.1996 / 14:42:48 / cg"
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  2962
!
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  2963
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  2964
displayOpaqueOn:aGc x:x y:y
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  2965
    "display the receiver in a graphicsContext - this method allows
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  2966
     strings to be used like DisplayObjects."
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  2967
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  2968
    ^ aGc displayOpaqueString:self string x:x y:y.
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  2969
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  2970
    "Modified: 11.5.1996 / 14:42:48 / cg"
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  2971
    "Created: 12.5.1996 / 12:28:40 / cg"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2972
! !
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  2973
1375
f035bb7468f5 additions for new Text class
Claus Gittinger <cg@exept.de>
parents: 1333
diff changeset
  2974
!CharacterArray methodsFor:'emphasis'!
f035bb7468f5 additions for new Text class
Claus Gittinger <cg@exept.de>
parents: 1333
diff changeset
  2975
1396
5b5386311dbd concatenating mixed texts & strings
Claus Gittinger <cg@exept.de>
parents: 1393
diff changeset
  2976
emphasis
5b5386311dbd concatenating mixed texts & strings
Claus Gittinger <cg@exept.de>
parents: 1393
diff changeset
  2977
    "return the emphasis.
5b5386311dbd concatenating mixed texts & strings
Claus Gittinger <cg@exept.de>
parents: 1393
diff changeset
  2978
     Since characterArrays do not hold any emphasis information,
5b5386311dbd concatenating mixed texts & strings
Claus Gittinger <cg@exept.de>
parents: 1393
diff changeset
  2979
     nil (no emphasis) is returned here."
5b5386311dbd concatenating mixed texts & strings
Claus Gittinger <cg@exept.de>
parents: 1393
diff changeset
  2980
5b5386311dbd concatenating mixed texts & strings
Claus Gittinger <cg@exept.de>
parents: 1393
diff changeset
  2981
    ^ RunArray new:self size withAll:nil
5b5386311dbd concatenating mixed texts & strings
Claus Gittinger <cg@exept.de>
parents: 1393
diff changeset
  2982
5b5386311dbd concatenating mixed texts & strings
Claus Gittinger <cg@exept.de>
parents: 1393
diff changeset
  2983
    "Created: 14.5.1996 / 13:58:58 / cg"
5b5386311dbd concatenating mixed texts & strings
Claus Gittinger <cg@exept.de>
parents: 1393
diff changeset
  2984
!
5b5386311dbd concatenating mixed texts & strings
Claus Gittinger <cg@exept.de>
parents: 1393
diff changeset
  2985
1375
f035bb7468f5 additions for new Text class
Claus Gittinger <cg@exept.de>
parents: 1333
diff changeset
  2986
emphasisAt:characterIndex
f035bb7468f5 additions for new Text class
Claus Gittinger <cg@exept.de>
parents: 1333
diff changeset
  2987
    "return the emphasis at some index.
f035bb7468f5 additions for new Text class
Claus Gittinger <cg@exept.de>
parents: 1333
diff changeset
  2988
     Since characterArrays do not hold any emphasis information,
f035bb7468f5 additions for new Text class
Claus Gittinger <cg@exept.de>
parents: 1333
diff changeset
  2989
     nil (no emphasis) is returned here."
f035bb7468f5 additions for new Text class
Claus Gittinger <cg@exept.de>
parents: 1333
diff changeset
  2990
f035bb7468f5 additions for new Text class
Claus Gittinger <cg@exept.de>
parents: 1333
diff changeset
  2991
    ^ nil
f035bb7468f5 additions for new Text class
Claus Gittinger <cg@exept.de>
parents: 1333
diff changeset
  2992
f035bb7468f5 additions for new Text class
Claus Gittinger <cg@exept.de>
parents: 1333
diff changeset
  2993
    "Created: 11.5.1996 / 14:13:27 / cg"
1396
5b5386311dbd concatenating mixed texts & strings
Claus Gittinger <cg@exept.de>
parents: 1393
diff changeset
  2994
!
5b5386311dbd concatenating mixed texts & strings
Claus Gittinger <cg@exept.de>
parents: 1393
diff changeset
  2995
5b5386311dbd concatenating mixed texts & strings
Claus Gittinger <cg@exept.de>
parents: 1393
diff changeset
  2996
emphasisCollection
5b5386311dbd concatenating mixed texts & strings
Claus Gittinger <cg@exept.de>
parents: 1393
diff changeset
  2997
    "return the emphasis.
5b5386311dbd concatenating mixed texts & strings
Claus Gittinger <cg@exept.de>
parents: 1393
diff changeset
  2998
     Since characterArrays do not hold any emphasis information,
5b5386311dbd concatenating mixed texts & strings
Claus Gittinger <cg@exept.de>
parents: 1393
diff changeset
  2999
     nil (no emphasis) is returned here."
5b5386311dbd concatenating mixed texts & strings
Claus Gittinger <cg@exept.de>
parents: 1393
diff changeset
  3000
5b5386311dbd concatenating mixed texts & strings
Claus Gittinger <cg@exept.de>
parents: 1393
diff changeset
  3001
    ^ RunArray new:(self size)
5b5386311dbd concatenating mixed texts & strings
Claus Gittinger <cg@exept.de>
parents: 1393
diff changeset
  3002
5b5386311dbd concatenating mixed texts & strings
Claus Gittinger <cg@exept.de>
parents: 1393
diff changeset
  3003
    "Created: 14.5.1996 / 13:58:58 / cg"
5b5386311dbd concatenating mixed texts & strings
Claus Gittinger <cg@exept.de>
parents: 1393
diff changeset
  3004
    "Modified: 14.5.1996 / 15:02:29 / cg"
1375
f035bb7468f5 additions for new Text class
Claus Gittinger <cg@exept.de>
parents: 1333
diff changeset
  3005
! !
f035bb7468f5 additions for new Text class
Claus Gittinger <cg@exept.de>
parents: 1333
diff changeset
  3006
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3007
!CharacterArray methodsFor:'encoding/decoding'!
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3008
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3009
decodeFrom:encodingSymbol
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3010
    "given the receiver encoded as described by encodingSymbol, 
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  3011
     convert it into internal ST/X encoding and return a corresponding CharacterArray.
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3012
     Currently, only a few encodings are supported 
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3013
     (and those are untested/incomplete):
1005
4507487e0be6 more on encoding/decoding (still unfinished)
Claus Gittinger <cg@exept.de>
parents: 997
diff changeset
  3014
        #euc
997
6c7f1815ed02 more encoding & decoding support
Claus Gittinger <cg@exept.de>
parents: 991
diff changeset
  3015
        #jis7
2728
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  3016
        #sjis
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3017
        #mac
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3018
        #msdos
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3019
    "
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3020
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
  3021
    |newString table|
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3022
2736
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3023
    encodingSymbol == #'x-jis7' ifTrue:[
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3024
        ^ JISEncodedString decodeFromJIS7:self
1005
4507487e0be6 more on encoding/decoding (still unfinished)
Claus Gittinger <cg@exept.de>
parents: 997
diff changeset
  3025
    ].
2736
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3026
    encodingSymbol == #'x-sjis' ifTrue:[
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3027
        ^ JISEncodedString decodeFromSJIS:self
2720
98c21d9a07c0 allow mixed strings to be concatenated using comma
Claus Gittinger <cg@exept.de>
parents: 2692
diff changeset
  3028
    ].
2736
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3029
    encodingSymbol == #'x-shift-jis' ifTrue:[
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3030
        ^ JISEncodedString decodeFromSJIS:self
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3031
    ].
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3032
    encodingSymbol == #'x-euc-jp' ifTrue:[
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3033
        ^ JISEncodedString decodeFromEUC:self
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  3034
    ].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  3035
    encodingSymbol == #big5 ifTrue:[
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3036
        ^ BIG5EncodedString decodeFromBIG5:self
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3037
    ].
2736
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3038
    encodingSymbol == #gb2312 ifTrue:[
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3039
        ^ GBEncodedString decodeFromGB:self
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3040
    ].
2736
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3041
    encodingSymbol == #'x-euc-gb' ifTrue:[
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3042
        ^ GBEncodedString decodeFromEUC:self
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3043
    ].
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3044
    encodingSymbol == #'x-euc-kr' ifTrue:[
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3045
        ^ KSCEncodedString decodeFromEUC:self
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3046
    ].
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3047
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3048
    encodingSymbol == #ucs2 ifTrue:[
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3049
        ^ UnicodeString decodeFromUCS2:self
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3050
    ].
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3051
    encodingSymbol == #ucs4 ifTrue:[
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3052
        ^ UnicodeString decodeFromUCS4:self
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3053
    ].
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3054
    encodingSymbol == #utf7 ifTrue:[
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3055
        ^ UnicodeString decodeFromUTF7:self
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3056
    ].
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3057
    encodingSymbol == #utf8 ifTrue:[
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3058
        ^ UnicodeString decodeFromUTF8:self
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3059
    ].
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3060
    encodingSymbol == #utf16 ifTrue:[
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3061
        ^ UnicodeString decodeFromUTF16:self
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  3062
    ].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  3063
2736
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3064
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3065
    "/ soon obsolete ...
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3066
    encodingSymbol == #jis7 ifTrue:[
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3067
        ^ JISEncodedString decodeFromJIS7:self
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3068
    ].
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3069
    encodingSymbol == #sjis ifTrue:[
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3070
        ^ JISEncodedString decodeFromSJIS:self
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3071
    ].
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3072
    encodingSymbol == #euc ifTrue:[
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3073
        ^ JISEncodedString decodeFromEUC:self
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3074
    ].
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3075
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3076
    encodingSymbol == #gb ifTrue:[
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3077
        ^ GBEncodedString decodeFromGB:self
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3078
    ].
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3079
    encodingSymbol == #ksc ifTrue:[
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3080
        ^ KSCEncodedString decodeFromKSC:self
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3081
    ].
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3082
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
  3083
    table := self class decoderTableFor:encodingSymbol.
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
  3084
    table isNil ifTrue:[^ self].
2736
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3085
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3086
    newString := self class new:self size.
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
  3087
    newString replaceFrom:self translateBy:table.
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3088
    newString = self ifTrue:[^ self].
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3089
    ^ newString
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3090
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3091
    "Created: 22.2.1996 / 15:06:49 / cg"
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
  3092
    "Modified: 1.7.1997 / 17:18:41 / cg"
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3093
!
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3094
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3095
encodeForDisplayInto:encodingSymbol
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3096
    "given the receiver in internal ST/X encoding, 
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3097
     convert it into the format as described by encodingSymbol
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3098
     for display.
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3099
     The receiver may contain roman characters in 0..255, which
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3100
     are translated according to the encodings roman translation.
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3101
     For example: in JIS, roman $a is found at 16r2361.
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3102
    "
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3103
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
  3104
    |newString table|
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
  3105
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3106
    encodingSymbol isNil ifTrue:[^ self].
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3107
2742
3add77872fea care for halfWidth kanakana characters
Claus Gittinger <cg@exept.de>
parents: 2740
diff changeset
  3108
    "/ notice: this method may not be moved to
3add77872fea care for halfWidth kanakana characters
Claus Gittinger <cg@exept.de>
parents: 2740
diff changeset
  3109
    "/ concrete TwoByte classes; it is also invoked
3add77872fea care for halfWidth kanakana characters
Claus Gittinger <cg@exept.de>
parents: 2740
diff changeset
  3110
    "/ to convert singleByte strings ...
3add77872fea care for halfWidth kanakana characters
Claus Gittinger <cg@exept.de>
parents: 2740
diff changeset
  3111
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3112
    (encodingSymbol startsWith:'jis') ifTrue:[
2742
3add77872fea care for halfWidth kanakana characters
Claus Gittinger <cg@exept.de>
parents: 2740
diff changeset
  3113
        newString := JISEncodedString encodeRomans:self.
3add77872fea care for halfWidth kanakana characters
Claus Gittinger <cg@exept.de>
parents: 2740
diff changeset
  3114
3add77872fea care for halfWidth kanakana characters
Claus Gittinger <cg@exept.de>
parents: 2740
diff changeset
  3115
        "/ jisx0208.1983 has no half-width katakana
3add77872fea care for halfWidth kanakana characters
Claus Gittinger <cg@exept.de>
parents: 2740
diff changeset
  3116
        (encodingSymbol startsWith:'jisx0208.1983') ifTrue:[
3add77872fea care for halfWidth kanakana characters
Claus Gittinger <cg@exept.de>
parents: 2740
diff changeset
  3117
            newString := JISEncodedString encodeHalfWidthKatakana:newString.
3add77872fea care for halfWidth kanakana characters
Claus Gittinger <cg@exept.de>
parents: 2740
diff changeset
  3118
        ].
3add77872fea care for halfWidth kanakana characters
Claus Gittinger <cg@exept.de>
parents: 2740
diff changeset
  3119
        ^ newString
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3120
    ].
2742
3add77872fea care for halfWidth kanakana characters
Claus Gittinger <cg@exept.de>
parents: 2740
diff changeset
  3121
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3122
    (encodingSymbol startsWith:'big5') ifTrue:[
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3123
        ^ BIG5EncodedString encodeRomans:self
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3124
    ].
2742
3add77872fea care for halfWidth kanakana characters
Claus Gittinger <cg@exept.de>
parents: 2740
diff changeset
  3125
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3126
    (encodingSymbol startsWith:'gb') ifTrue:[
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3127
        ^ GBEncodedString encodeRomans:self
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3128
    ].
2742
3add77872fea care for halfWidth kanakana characters
Claus Gittinger <cg@exept.de>
parents: 2740
diff changeset
  3129
3add77872fea care for halfWidth kanakana characters
Claus Gittinger <cg@exept.de>
parents: 2740
diff changeset
  3130
    (encodingSymbol startsWith:'ksc') ifTrue:[   
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3131
        ^ KSCEncodedString encodeRomans:self
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3132
    ].
2742
3add77872fea care for halfWidth kanakana characters
Claus Gittinger <cg@exept.de>
parents: 2740
diff changeset
  3133
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3134
    (encodingSymbol startsWith:'uni') ifTrue:[
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3135
        ^ UnicodeString encodeRomans:self
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3136
    ].
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3137
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
  3138
    table := self class encoderTableFor:encodingSymbol.
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
  3139
    table isNil ifTrue:[^ self].
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
  3140
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
  3141
    newString := self class new:self size.
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
  3142
    newString replaceFrom:self translateBy:table.
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
  3143
    newString = self ifTrue:[^ self].
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
  3144
    ^ newString
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
  3145
2742
3add77872fea care for halfWidth kanakana characters
Claus Gittinger <cg@exept.de>
parents: 2740
diff changeset
  3146
    "Modified: 2.7.1997 / 15:02:56 / cg"
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3147
!
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3148
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3149
encodeInto:encodingSymbol
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3150
    "given the receiver in internal ST/X encoding, 
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3151
     convert it into the format as described by encodingSymbol
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3152
     and return a corresponding CharacterArray.
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3153
     Currently, only a few encodings are supported 
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3154
     (and those are untested/incomplete):
2736
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3155
        #x-euc-jp
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3156
        #x-jis7
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3157
        #x-sjis
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3158
        #x-shift-jis
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3159
        #big5
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3160
        #gb
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3161
        #mac
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3162
        #msdos
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3163
        #next
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3164
    "
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3165
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
  3166
    |newString table|
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3167
1204
977e83b2c166 oops - leftover prints
Claus Gittinger <cg@exept.de>
parents: 1203
diff changeset
  3168
"/  self encoding storeString print. ' -> ' print. encodingSymbol storeString printNL.
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  3169
1028
234d62b7f4fe checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1027
diff changeset
  3170
    encodingSymbol isNil ifTrue:[^ self].
234d62b7f4fe checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1027
diff changeset
  3171
2736
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3172
    encodingSymbol == #'x-jis7' ifTrue:[
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3173
        ^ JISEncodedString encodeIntoJIS7:self
1005
4507487e0be6 more on encoding/decoding (still unfinished)
Claus Gittinger <cg@exept.de>
parents: 997
diff changeset
  3174
    ].
2736
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3175
    encodingSymbol == #'x-sjis' ifTrue:[
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3176
        ^ JISEncodedString encodeIntoSJIS:self
1013
0e54ddc990cb more 16bit & encoding support
Claus Gittinger <cg@exept.de>
parents: 1012
diff changeset
  3177
    ].
2736
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3178
    encodingSymbol == #'x-shift-jis' ifTrue:[
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3179
        ^ JISEncodedString encodeIntoSJIS:self
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3180
    ].
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3181
    encodingSymbol == #'x-euc-jp' ifTrue:[
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3182
        ^ JISEncodedString encodeIntoEUC:self
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  3183
    ].
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3184
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  3185
    encodingSymbol == #big5 ifTrue:[
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3186
        ^ BIG5EncodedString encodeIntoBIG5:self
1005
4507487e0be6 more on encoding/decoding (still unfinished)
Claus Gittinger <cg@exept.de>
parents: 997
diff changeset
  3187
    ].
2736
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3188
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3189
    encodingSymbol == #gb2312 ifTrue:[
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3190
        ^ GBEncodedString encodeIntoGB:self
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3191
    ].
2736
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3192
    encodingSymbol == #'x-euc-gb' ifTrue:[
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3193
        ^ GBEncodedString encodeIntoGB:self
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3194
    ].
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3195
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3196
    encodingSymbol == #'x-euc-kr' ifTrue:[
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3197
        ^ KSCEncodedString encodeIntoKSC:self
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3198
    ].
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3199
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3200
    encodingSymbol == #ucs2 ifTrue:[
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3201
        ^ UnicodeString encodeIntoUCS2:self
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  3202
    ].
2733
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3203
    encodingSymbol == #ucs4 ifTrue:[
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3204
        ^ UnicodeString encodeIntoUCS4:self
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3205
    ].
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3206
    encodingSymbol == #utf7 ifTrue:[
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3207
        ^ UnicodeString encodeIntoUTF7:self
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3208
    ].
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3209
    encodingSymbol == #utf8 ifTrue:[
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3210
        ^ UnicodeString encodeIntoUTF8:self
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3211
    ].
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3212
    encodingSymbol == #utf16 ifTrue:[
e8734d3c9415 fixed BIG5 encoding stuff
Claus Gittinger <cg@exept.de>
parents: 2732
diff changeset
  3213
        ^ UnicodeString encodeIntoUTF16:self
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  3214
    ].
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  3215
2736
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3216
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3217
    "/ to be obsoleted ...
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3218
    encodingSymbol == #jis7 ifTrue:[
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3219
        ^ JISEncodedString encodeIntoJIS7:self
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3220
    ].
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3221
    encodingSymbol == #sjis ifTrue:[
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3222
        ^ JISEncodedString encodeIntoSJIS:self
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3223
    ].
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3224
    encodingSymbol == #euc ifTrue:[
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3225
        ^ JISEncodedString encodeIntoEUC:self
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3226
    ].
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3227
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3228
    encodingSymbol == #gb ifTrue:[
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3229
        ^ GBEncodedString encodeIntoGB:self
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3230
    ].
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3231
    encodingSymbol == #ksc ifTrue:[
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3232
        ^ KSCEncodedString encodeIntoKSC:self
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3233
    ].
840f97eb7b73 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2735
diff changeset
  3234
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
  3235
    table := self class encoderTableFor:encodingSymbol.
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
  3236
    table isNil ifTrue:[^ self].
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
  3237
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3238
    newString := self class new:self size.
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
  3239
    newString replaceFrom:self translateBy:table.
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3240
    newString = self ifTrue:[^ self].
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3241
    ^ newString
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3242
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3243
    "Created: 22.2.1996 / 15:07:31 / cg"
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
  3244
    "Modified: 1.7.1997 / 17:17:53 / cg"
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3245
!
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3246
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3247
replaceFrom:aString decode:encoding 
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3248
    "this is an experimental interface - unfinished"
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3249
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3250
    ^ self replaceFrom:aString translateBy:(self class decoderTableFor:encoding)
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3251
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3252
    "Created: 20.10.1995 / 23:00:09 / cg"
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3253
    "Modified: 22.2.1996 / 16:08:26 / cg"
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3254
!
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3255
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3256
replaceFrom:aString encode:encoding 
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3257
    "this is an experimental interface - unfinished"
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3258
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3259
    ^ self replaceFrom:aString translateBy:(self class encoderTableFor:encoding)
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3260
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3261
    "Modified: 20.10.1995 / 23:08:16 / cg"
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3262
    "Created: 22.2.1996 / 16:08:34 / cg"
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3263
!
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3264
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3265
replaceFrom:aString translateBy:encodingTable 
1202
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  3266
    "replace the receivers characters by translations coming from
200f7cd0a457 moved encode/decode stuff into class methods / added big5/gb support
Claus Gittinger <cg@exept.de>
parents: 1145
diff changeset
  3267
     an encoding table."
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3268
2520
4a7a2f2f0132 tuned #replaceFrom:translateBy:
Claus Gittinger <cg@exept.de>
parents: 2519
diff changeset
  3269
    |mySize  "{ Class: SmallInteger }" 
4a7a2f2f0132 tuned #replaceFrom:translateBy:
Claus Gittinger <cg@exept.de>
parents: 2519
diff changeset
  3270
     maxCode "{ Class: SmallInteger }"
4a7a2f2f0132 tuned #replaceFrom:translateBy:
Claus Gittinger <cg@exept.de>
parents: 2519
diff changeset
  3271
     char 
4a7a2f2f0132 tuned #replaceFrom:translateBy:
Claus Gittinger <cg@exept.de>
parents: 2519
diff changeset
  3272
     oldCode "{ Class: SmallInteger }"
4a7a2f2f0132 tuned #replaceFrom:translateBy:
Claus Gittinger <cg@exept.de>
parents: 2519
diff changeset
  3273
     newCode|
1023
5ee2a2372aca 16bit coding
Claus Gittinger <cg@exept.de>
parents: 1018
diff changeset
  3274
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3275
    self replaceFrom:1 with:aString.
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3276
    encodingTable isNil ifTrue:[^ self].
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
  3277
2520
4a7a2f2f0132 tuned #replaceFrom:translateBy:
Claus Gittinger <cg@exept.de>
parents: 2519
diff changeset
  3278
    maxCode := encodingTable size.
4a7a2f2f0132 tuned #replaceFrom:translateBy:
Claus Gittinger <cg@exept.de>
parents: 2519
diff changeset
  3279
    mySize := self size.
4a7a2f2f0132 tuned #replaceFrom:translateBy:
Claus Gittinger <cg@exept.de>
parents: 2519
diff changeset
  3280
4a7a2f2f0132 tuned #replaceFrom:translateBy:
Claus Gittinger <cg@exept.de>
parents: 2519
diff changeset
  3281
    1 to:mySize do:[:index |
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3282
        char := aString at:index.
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3283
        oldCode := char asciiValue.
2520
4a7a2f2f0132 tuned #replaceFrom:translateBy:
Claus Gittinger <cg@exept.de>
parents: 2519
diff changeset
  3284
        oldCode >= maxCode ifFalse:[
1023
5ee2a2372aca 16bit coding
Claus Gittinger <cg@exept.de>
parents: 1018
diff changeset
  3285
            newCode := encodingTable at:(oldCode + 1).
5ee2a2372aca 16bit coding
Claus Gittinger <cg@exept.de>
parents: 1018
diff changeset
  3286
            newCode ~~ oldCode ifTrue:[
5ee2a2372aca 16bit coding
Claus Gittinger <cg@exept.de>
parents: 1018
diff changeset
  3287
                self at:index put:(Character value:newCode)
5ee2a2372aca 16bit coding
Claus Gittinger <cg@exept.de>
parents: 1018
diff changeset
  3288
            ]
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3289
        ]
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3290
    ].
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3291
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3292
    "Created: 22.2.1996 / 16:07:26 / cg"
2740
8649f17f9148 added support for hp's non-iso font encodings
Claus Gittinger <cg@exept.de>
parents: 2737
diff changeset
  3293
    "Modified: 1.7.1997 / 17:18:06 / cg"
991
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3294
! !
f0b45a4b82d2 first attempt in encoding/decoding for different character representations (mac/msdos etc.)
Claus Gittinger <cg@exept.de>
parents: 986
diff changeset
  3295
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3296
!CharacterArray methodsFor:'padded copying'!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3297
815
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3298
centerPaddedTo:newSize
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3299
     "return a new string consisting of the receivers characters,
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3300
     plus spaces up to length and center the receivers characters in
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3301
     the resulting string.
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3302
     If the receivers size is equal or greater than the length argument, 
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3303
     the original receiver is returned unchanged."
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3304
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3305
     ^ self centerPaddedTo:newSize with:(Character space)
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3306
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3307
    "
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3308
     'foo' centerPaddedTo:10            
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3309
     123 printString centerPaddedTo:10 
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3310
    "
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3311
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3312
    "Created: 25.11.1995 / 10:53:57 / cg"
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3313
!
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3314
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3315
centerPaddedTo:size with:padCharacter
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3316
    "return a new string of length size, which contains the receiver
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3317
     centered (i.e. padded on both sides).
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3318
     Characters are filled with padCharacter.
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3319
     If the receivers size is equal or greater than the length argument, 
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3320
     the original receiver is returned unchanged."
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3321
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3322
    |len s|
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3323
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3324
    len := self size.
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3325
    (len < size) ifTrue:[
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3326
	s := self species new:size withAll:padCharacter.
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3327
	s replaceFrom:(size - len) // 2  + 1 with:self.
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3328
	^ s
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3329
    ]
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3330
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3331
    "
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3332
     'foo' centerPaddedTo:11 with:$.     
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3333
     'fooBar' centerPaddedTo:5 with:$.      
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3334
     123 printString centerPaddedTo:10 with:$.        
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3335
     (' ' , 123 printString) centerPaddedTo:10 with:$.        
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3336
     (Float pi printString) centerPaddedTo:15 with:(Character space)  
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3337
     (Float pi printString) centerPaddedTo:15 with:$-           
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3338
     (' ' , Float pi class name) centerPaddedTo:15 with:$.     
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3339
    "
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3340
!
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3341
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3342
decimalPaddedTo:size and:afterPeriod at:decimalCharacter
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3343
    "return a new string of overall length size, which contains the receiver
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3344
     aligned at the decimal-period column and afterPeriod characters to the right
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3345
     of the period. The periodCharacter is passed as arguments (allowing for US and European formats
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3346
     to be padded).
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3347
     If the receivers size is equal or greater than the length argument, 
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3348
     the original receiver is returned unchanged.
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3349
     (sounds complicated ? -> see examples below)."
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3350
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3351
    ^ self 
915
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  3352
	decimalPaddedTo:size 
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  3353
	and:afterPeriod 
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  3354
	at:decimalCharacter
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  3355
	withLeft:(Character space)
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  3356
	right:$0
815
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3357
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3358
    "                                                    
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3359
     '123' decimalPaddedTo:10 and:3 at:$.      -> '   123    '  
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3360
     '123' decimalPaddedTo:10 and:3 at:$.      -> '   123.000'  
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3361
     '123.' decimalPaddedTo:10 and:3 at:$.     -> '   123.000' 
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3362
     '123.1' decimalPaddedTo:10 and:3 at:$.    -> '   123.100'
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3363
     '123.1' decimalPaddedTo:10 and:3 at:$.    -> '   123.1  ' 
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3364
     '123.123' decimalPaddedTo:10 and:3 at:$.  -> '   123.123'
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3365
    "
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3366
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3367
    "Created: 23.12.1995 / 13:11:52 / cg"
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3368
!
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3369
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3370
decimalPaddedTo:size and:afterPeriod at:decimalCharacter withLeft:leftPadChar right:rightPadChar
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3371
    "return a new string of overall length size, which contains the receiver
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3372
     aligned at the decimal-period column and afterPeriod characters to the right
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3373
     of the period.
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3374
     Characters on the left are filled with leftPadChar.
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3375
     If rightPadChar is nil, characters on the right are filled with leftPadCharacter too;
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3376
     otherwise, if missing, a decimal point is added and right characters filled with this.
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3377
     If the receivers size is equal or greater than the length argument, 
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3378
     the original receiver is returned unchanged.
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3379
     (sounds complicated ? -> see examples below)."
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3380
986
c3a9f590146d removed unused locals
Claus Gittinger <cg@exept.de>
parents: 915
diff changeset
  3381
    |s idx n|
815
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3382
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3383
    idx := self indexOf:decimalCharacter.
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3384
    idx == 0 ifTrue:[
915
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  3385
	"/
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  3386
	"/ no decimal point found; adjust string to the left of the period column
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  3387
	"/
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  3388
	rightPadChar isNil ifTrue:[
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  3389
	    s := self , (self species new:afterPeriod + 1 withAll:leftPadChar)
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  3390
	] ifFalse:[
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  3391
	    s:= self , decimalCharacter asString , (self species new:afterPeriod withAll:rightPadChar).
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  3392
	].
815
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3393
    ] ifFalse:[
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3394
915
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  3395
	"/ the number of after-decimalPoint characters
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  3396
	n := self size - idx.
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  3397
	rightPadChar isNil ifTrue:[
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  3398
	    s := self , (self species new:afterPeriod - n withAll:leftPadChar).
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  3399
	] ifFalse:[
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  3400
	    s := self , (self species new:afterPeriod - n withAll:rightPadChar).
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  3401
	].
815
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3402
    ].
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3403
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3404
    ^ s leftPaddedTo:size with:leftPadChar
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3405
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3406
    "                                                    
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3407
     '123' decimalPaddedTo:10 and:3 at:$. withLeft:(Character space) right:nil     -> '   123    '  
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3408
     '123' decimalPaddedTo:10 and:3 at:$. withLeft:(Character space) right:$0      -> '   123.000'  
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3409
     '123.' decimalPaddedTo:10 and:3 at:$. withLeft:(Character space) right:$0     -> '   123.000' 
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3410
     '123.1' decimalPaddedTo:10 and:3 at:$. withLeft:(Character space) right:$0    -> '   123.100'
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3411
     '123.1' decimalPaddedTo:10 and:3 at:$. withLeft:(Character space) right:nil   -> '   123.1  ' 
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3412
     '123.123' decimalPaddedTo:10 and:3 at:$. withLeft:(Character space) right:$0  -> '   123.123'
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3413
    "
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3414
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3415
    "Modified: 23.12.1995 / 13:08:18 / cg"
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3416
!
997a02c135e7 support for decimal padding added
Claus Gittinger <cg@exept.de>
parents: 759
diff changeset
  3417
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3418
leftPaddedTo:size
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3419
    "return a new string of length size, which contains the receiver
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3420
     right-adjusted (i.e. padded on the left).
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3421
     Characters on the left are filled with spaces.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3422
     If the receivers size is equal or greater than the length argument, 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3423
     the original receiver is returned unchanged."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3424
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3425
    ^ self leftPaddedTo:size with:(Character space)
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3426
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3427
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3428
     'foo' leftPaddedTo:10  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3429
     'fooBar' leftPaddedTo:5      
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3430
     123 printString leftPaddedTo:10        
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3431
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3432
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3433
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3434
leftPaddedTo:size with:padCharacter
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3435
    "return a new string of length size, which contains the receiver
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3436
     right-adjusted (i.e. padded on the left).
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3437
     Characters on the left are filled with padCharacter.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3438
     If the receivers size is equal or greater than the length argument, 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3439
     the original receiver is returned unchanged."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3440
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3441
    |len s|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3442
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3443
    len := self size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3444
    (len < size) ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3445
	s := self species new:size withAll:padCharacter.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3446
	s replaceFrom:(size - len + 1) with:self.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3447
	^ s
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3448
    ]
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3449
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3450
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3451
     'foo' leftPaddedTo:10 with:$.      
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3452
     'fooBar' leftPaddedTo:5 with:$.      
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3453
     123 printString leftPaddedTo:10 with:$.        
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3454
     (' ' , 123 printString) leftPaddedTo:10 with:$.        
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3455
     (Float pi printString) leftPaddedTo:15 with:(Character space)  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3456
     (Float pi printString) leftPaddedTo:15 with:$-           
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3457
     (' ' , Float pi class name) leftPaddedTo:15 with:$.     
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3458
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3459
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3460
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3461
paddedTo:newSize
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3462
     "return a new string consisting of the receivers characters,
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3463
     plus spaces up to length.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3464
     If the receivers size is equal or greater than the length argument, 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3465
     the original receiver is returned unchanged."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3466
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3467
     ^ self paddedTo:newSize with:(Character space)
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3468
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3469
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3470
     'foo' paddedTo:10            
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3471
     123 printString paddedTo:10 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3472
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3473
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3474
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3475
paddedTo:newSize with:padCharacter
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3476
    "return a new string consisting of the receivers characters,
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3477
     plus pad characters up to length.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3478
     If the receivers size is equal or greater than the length argument, 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3479
     the  original receiver is returned unchanged."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3480
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3481
    |s len|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3482
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3483
    len := self size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3484
    len < newSize ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3485
	s := self species new:newSize withAll:padCharacter.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3486
	s replaceFrom:1 to:len with:self.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3487
	^ s
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3488
    ]
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3489
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3490
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3491
     'foo' paddedTo:10 with:$.             
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3492
     123 printString paddedTo:10 with:$*   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3493
     (Float pi printString) paddedTo:15 with:(Character space)  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3494
     (Float pi printString) paddedTo:15 with:$-  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3495
     (Float pi class name , ' ') paddedTo:15 with:$.  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3496
    "
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  3497
! !
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  3498
328
claus
parents: 327
diff changeset
  3499
!CharacterArray methodsFor:'pattern matching'!
claus
parents: 327
diff changeset
  3500
2551
46c18418fb05 added compoundMatch
Claus Gittinger <cg@exept.de>
parents: 2537
diff changeset
  3501
compoundMatch:aString
46c18418fb05 added compoundMatch
Claus Gittinger <cg@exept.de>
parents: 2537
diff changeset
  3502
    "like match, but the receiver may be a compount match pattern,
46c18418fb05 added compoundMatch
Claus Gittinger <cg@exept.de>
parents: 2537
diff changeset
  3503
     consisting of multiple simple patterns, separated by semicolons.
46c18418fb05 added compoundMatch
Claus Gittinger <cg@exept.de>
parents: 2537
diff changeset
  3504
     This is usable with fileName pattern fields."
46c18418fb05 added compoundMatch
Claus Gittinger <cg@exept.de>
parents: 2537
diff changeset
  3505
46c18418fb05 added compoundMatch
Claus Gittinger <cg@exept.de>
parents: 2537
diff changeset
  3506
    |matchers|
46c18418fb05 added compoundMatch
Claus Gittinger <cg@exept.de>
parents: 2537
diff changeset
  3507
46c18418fb05 added compoundMatch
Claus Gittinger <cg@exept.de>
parents: 2537
diff changeset
  3508
    matchers := self asCollectionOfSubstringsSeparatedBy:$;.
46c18418fb05 added compoundMatch
Claus Gittinger <cg@exept.de>
parents: 2537
diff changeset
  3509
    matchers do:[:aPattern |
46c18418fb05 added compoundMatch
Claus Gittinger <cg@exept.de>
parents: 2537
diff changeset
  3510
        (aPattern match:aString) ifTrue:[^ true].
46c18418fb05 added compoundMatch
Claus Gittinger <cg@exept.de>
parents: 2537
diff changeset
  3511
    ].
46c18418fb05 added compoundMatch
Claus Gittinger <cg@exept.de>
parents: 2537
diff changeset
  3512
    ^ false.
46c18418fb05 added compoundMatch
Claus Gittinger <cg@exept.de>
parents: 2537
diff changeset
  3513
46c18418fb05 added compoundMatch
Claus Gittinger <cg@exept.de>
parents: 2537
diff changeset
  3514
    "
46c18418fb05 added compoundMatch
Claus Gittinger <cg@exept.de>
parents: 2537
diff changeset
  3515
     'f*' match:'foo'       
46c18418fb05 added compoundMatch
Claus Gittinger <cg@exept.de>
parents: 2537
diff changeset
  3516
     'b*' match:'foo'       
46c18418fb05 added compoundMatch
Claus Gittinger <cg@exept.de>
parents: 2537
diff changeset
  3517
     'f*;b*' match:'foo'    
46c18418fb05 added compoundMatch
Claus Gittinger <cg@exept.de>
parents: 2537
diff changeset
  3518
     'f*;b*' match:'bar'    
46c18418fb05 added compoundMatch
Claus Gittinger <cg@exept.de>
parents: 2537
diff changeset
  3519
     'f*;b*' compoundMatch:'foo' 
46c18418fb05 added compoundMatch
Claus Gittinger <cg@exept.de>
parents: 2537
diff changeset
  3520
     'f*;b*' compoundMatch:'bar' 
46c18418fb05 added compoundMatch
Claus Gittinger <cg@exept.de>
parents: 2537
diff changeset
  3521
    "
46c18418fb05 added compoundMatch
Claus Gittinger <cg@exept.de>
parents: 2537
diff changeset
  3522
46c18418fb05 added compoundMatch
Claus Gittinger <cg@exept.de>
parents: 2537
diff changeset
  3523
    "Modified: 15.4.1997 / 15:50:33 / cg"
46c18418fb05 added compoundMatch
Claus Gittinger <cg@exept.de>
parents: 2537
diff changeset
  3524
!
46c18418fb05 added compoundMatch
Claus Gittinger <cg@exept.de>
parents: 2537
diff changeset
  3525
328
claus
parents: 327
diff changeset
  3526
findMatchString:matchString
368
a3c21a89ec37 *** empty log message ***
claus
parents: 360
diff changeset
  3527
    "like findString/indexOfSubCollection, but allowing match patterns.
328
claus
parents: 327
diff changeset
  3528
     find matchstring; if found, return the index;
claus
parents: 327
diff changeset
  3529
     if not found, return 0."
claus
parents: 327
diff changeset
  3530
1145
a094d90e11bf dont use [0] blocks - use 0 constant instead
Claus Gittinger <cg@exept.de>
parents: 1133
diff changeset
  3531
    ^ self findMatchString:matchString startingAt:1 ignoreCase:false ifAbsent:0 
328
claus
parents: 327
diff changeset
  3532
!
claus
parents: 327
diff changeset
  3533
claus
parents: 327
diff changeset
  3534
findMatchString:matchString startingAt:index
claus
parents: 327
diff changeset
  3535
    "like findString, but allowing match patterns.
claus
parents: 327
diff changeset
  3536
     find matchstring, starting at index. if found, return the index;
claus
parents: 327
diff changeset
  3537
     if not found, return 0."
claus
parents: 327
diff changeset
  3538
1145
a094d90e11bf dont use [0] blocks - use 0 constant instead
Claus Gittinger <cg@exept.de>
parents: 1133
diff changeset
  3539
    ^ self findMatchString:matchString startingAt:index ignoreCase:false ifAbsent:0 
328
claus
parents: 327
diff changeset
  3540
!
claus
parents: 327
diff changeset
  3541
claus
parents: 327
diff changeset
  3542
findMatchString:matchString startingAt:index ignoreCase:ignoreCase ifAbsent:exceptionBlock
claus
parents: 327
diff changeset
  3543
    "like findString, but allowing match patterns.
claus
parents: 327
diff changeset
  3544
     find matchstring, starting at index. if found, return the index;
claus
parents: 327
diff changeset
  3545
     if not found, return the result of evaluating exceptionBlock.
claus
parents: 327
diff changeset
  3546
     This is a q&d hack - not very efficient"
claus
parents: 327
diff changeset
  3547
claus
parents: 327
diff changeset
  3548
    |firstChar 
claus
parents: 327
diff changeset
  3549
     startIndex "{ Class: SmallInteger }"
claus
parents: 327
diff changeset
  3550
     matchSize  "{ Class: SmallInteger }"
claus
parents: 327
diff changeset
  3551
     mySize     "{ Class: SmallInteger }"
claus
parents: 327
diff changeset
  3552
     realMatchString|
claus
parents: 327
diff changeset
  3553
claus
parents: 327
diff changeset
  3554
    matchSize := matchString size.
claus
parents: 327
diff changeset
  3555
    matchSize == 0 ifTrue:[^ index]. "empty string matches"
claus
parents: 327
diff changeset
  3556
claus
parents: 327
diff changeset
  3557
    realMatchString := matchString.
claus
parents: 327
diff changeset
  3558
    (realMatchString endsWith:$*) ifFalse:[
2519
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3559
        realMatchString := realMatchString , '*'.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3560
        matchSize := matchSize + 1
328
claus
parents: 327
diff changeset
  3561
    ].
claus
parents: 327
diff changeset
  3562
claus
parents: 327
diff changeset
  3563
    mySize := self size.
claus
parents: 327
diff changeset
  3564
    firstChar := realMatchString at:1.
2519
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3565
    firstChar == $\ ifTrue:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3566
        firstChar := realMatchString at:2.
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3567
    ].
328
claus
parents: 327
diff changeset
  3568
claus
parents: 327
diff changeset
  3569
    firstChar asString includesMatchCharacters ifTrue:[
2519
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3570
        index to:mySize do:[:col |
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3571
            (realMatchString match:self from:col to:mySize ignoreCase:ignoreCase)
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3572
            ifTrue:[^ col]
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3573
        ].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3574
        ^ exceptionBlock value.
328
claus
parents: 327
diff changeset
  3575
    ].
claus
parents: 327
diff changeset
  3576
    startIndex := self indexOf:firstChar startingAt:index.
claus
parents: 327
diff changeset
  3577
    [startIndex == 0] whileFalse:[
2519
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3578
        (realMatchString match:self from:startIndex to:mySize ignoreCase:ignoreCase)
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3579
        ifTrue:[^ startIndex].
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3580
        startIndex := self indexOf:firstChar startingAt:(startIndex + 1)
328
claus
parents: 327
diff changeset
  3581
    ].
claus
parents: 327
diff changeset
  3582
    ^ exceptionBlock value
claus
parents: 327
diff changeset
  3583
claus
parents: 327
diff changeset
  3584
    "
claus
parents: 327
diff changeset
  3585
     'one two three four' findMatchString:'o[nu]'
claus
parents: 327
diff changeset
  3586
     'one two three four' findMatchString:'o[nu]' startingAt:3
claus
parents: 327
diff changeset
  3587
    "
2519
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3588
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3589
    "Modified: 2.4.1997 / 17:26:39 / cg"
328
claus
parents: 327
diff changeset
  3590
!
claus
parents: 327
diff changeset
  3591
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3592
includesMatchString:matchString
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3593
    "like includesString, but allowing match patterns.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3594
     find matchstring; if found, return true, otherwise return false"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3595
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3596
    ^ (self findMatchString:matchString) ~~ 0
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3597
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3598
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3599
     'hello world' includesMatchString:'h*'
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3600
     'hello world' includesMatchString:'h[aeiou]llo' 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3601
     'hello world' includesMatchString:'wor*'     
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3602
     'hello world' includesMatchString:'woR*'     
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3603
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3604
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3605
328
claus
parents: 327
diff changeset
  3606
match:aString
claus
parents: 327
diff changeset
  3607
    "return true if aString matches self, where self may contain meta-match 
claus
parents: 327
diff changeset
  3608
     characters $* (to match any string) or $# (to match any character).
claus
parents: 327
diff changeset
  3609
     or [...] to match a set of characters.
345
claus
parents: 343
diff changeset
  3610
     Lower/uppercase are considered different.
claus
parents: 343
diff changeset
  3611
     NOTICE: match-meta character interpretation is like in unix-matching, 
2519
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3612
             NOT the ST-80 meaning."
328
claus
parents: 327
diff changeset
  3613
claus
parents: 327
diff changeset
  3614
    ^ self match:aString from:1 to:aString size ignoreCase:false
claus
parents: 327
diff changeset
  3615
claus
parents: 327
diff changeset
  3616
    "
2519
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3617
     '\*f*' match:'f' 
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3618
     '\*f*' match:'*f' 
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3619
     '*\*f*' match:'*f' 
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3620
     '*f*' match:'*f' 
328
claus
parents: 327
diff changeset
  3621
     '*ute*' match:'computer' 
claus
parents: 327
diff changeset
  3622
     '*uter' match:'computer' 
claus
parents: 327
diff changeset
  3623
     'uter*' match:'computer' 
claus
parents: 327
diff changeset
  3624
     '*ute*' match:'' 
claus
parents: 327
diff changeset
  3625
     '[abcd]*' match:'computer' 
claus
parents: 327
diff changeset
  3626
     '[abcd]*' match:'komputer' 
claus
parents: 327
diff changeset
  3627
     '*some*compl*ern*' match:'this is some more complicated pattern match' 
claus
parents: 327
diff changeset
  3628
     '*some*compl*ern*' match:'this is another complicated pattern match' 
claus
parents: 327
diff changeset
  3629
    "
2519
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3630
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3631
    "Modified: 1.4.1997 / 18:38:02 / cg"
328
claus
parents: 327
diff changeset
  3632
!
claus
parents: 327
diff changeset
  3633
claus
parents: 327
diff changeset
  3634
match:aString from:start to:stop ignoreCase:ignoreCase
claus
parents: 327
diff changeset
  3635
    "return true if part of aString matches myself, 
claus
parents: 327
diff changeset
  3636
     where self may contain meta-match 
claus
parents: 327
diff changeset
  3637
     characters $* (to match any string) or $# (to match any character)
claus
parents: 327
diff changeset
  3638
     or [...] to match a set of characters.
345
claus
parents: 343
diff changeset
  3639
     If ignoreCase is true, lower/uppercase are considered the same.
claus
parents: 343
diff changeset
  3640
     NOTICE: match-meta character interpretation is like in unix-matching, 
1429
5729275971ae printNL -> printCR
Claus Gittinger <cg@exept.de>
parents: 1423
diff changeset
  3641
             NOT the ST-80 meaning."
328
claus
parents: 327
diff changeset
  3642
claus
parents: 327
diff changeset
  3643
    |matchScanArray|
claus
parents: 327
diff changeset
  3644
claus
parents: 327
diff changeset
  3645
    "
claus
parents: 327
diff changeset
  3646
     keep the matchScanArray from the most recent match -
claus
parents: 327
diff changeset
  3647
     avoids parsing the pattern over-and over if multiple searches
claus
parents: 327
diff changeset
  3648
     are done with the same pattern.
claus
parents: 327
diff changeset
  3649
    "
claus
parents: 327
diff changeset
  3650
    (PreviousMatch notNil
claus
parents: 327
diff changeset
  3651
    and:[PreviousMatch key = self]) ifTrue:[
1429
5729275971ae printNL -> printCR
Claus Gittinger <cg@exept.de>
parents: 1423
diff changeset
  3652
        matchScanArray := PreviousMatch value
328
claus
parents: 327
diff changeset
  3653
    ] ifFalse:[
1429
5729275971ae printNL -> printCR
Claus Gittinger <cg@exept.de>
parents: 1423
diff changeset
  3654
        matchScanArray := self class matchScanArrayFrom:self.
5729275971ae printNL -> printCR
Claus Gittinger <cg@exept.de>
parents: 1423
diff changeset
  3655
        matchScanArray isNil ifTrue:[
2124
6238280f6120 new infoMessage scheme
Claus Gittinger <cg@exept.de>
parents: 2051
diff changeset
  3656
            'CharacterArray [info]: invalid matchpattern:' infoPrint. self infoPrintCR.
1429
5729275971ae printNL -> printCR
Claus Gittinger <cg@exept.de>
parents: 1423
diff changeset
  3657
            ^ false
5729275971ae printNL -> printCR
Claus Gittinger <cg@exept.de>
parents: 1423
diff changeset
  3658
        ].
5729275971ae printNL -> printCR
Claus Gittinger <cg@exept.de>
parents: 1423
diff changeset
  3659
        PreviousMatch := self -> matchScanArray.
328
claus
parents: 327
diff changeset
  3660
    ].
claus
parents: 327
diff changeset
  3661
claus
parents: 327
diff changeset
  3662
    ^ self class
1429
5729275971ae printNL -> printCR
Claus Gittinger <cg@exept.de>
parents: 1423
diff changeset
  3663
        matchScan:matchScanArray 
5729275971ae printNL -> printCR
Claus Gittinger <cg@exept.de>
parents: 1423
diff changeset
  3664
        from:1 to:matchScanArray size
5729275971ae printNL -> printCR
Claus Gittinger <cg@exept.de>
parents: 1423
diff changeset
  3665
        with:aString 
5729275971ae printNL -> printCR
Claus Gittinger <cg@exept.de>
parents: 1423
diff changeset
  3666
        from:start to:stop 
5729275971ae printNL -> printCR
Claus Gittinger <cg@exept.de>
parents: 1423
diff changeset
  3667
        ignoreCase:ignoreCase
328
claus
parents: 327
diff changeset
  3668
claus
parents: 327
diff changeset
  3669
    "
claus
parents: 327
diff changeset
  3670
     '*ute*' match:'12345COMPUTER' from:1 to:5 ignoreCase:true 
claus
parents: 327
diff changeset
  3671
     '*ute*' match:'12345COMPUTER' from:6 to:13 ignoreCase:true  
claus
parents: 327
diff changeset
  3672
    "
1429
5729275971ae printNL -> printCR
Claus Gittinger <cg@exept.de>
parents: 1423
diff changeset
  3673
2124
6238280f6120 new infoMessage scheme
Claus Gittinger <cg@exept.de>
parents: 2051
diff changeset
  3674
    "Modified: 10.1.1997 / 15:36:02 / cg"
359
claus
parents: 357
diff changeset
  3675
!
claus
parents: 357
diff changeset
  3676
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3677
match:aString ignoreCase:ignoreCase
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3678
    "return true if aString matches self, where self may contain meta-match 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3679
     characters $* (to match any string) or $# (to match any character)
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3680
     or [...] to match a set of characters.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3681
     If ignoreCase is true, lower/uppercase are considered the same.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3682
     NOTICE: match-meta character interpretation is like in unix-matching, 
2519
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3683
             NOT the ST-80 meaning."
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3684
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3685
    ^ self match:aString from:1 to:aString size ignoreCase:ignoreCase
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3686
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3687
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3688
     '*ute*' match:'COMPUTER' ignoreCase:true  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3689
     '*uter' match:'COMPUTER' ignoreCase:false 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3690
     '[abcd]*' match:'computer' ignoreCase:false 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3691
     '[abcd]*' match:'Computer' ignoreCase:false 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3692
     '[a-k]*' match:'komputer' ignoreCase:false   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3693
     '[a-k]*' match:'zomputer' ignoreCase:false    
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3694
     '[a-k]*' match:'Komputer' ignoreCase:false    
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3695
     '[a-k]*' match:'Komputer' ignoreCase:true     
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3696
     '*some*compl*ern*' match:'this is some more complicated pattern match' ignoreCase:true 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3697
     '*some*compl*ern*' match:'this is another complicated pattern match' ignoreCase:true 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3698
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3699
     Time millisecondsToRun:[
2519
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3700
        Symbol allInstancesDo:[:sym |
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3701
            '[ab]*' match:sym ignoreCase:false
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3702
        ]
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3703
     ]. 
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3704
     Time millisecondsToRun:[
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3705
        Symbol allInstancesDo:[:sym |
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3706
            '*at:*' match:sym ignoreCase:false
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3707
        ]
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3708
     ]. 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3709
    "
2519
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3710
07674cea76dd allow escaping a * when doing a match
Claus Gittinger <cg@exept.de>
parents: 2452
diff changeset
  3711
    "Modified: 2.4.1997 / 17:28:58 / cg"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3712
! !
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3713
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3714
!CharacterArray methodsFor:'printing & storing'!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3715
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3716
article
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3717
    "return an article string for the receiver."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3718
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3719
    |firstChar|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3720
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3721
    firstChar := (self at:1) asLowercase. 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3722
    (firstChar isVowel or:[firstChar == $x]) ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3723
	firstChar ~~ $u ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3724
	     ^ 'an'
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3725
	]
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3726
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3727
    ^ 'a'
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3728
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3729
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3730
displayString
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3731
    "return a string to display the receiver - use storeString to have
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3732
     quotes around."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3733
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3734
    ^ self storeString
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3735
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3736
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3737
printOn:aStream
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3738
    "print the receiver on aStream"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3739
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3740
    aStream nextPutAll:self
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3741
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3742
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3743
printString
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3744
    "return a string for printing - thats myself"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3745
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3746
    ^ self
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3747
! !
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3748
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3749
!CharacterArray methodsFor:'queries'!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3750
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3751
encoding
1311
367f740d21e7 commentary
Claus Gittinger <cg@exept.de>
parents: 1308
diff changeset
  3752
    "return the strings encoding, as a symbol.
367f740d21e7 commentary
Claus Gittinger <cg@exept.de>
parents: 1308
diff changeset
  3753
     Here, by default, we assume #iso8859 encoding.
1241
431675180b54 commentary
Claus Gittinger <cg@exept.de>
parents: 1235
diff changeset
  3754
     This has to be redefined in XXXEncodedString subclasses."
1012
19fd2f72269f keep track of encoding of 16bit strings
Claus Gittinger <cg@exept.de>
parents: 1008
diff changeset
  3755
1311
367f740d21e7 commentary
Claus Gittinger <cg@exept.de>
parents: 1308
diff changeset
  3756
    ^ #iso8859
367f740d21e7 commentary
Claus Gittinger <cg@exept.de>
parents: 1308
diff changeset
  3757
367f740d21e7 commentary
Claus Gittinger <cg@exept.de>
parents: 1308
diff changeset
  3758
    "Modified: 27.4.1996 / 13:23:49 / cg"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3759
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3760
1375
f035bb7468f5 additions for new Text class
Claus Gittinger <cg@exept.de>
parents: 1333
diff changeset
  3761
hasChangeOfEmphasis
f035bb7468f5 additions for new Text class
Claus Gittinger <cg@exept.de>
parents: 1333
diff changeset
  3762
    ^ false
f035bb7468f5 additions for new Text class
Claus Gittinger <cg@exept.de>
parents: 1333
diff changeset
  3763
1381
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  3764
    "Created: 12.5.1996 / 12:31:39 / cg"
1375
f035bb7468f5 additions for new Text class
Claus Gittinger <cg@exept.de>
parents: 1333
diff changeset
  3765
!
f035bb7468f5 additions for new Text class
Claus Gittinger <cg@exept.de>
parents: 1333
diff changeset
  3766
1388
e47b4b894530 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1381
diff changeset
  3767
heightOn:aGC
e47b4b894530 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1381
diff changeset
  3768
    "return the size of the recevier in device units if displayed on aGC"
e47b4b894530 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1381
diff changeset
  3769
e47b4b894530 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1381
diff changeset
  3770
    ^ aGC font heightOf:self
e47b4b894530 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1381
diff changeset
  3771
e47b4b894530 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1381
diff changeset
  3772
    "
e47b4b894530 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1381
diff changeset
  3773
     'hello world' heightOn:(View new)                 
e47b4b894530 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1381
diff changeset
  3774
    "
e47b4b894530 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1381
diff changeset
  3775
e47b4b894530 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1381
diff changeset
  3776
    "Created: 12.5.1996 / 20:09:29 / cg"
e47b4b894530 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1381
diff changeset
  3777
    "Modified: 12.5.1996 / 20:32:05 / cg"
e47b4b894530 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1381
diff changeset
  3778
!
e47b4b894530 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1381
diff changeset
  3779
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3780
isString
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3781
    "return true, if the receiver is some kind of string;
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3782
     true is returned here - redefinition of Object>>isString."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3783
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3784
    ^ true
1235
2186eb8a2846 added #leftIndent
Claus Gittinger <cg@exept.de>
parents: 1204
diff changeset
  3785
!
2186eb8a2846 added #leftIndent
Claus Gittinger <cg@exept.de>
parents: 1204
diff changeset
  3786
2186eb8a2846 added #leftIndent
Claus Gittinger <cg@exept.de>
parents: 1204
diff changeset
  3787
leftIndent
2186eb8a2846 added #leftIndent
Claus Gittinger <cg@exept.de>
parents: 1204
diff changeset
  3788
    "if the receiver starts with spaces, return the number of spaces
2186eb8a2846 added #leftIndent
Claus Gittinger <cg@exept.de>
parents: 1204
diff changeset
  3789
     at the left - otherwise, return 0.
2186eb8a2846 added #leftIndent
Claus Gittinger <cg@exept.de>
parents: 1204
diff changeset
  3790
     If the receiver consists of spaces only, return the receivers size."
2186eb8a2846 added #leftIndent
Claus Gittinger <cg@exept.de>
parents: 1204
diff changeset
  3791
2186eb8a2846 added #leftIndent
Claus Gittinger <cg@exept.de>
parents: 1204
diff changeset
  3792
    |index "{Class: SmallInteger }"
2186eb8a2846 added #leftIndent
Claus Gittinger <cg@exept.de>
parents: 1204
diff changeset
  3793
     end   "{Class: SmallInteger }"|
2186eb8a2846 added #leftIndent
Claus Gittinger <cg@exept.de>
parents: 1204
diff changeset
  3794
2186eb8a2846 added #leftIndent
Claus Gittinger <cg@exept.de>
parents: 1204
diff changeset
  3795
    index := 1.
2186eb8a2846 added #leftIndent
Claus Gittinger <cg@exept.de>
parents: 1204
diff changeset
  3796
    end := self size.
2186eb8a2846 added #leftIndent
Claus Gittinger <cg@exept.de>
parents: 1204
diff changeset
  3797
    [index <= end] whileTrue:[
2186eb8a2846 added #leftIndent
Claus Gittinger <cg@exept.de>
parents: 1204
diff changeset
  3798
        (self at:index) isSeparator ifFalse:[^ index - 1].
2186eb8a2846 added #leftIndent
Claus Gittinger <cg@exept.de>
parents: 1204
diff changeset
  3799
        index := index + 1
2186eb8a2846 added #leftIndent
Claus Gittinger <cg@exept.de>
parents: 1204
diff changeset
  3800
    ].
2186eb8a2846 added #leftIndent
Claus Gittinger <cg@exept.de>
parents: 1204
diff changeset
  3801
    ^ end
2186eb8a2846 added #leftIndent
Claus Gittinger <cg@exept.de>
parents: 1204
diff changeset
  3802
2186eb8a2846 added #leftIndent
Claus Gittinger <cg@exept.de>
parents: 1204
diff changeset
  3803
    "
2186eb8a2846 added #leftIndent
Claus Gittinger <cg@exept.de>
parents: 1204
diff changeset
  3804
     '    hello' leftIndent 
2186eb8a2846 added #leftIndent
Claus Gittinger <cg@exept.de>
parents: 1204
diff changeset
  3805
     'foo      ' leftIndent 
2186eb8a2846 added #leftIndent
Claus Gittinger <cg@exept.de>
parents: 1204
diff changeset
  3806
     '         ' leftIndent 
2186eb8a2846 added #leftIndent
Claus Gittinger <cg@exept.de>
parents: 1204
diff changeset
  3807
    "
2186eb8a2846 added #leftIndent
Claus Gittinger <cg@exept.de>
parents: 1204
diff changeset
  3808
2186eb8a2846 added #leftIndent
Claus Gittinger <cg@exept.de>
parents: 1204
diff changeset
  3809
    "Modified: 20.4.1996 / 19:28:43 / cg"
1388
e47b4b894530 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1381
diff changeset
  3810
!
e47b4b894530 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1381
diff changeset
  3811
e47b4b894530 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1381
diff changeset
  3812
widthOn:aGC
e47b4b894530 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1381
diff changeset
  3813
    "return ths size of the recevier in device units if displayed on aGC"
e47b4b894530 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1381
diff changeset
  3814
2554
309ee6707322 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2551
diff changeset
  3815
    ^ (aGC font on:aGC device) widthOf:self
1388
e47b4b894530 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1381
diff changeset
  3816
e47b4b894530 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1381
diff changeset
  3817
    "
e47b4b894530 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1381
diff changeset
  3818
     'hello world' widthOn:(View new)                 
e47b4b894530 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1381
diff changeset
  3819
    "
e47b4b894530 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1381
diff changeset
  3820
e47b4b894530 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1381
diff changeset
  3821
    "Created: 12.5.1996 / 20:09:29 / cg"
2554
309ee6707322 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2551
diff changeset
  3822
    "Modified: 17.4.1997 / 12:50:23 / cg"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3823
! !
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3824
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3825
!CharacterArray methodsFor:'special string converting'!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3826
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3827
chopTo:maxLen
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3828
    "if the receivers size is less or equal to maxLen, return it.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3829
     Otherwise, return a copy of the receiver, where some characters 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3830
     in the middle have been removed for a total string length
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3831
     of maxLen."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3832
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3833
    |sz n1 n2|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3834
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3835
    (sz := self size) > maxLen ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3836
	n1 := n2 := maxLen // 2.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3837
	maxLen odd ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3838
	    n2 := n1 + 1
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3839
	].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3840
	^ (self copyFrom:1 to:n1) , (self copyFrom:sz - n2 + 1)
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3841
    ]
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3842
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3843
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3844
     '12345678901234'   chopTo:15            
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3845
     '123456789012345'  chopTo:15         
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3846
     '1234567890123456' chopTo:15      
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3847
     'aShortString' chopTo:15 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3848
     'aVeryLongNameForAStringThatShouldBeShortened' chopTo:15 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3849
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3850
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3851
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3852
contractAtBeginningTo:maxLen
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3853
    "if the receivers size is less or equal to maxLen, return it.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3854
     Otherwise, return a copy of the receiver, where some characters 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3855
     at the beginning have been replaced by '...' for a total string length
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3856
     of maxLen. Can be used to abbreviate long entries in tables."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3857
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3858
    |sz|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3859
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3860
    (sz := self size) > maxLen ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3861
	^ '...' , (self copyFrom:(sz - (maxLen - 4))) 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3862
    ]
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3863
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3864
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3865
     '12345678901234' contractAtBeginningTo:15          
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3866
     '123456789012345' contractAtBeginningTo:15          
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3867
     '1234567890123456' contractAtBeginningTo:15          
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3868
     'aShortString' contractAtBeginningTo:15          
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3869
     'aVeryLongNameForAStringThatShouldBeShortened' contractAtBeginningTo:15
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3870
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3871
!
359
claus
parents: 357
diff changeset
  3872
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3873
contractAtEndTo:maxLen
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3874
    "if the receivers size is less or equal to maxLen, return it.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3875
     Otherwise, return a copy of the receiver, where some characters 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3876
     at the end have been replaced by '...' for a total string length
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3877
     of maxLen. Can be used to abbreviate long entries in tables."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3878
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3879
    |sz|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3880
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3881
    (sz := self size) > maxLen ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3882
	^ self copyReplaceFrom:maxLen - 3
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3883
			    with:'...'
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3884
    ]
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3885
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3886
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3887
     '12345678901234' contractAtEndTo:15          
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3888
     '123456789012345' contractAtEndTo:15          
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3889
     '1234567890123456' contractAtEndTo:15          
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3890
     'aShortString' contractAtEndTo:15          
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3891
     'aVeryLongNameForAStringThatShouldBeShortened' contractAtEndTo:15 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3892
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3893
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3894
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3895
contractTo:maxLen
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3896
    "if the receivers size is less or equal to maxLen, return it.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3897
     Otherwise, return a copy of the receiver, where some characters 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3898
     in the middle have been replaced by '...' for a total string length
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3899
     of maxLen. Can be used to abbreviate long entries in tables."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3900
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3901
    |sz "{ SmallInteger }"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3902
     halfSize "{ SmallInteger }"|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3903
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3904
    (sz := self size) > maxLen ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3905
	halfSize := maxLen // 2.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3906
	^ self copyReplaceFrom:halfSize - 1
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3907
			    to:sz - maxLen + halfSize + 1
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3908
			    with:'...'
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3909
    ]
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3910
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3911
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3912
     '12345678901234' contractTo:15          
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3913
     '123456789012345' contractTo:15          
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3914
     '1234567890123456' contractTo:15        
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3915
     'aShortString' contractTo:15 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3916
     'aVeryLongNameForAStringThatShouldBeShortened' contractTo:15 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3917
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3918
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3919
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3920
expandPlaceholdersWith:argArray
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3921
    "return a copy of the receiver, where all %i escapes are
1308
cf41289a7ce2 commentary
Claus Gittinger <cg@exept.de>
parents: 1290
diff changeset
  3922
     replaced by corresponding arguments' printStrings from the argArray.
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3923
     I.e. 'hello %1; how is %2' expandPlaceholdersWith:#('world' 'this') results
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3924
     in the new string 'hello world; how is this'.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3925
     To get a '%' character, use a '%%'-escape.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3926
     See also bindWith:... for VisualAge compatibility."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3927
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3928
    |expandedString next 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3929
     idx   "{ SmallInteger }"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3930
     start "{ SmallInteger }"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3931
     stop  "{ SmallInteger }"|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3932
1026
455c9f609f4d care for 16bit strings in expandPlaceHolders
Claus Gittinger <cg@exept.de>
parents: 1025
diff changeset
  3933
    expandedString := self species new:0.
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3934
    stop := self size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3935
    start := 1.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3936
    [start <= stop] whileTrue:[
1026
455c9f609f4d care for 16bit strings in expandPlaceHolders
Claus Gittinger <cg@exept.de>
parents: 1025
diff changeset
  3937
        idx := self indexOf:$% startingAt:start.
2737
9f29aecea39d care for % at end in expandPlaceHolders.
Claus Gittinger <cg@exept.de>
parents: 2736
diff changeset
  3938
        (idx == 0 or:[idx == stop]) ifTrue:[
1026
455c9f609f4d care for 16bit strings in expandPlaceHolders
Claus Gittinger <cg@exept.de>
parents: 1025
diff changeset
  3939
            ^ expandedString , (self copyFrom:start to:stop)
455c9f609f4d care for 16bit strings in expandPlaceHolders
Claus Gittinger <cg@exept.de>
parents: 1025
diff changeset
  3940
        ].
455c9f609f4d care for 16bit strings in expandPlaceHolders
Claus Gittinger <cg@exept.de>
parents: 1025
diff changeset
  3941
        "found a %"
455c9f609f4d care for 16bit strings in expandPlaceHolders
Claus Gittinger <cg@exept.de>
parents: 1025
diff changeset
  3942
        expandedString := expandedString , (self copyFrom:start to:(idx - 1)).
455c9f609f4d care for 16bit strings in expandPlaceHolders
Claus Gittinger <cg@exept.de>
parents: 1025
diff changeset
  3943
        next := self at:(idx + 1).
455c9f609f4d care for 16bit strings in expandPlaceHolders
Claus Gittinger <cg@exept.de>
parents: 1025
diff changeset
  3944
        (next == $%) ifTrue:[
455c9f609f4d care for 16bit strings in expandPlaceHolders
Claus Gittinger <cg@exept.de>
parents: 1025
diff changeset
  3945
            expandedString := expandedString , '%'
455c9f609f4d care for 16bit strings in expandPlaceHolders
Claus Gittinger <cg@exept.de>
parents: 1025
diff changeset
  3946
        ] ifFalse:[
455c9f609f4d care for 16bit strings in expandPlaceHolders
Claus Gittinger <cg@exept.de>
parents: 1025
diff changeset
  3947
            expandedString := expandedString , (argArray at:(next digitValue)) printString
455c9f609f4d care for 16bit strings in expandPlaceHolders
Claus Gittinger <cg@exept.de>
parents: 1025
diff changeset
  3948
        ].
455c9f609f4d care for 16bit strings in expandPlaceHolders
Claus Gittinger <cg@exept.de>
parents: 1025
diff changeset
  3949
        start := idx + 2
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3950
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3951
    ^  expandedString
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3952
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3953
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3954
     'hello %1' expandPlaceholdersWith:#('world') 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3955
     'hello %1; how is %2' expandPlaceholdersWith:#('world' 'this') 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3956
     'hello %2; how is %1' expandPlaceholdersWith:#('world' 'this') 
1308
cf41289a7ce2 commentary
Claus Gittinger <cg@exept.de>
parents: 1290
diff changeset
  3957
     '%1 plus %2 gives %3 ' expandPlaceholdersWith:#(4 5 9) 
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3958
    "
1026
455c9f609f4d care for 16bit strings in expandPlaceHolders
Claus Gittinger <cg@exept.de>
parents: 1025
diff changeset
  3959
2737
9f29aecea39d care for % at end in expandPlaceHolders.
Claus Gittinger <cg@exept.de>
parents: 2736
diff changeset
  3960
    "Modified: 1.7.1997 / 00:53:24 / cg"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3961
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3962
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3963
withCRs
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3964
    "return a new string consisting of receivers characters
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3965
     with all \-characters replaced by cr-characters."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3966
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3967
    ^ self copy replaceAll:$\ by:(Character cr)
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3968
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3969
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3970
     'hello\world' withCRs
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3971
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3972
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3973
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3974
withEscapes
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3975
    "return a new string consisting of receivers characters
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3976
     with all \X-character escapes replaced by corresponding-characters.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3977
     (similar to the way C-language Strings are converted).
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3978
     The following escapes are supported:
1008
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  3979
        \r      return character
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  3980
        \n      newline character
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  3981
        \b      backspace character
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  3982
        \f      formfeed character
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  3983
        \t      tab character
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  3984
        \e      escape character
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  3985
        \\      the \ character itself
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  3986
        \nnn    three digit octal number defining the characters ascii value
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  3987
        \other  other
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3988
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3989
     Notice, that \' is NOT a valid escape, since the general syntax of
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3990
     string constants is not affected by this method.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3991
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3992
     Although easily implementable, this is NOT done automatically
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3993
     by the compiler (due to a lack of a language standard for this).
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3994
     However, the compiler may detect sends ot #withEscapes to string literals
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3995
     and place a modified string constant into the binary/byte-code.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3996
     Therefore, no runtime penalty will be payed for using these escapes.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3997
     (not in pre 2.11 versions)
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3998
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  3999
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4000
    |sz      "{ SmallInteger }"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4001
     newSize "{ SmallInteger }"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4002
     srcIdx  "{ SmallInteger }"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4003
     dstIdx  "{ SmallInteger }"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4004
     val     "{ SmallInteger }"
1381
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4005
     newString next hasEmphasis e| 
359
claus
parents: 357
diff changeset
  4006
claus
parents: 357
diff changeset
  4007
    "
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4008
     first, count the number of escapes, to allow preallocation
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4009
     of the new string ...
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4010
     (it is faster to scan the string twice than to reallocate it multiple
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4011
      times in a WriteStream)
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4012
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4013
    sz := newSize := self size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4014
    srcIdx := 1.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4015
    [(srcIdx := self indexOf:$\ startingAt:srcIdx) ~~ 0] whileTrue:[
1008
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4016
        srcIdx == sz ifFalse:[
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4017
            newSize := newSize - 1.
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4018
            srcIdx := srcIdx + 1.
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4019
            next := self at:srcIdx.
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4020
            next == $0 ifTrue:[
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4021
                [(self at:srcIdx) isDigit] whileTrue:[
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4022
                    newSize := newSize - 1. srcIdx := srcIdx + 1.
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4023
                ]
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4024
            ].
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4025
        ].
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4026
        srcIdx := srcIdx + 1.
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4027
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4028
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4029
    newSize == sz ifTrue:[
1008
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4030
        ^ self
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4031
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4032
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4033
    newString := self species new:newSize.
1381
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4034
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4035
    hasEmphasis := self hasChangeOfEmphasis.
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4036
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4037
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4038
     copy over, replace escapes
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4039
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4040
    srcIdx := dstIdx := 1.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4041
    [srcIdx <= sz] whileTrue:[
1008
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4042
        next := self at:srcIdx.
1381
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4043
        hasEmphasis ifTrue:[
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4044
            e := self emphasisAt:srcIdx
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4045
        ].
1008
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4046
        srcIdx := srcIdx + 1.
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4047
        next == $\ ifTrue:[
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4048
            srcIdx <= sz ifTrue:[
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4049
                next := self at:srcIdx.
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4050
                srcIdx := srcIdx + 1.
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4051
                next == $r ifTrue:[
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4052
                    next := Character return
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4053
                ] ifFalse:[
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4054
                    next == $n ifTrue:[
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4055
                        next := Character nl
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4056
                    ] ifFalse:[
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4057
                        next == $b ifTrue:[
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4058
                            next := Character backspace
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4059
                        ] ifFalse:[
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4060
                            next == $f ifTrue:[
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4061
                                next := Character newPage
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4062
                            ] ifFalse:[
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4063
                                next == $t ifTrue:[
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4064
                                    next := Character tab
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4065
                                ] ifFalse:[
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4066
                                    next == $e ifTrue:[
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4067
                                        next := Character esc
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4068
                                    ] ifFalse:[
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4069
                                        next == $0 ifTrue:[
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4070
                                            val := 0.
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4071
                                            [next isDigit] whileTrue:[
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4072
                                                val := val * 8 + next digitValue.
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4073
                                                next := self at:srcIdx.
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4074
                                                srcIdx := srcIdx + 1.
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4075
                                            ].
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4076
                                            next := Character value:val.
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4077
                                        ]
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4078
                                    ]
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4079
                                ]
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4080
                            ]
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4081
                        ]
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4082
                    ]
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4083
                ].
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4084
            ].
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4085
        ].
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4086
        newString at:dstIdx put:next.
1381
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4087
        hasEmphasis ifTrue:[
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4088
            newString emphasisAt:dstIdx put:e
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4089
        ].
1008
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4090
        dstIdx := dstIdx + 1.
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4091
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4092
    ^ newString
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4093
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4094
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4095
     'hello world' withEscapes  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4096
     'hello\world' withEscapes   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4097
     'hello\world\' withEscapes   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4098
     'hello world\' withEscapes   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4099
     'hello\tworld' withEscapes   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4100
     'hello\nworld\na\n\tnice\n\t\tstring' withEscapes   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4101
     'hello\tworld\n' withEscapes   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4102
     'hello\010world' withEscapes   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4103
     'hello\r\nworld' withEscapes   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4104
    "
1008
1bd95ea83cac add \e in #withEscapes
Claus Gittinger <cg@exept.de>
parents: 1006
diff changeset
  4105
1381
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4106
    "Modified: 12.5.1996 / 12:53:34 / cg"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4107
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4108
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4109
withTabs
735
362ce9e28d89 expandTabs now handles multiline strings
Claus Gittinger <cg@exept.de>
parents: 678
diff changeset
  4110
    "return a string consisting of the receivers characters
362ce9e28d89 expandTabs now handles multiline strings
Claus Gittinger <cg@exept.de>
parents: 678
diff changeset
  4111
     where leading spaces are replaced by tabulator characters (assuming 8-col tabs).
362ce9e28d89 expandTabs now handles multiline strings
Claus Gittinger <cg@exept.de>
parents: 678
diff changeset
  4112
     Notice: if the receiver does not contain any tabs, it is returned unchanged;
362ce9e28d89 expandTabs now handles multiline strings
Claus Gittinger <cg@exept.de>
parents: 678
diff changeset
  4113
     otherwise a new string is returned.
362ce9e28d89 expandTabs now handles multiline strings
Claus Gittinger <cg@exept.de>
parents: 678
diff changeset
  4114
     Limitation: only the very first spaces are replaced 
362ce9e28d89 expandTabs now handles multiline strings
Claus Gittinger <cg@exept.de>
parents: 678
diff changeset
  4115
		 (i.e. if the receiver contains newLine characters,
362ce9e28d89 expandTabs now handles multiline strings
Claus Gittinger <cg@exept.de>
parents: 678
diff changeset
  4116
		  no tabs are inserted after those lineBreaks)"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4117
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4118
    |idx   "{ SmallInteger }" 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4119
     nTabs "{ SmallInteger }" 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4120
     newString|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4121
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4122
    idx := self findFirst:[:c | (c ~~ Character space)].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4123
    nTabs := (idx-1) // 8.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4124
    nTabs == 0 ifTrue:[^ self].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4125
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4126
    "any tabs"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4127
    newString := self class new:(self size - (nTabs * 7)).
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4128
    newString atAll:(1 to:nTabs) put:(Character tab).
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4129
    newString replaceFrom:(nTabs + 1) with:self startingAt:(nTabs * 8 + 1).
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4130
    ^ newString
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4131
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4132
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4133
     '12345678901234567890' withTabs 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4134
     '       8901234567890' withTabs 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4135
     '        901234567890' withTabs  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4136
     '               67890' withTabs
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4137
     '                7890' withTabs
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4138
     '                 890' withTabs
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4139
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4140
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4141
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4142
withTabsExpanded
735
362ce9e28d89 expandTabs now handles multiline strings
Claus Gittinger <cg@exept.de>
parents: 678
diff changeset
  4143
    "return a string consisting of the receivers characters,
362ce9e28d89 expandTabs now handles multiline strings
Claus Gittinger <cg@exept.de>
parents: 678
diff changeset
  4144
     where all tabulator characters are expanded into spaces (assuming 8-col tabs). 
362ce9e28d89 expandTabs now handles multiline strings
Claus Gittinger <cg@exept.de>
parents: 678
diff changeset
  4145
     Notice: if the receiver does not contain any tabs, it is returned unchanged;
362ce9e28d89 expandTabs now handles multiline strings
Claus Gittinger <cg@exept.de>
parents: 678
diff changeset
  4146
     otherwise a new string is returned.
362ce9e28d89 expandTabs now handles multiline strings
Claus Gittinger <cg@exept.de>
parents: 678
diff changeset
  4147
     This does handle multiline strings."
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4148
1381
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4149
    |col    "{ SmallInteger }" 
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4150
     str ch
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4151
     dstIdx "{ SmallInteger }"
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4152
     newSz  "{ SmallInteger }"
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4153
     sz "{ SmallInteger }"
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4154
     hasEmphasis e|
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4155
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4156
    (self includes:(Character tab)) ifFalse:[^ self].
1381
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4157
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4158
    sz := self size.
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4159
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4160
    "/ count the new size first, instead of
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4161
    "/ multiple resizing (better for large strings)
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4162
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4163
    col := 1. newSz := 0.
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4164
    1 to:sz do:[:srcIdx |
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4165
        ch := self at:srcIdx.
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4166
        ch == Character tab ifFalse:[
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4167
            col := col + 1.
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4168
            newSz := newSz + 1.
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4169
            ch == Character cr ifTrue:[
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4170
                col := 1
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4171
            ].
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4172
        ] ifTrue:[
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4173
            (col \\ 8) to:8 do:[:ii |
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4174
                newSz := newSz + 1.
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4175
                col := col + 1
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4176
            ].
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4177
        ]
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4178
    ].
1381
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4179
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4180
    str := self species new:newSz.
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4181
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4182
    hasEmphasis := self hasChangeOfEmphasis.
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4183
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4184
    col := 1. dstIdx := 1.
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4185
    1 to:sz do:[:srcIdx |
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4186
        ch := self at:srcIdx.
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4187
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4188
        ch == Character tab ifFalse:[
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4189
            col := col + 1.
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4190
            ch == Character cr ifTrue:[
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4191
                col := 1
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4192
            ].
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4193
            hasEmphasis ifTrue:[
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4194
                e := self emphasisAt:srcIdx.
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4195
                str emphasisAt:dstIdx put:e
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4196
            ].
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4197
            str at:dstIdx put:ch.
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4198
            dstIdx := dstIdx + 1
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4199
        ] ifTrue:[
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4200
            (col \\ 8) to:8 do:[:ii |
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4201
                str at:dstIdx put:Character space.
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4202
                dstIdx := dstIdx + 1.
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4203
                col := col + 1
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4204
            ].
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4205
        ]
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4206
    ].
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4207
    ^ str
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4208
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4209
    "
735
362ce9e28d89 expandTabs now handles multiline strings
Claus Gittinger <cg@exept.de>
parents: 678
diff changeset
  4210
     ('1' , Character tab asString , 'x') withTabsExpanded          
362ce9e28d89 expandTabs now handles multiline strings
Claus Gittinger <cg@exept.de>
parents: 678
diff changeset
  4211
     ('12345' , Character tab asString , 'x') withTabsExpanded      
362ce9e28d89 expandTabs now handles multiline strings
Claus Gittinger <cg@exept.de>
parents: 678
diff changeset
  4212
     ('123456' , Character tab asString , 'x') withTabsExpanded     
362ce9e28d89 expandTabs now handles multiline strings
Claus Gittinger <cg@exept.de>
parents: 678
diff changeset
  4213
     ('1234567' , Character tab asString , 'x') withTabsExpanded   
362ce9e28d89 expandTabs now handles multiline strings
Claus Gittinger <cg@exept.de>
parents: 678
diff changeset
  4214
     ('12345678' , Character tab asString , 'x') withTabsExpanded   
362ce9e28d89 expandTabs now handles multiline strings
Claus Gittinger <cg@exept.de>
parents: 678
diff changeset
  4215
     ('123456789' , Character tab asString , 'x') withTabsExpanded 
362ce9e28d89 expandTabs now handles multiline strings
Claus Gittinger <cg@exept.de>
parents: 678
diff changeset
  4216
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4217
     (String with:Character tab
1381
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4218
             with:Character tab
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4219
             with:$1) withTabsExpanded
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4220
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4221
     (String with:Character tab
1381
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4222
             with:$1
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4223
             with:Character tab
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4224
             with:$2) withTabsExpanded  
735
362ce9e28d89 expandTabs now handles multiline strings
Claus Gittinger <cg@exept.de>
parents: 678
diff changeset
  4225
362ce9e28d89 expandTabs now handles multiline strings
Claus Gittinger <cg@exept.de>
parents: 678
diff changeset
  4226
     (String with:Character tab
1381
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4227
             with:$1
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4228
             with:Character cr
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4229
             with:Character tab
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4230
             with:$2) withTabsExpanded  
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4231
    "
735
362ce9e28d89 expandTabs now handles multiline strings
Claus Gittinger <cg@exept.de>
parents: 678
diff changeset
  4232
1381
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4233
    "Modified: 12.5.1996 / 13:05:10 / cg"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4234
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4235
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4236
withoutCRs
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4237
    "return a new collection consisting of receivers elements
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4238
     with all cr-characters replaced by \-characters.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4239
     This is the reverse operation of withCRs."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4240
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4241
    ^ self copy replaceAll:(Character cr) by:$\
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4242
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4243
     'hello
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4244
world' withoutCRs
359
claus
parents: 357
diff changeset
  4245
    "
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4246
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4247
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4248
withoutLeadingSeparators
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4249
    "return a copy of myself without leading separators.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4250
     Notice: this does remove tabs, newline or any other whitespace.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4251
     Returns an empty string, if the receiver consist only of whitespace."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4252
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4253
    |index|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4254
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4255
    index := self indexOfNonSeparatorStartingAt:1.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4256
    index ~~ 0 ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4257
	index == 1 ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4258
	    ^ self
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4259
	].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4260
	^ self copyFrom:index
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4261
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4262
    ^ ''
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4263
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4264
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4265
     '    foo    ' withoutLeadingSeparators  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4266
     'foo    '     withoutLeadingSeparators   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4267
     '    foo'     withoutLeadingSeparators  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4268
     '       '     withoutLeadingSeparators   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4269
     'foo'         withoutLeadingSeparators   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4270
     ('  ' , Character tab asString , ' foo   ') withoutLeadingSeparators inspect 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4271
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4272
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4273
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4274
withoutSeparators
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4275
    "return a copy of myself without leading and trailing whitespace.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4276
     Whitespace is space, tab, newline, formfeed.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4277
     Use withoutSpaces, if you want to remove spaces only."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4278
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4279
    |startIndex "{ Class: SmallInteger }"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4280
     endIndex   "{ Class: SmallInteger }" 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4281
     sz|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4282
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4283
    sz := self size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4284
    startIndex := 1.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4285
    endIndex := sz.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4286
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4287
    [(startIndex < endIndex) and:[(self at:startIndex) isSeparator]] whileTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4288
	startIndex := startIndex + 1
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4289
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4290
    [(endIndex > 1) and:[(self at:endIndex) isSeparator]] whileTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4291
	endIndex := endIndex - 1
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4292
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4293
    startIndex > endIndex ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4294
	^ ''
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4295
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4296
    ((startIndex == 1) and:[endIndex == sz]) ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4297
	^ self
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4298
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4299
    ^ self copyFrom:startIndex to:endIndex
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4300
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4301
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4302
     '    foo    ' withoutSeparators      
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4303
     '    foo' withoutSeparators      
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4304
     'foo    ' withoutSeparators      
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4305
     '       ' withoutSeparators      
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4306
     ('  foo' , Character tab asString , '    ') withoutSeparators inspect 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4307
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4308
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4309
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4310
withoutSpaces
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4311
    "return a copy of myself without leading and trailing spaces.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4312
     Notice: this does NOT remove tabs, newline or any other whitespace.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4313
     Use withoutSeparators for this."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4314
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4315
    |startIndex "{ Class: SmallInteger }"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4316
     endIndex   "{ Class: SmallInteger }" 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4317
     sz|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4318
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4319
    sz := self size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4320
    startIndex := 1.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4321
    endIndex := sz.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4322
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4323
    [(startIndex < endIndex) and:[(self at:startIndex) == Character space]] whileTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4324
	startIndex := startIndex + 1
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4325
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4326
    [(endIndex > 1) and:[(self at:endIndex) == Character space]] whileTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4327
	endIndex := endIndex - 1
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4328
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4329
    startIndex > endIndex ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4330
	^ ''
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4331
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4332
    ((startIndex == 1) and:[endIndex == sz]) ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4333
	^ self
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4334
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4335
    ^ self copyFrom:startIndex to:endIndex
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4336
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4337
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4338
     '    foo    ' withoutSpaces  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4339
     'foo    '     withoutSpaces   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4340
     '    foo'     withoutSpaces  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4341
     '       '     withoutSpaces   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4342
     ('  foo' , Character tab asString , '    ') withoutSpaces inspect 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4343
    "
915
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4344
!
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4345
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4346
withoutTrailingSeparators
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4347
    "return a copy of myself without trailing separators.
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4348
     Notice: this does remove tabs, newline or any other whitespace.
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4349
     Returns an empty string, if the receiver consist only of whitespace."
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4350
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4351
    |index|
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4352
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4353
    index := self size.
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4354
    [index ~~ 0] whileTrue:[
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4355
	(self at:index) isSeparator ifFalse:[
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4356
	    ^ self copyTo:index
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4357
	].
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4358
	index := index - 1
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4359
    ].
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4360
    ^ ''
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4361
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4362
    "
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4363
     '    foo    ' withoutTrailingSeparators  
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4364
     'foo    '     withoutTrailingSeparators   
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4365
     '    foo'     withoutTrailingSeparators  
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4366
     '       '     withoutTrailingSeparators   
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4367
     'foo'         withoutTrailingSeparators   
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4368
     ('  ' , Character tab asString , ' foo   ') withoutTrailingSeparators inspect 
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4369
     ('   foo' , Character tab asString) withoutTrailingSeparators inspect 
1ac1c7e942f4 added withoutTrailingSeparators
Claus Gittinger <cg@exept.de>
parents: 828
diff changeset
  4370
    "
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4371
! !
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4372
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4373
!CharacterArray methodsFor:'substring searching'!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4374
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4375
findString:subString
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4376
    "find a substring. if found, return the index;
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4377
     if not found, return 0."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4378
1145
a094d90e11bf dont use [0] blocks - use 0 constant instead
Claus Gittinger <cg@exept.de>
parents: 1133
diff changeset
  4379
    ^ self indexOfSubCollection:subString startingAt:1 ifAbsent:0
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4380
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4381
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4382
     'hello world' findString:'llo'   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4383
     'hello world' findString:'ole'  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4384
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4385
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4386
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4387
findString:subString ifAbsent:exceptionBlock
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4388
    "find a substring. If found, return the index;
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4389
     if not found, return the result of evaluating exceptionBlock."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4390
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4391
    ^ self indexOfSubCollection:subString startingAt:1 ifAbsent:exceptionBlock
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4392
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4393
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4394
findString:subString startingAt:index
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4395
    "find a substring, starting at index. if found, return the index;
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4396
     if not found, return 0."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4397
1145
a094d90e11bf dont use [0] blocks - use 0 constant instead
Claus Gittinger <cg@exept.de>
parents: 1133
diff changeset
  4398
    ^ self indexOfSubCollection:subString startingAt:index ifAbsent:0
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4399
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4400
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4401
     'hello yello' findString:'llo' startingAt:1   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4402
     'hello yello' findString:'llo' startingAt:5   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4403
     'hello yello' findString:'llo' startingAt:15   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4404
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4405
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4406
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4407
findString:subString startingAt:index ifAbsent:exceptionBlock
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4408
    "find a substring, starting at index. if found, return the index;
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4409
     if not found, return the result of evaluating exceptionBlock."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4410
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4411
    ^ self indexOfSubCollection:subString startingAt:index ifAbsent:exceptionBlock
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4412
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4413
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4414
includesString:aString
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4415
    "return true, if a substring is contained in the receiver"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4416
1145
a094d90e11bf dont use [0] blocks - use 0 constant instead
Claus Gittinger <cg@exept.de>
parents: 1133
diff changeset
  4417
    ^ (self indexOfSubCollection:aString startingAt:1 ifAbsent:0) ~~ 0
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4418
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4419
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4420
     'hello world' includesString:'hel' 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4421
     'hello world' includesString:'rld' 
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4422
     'hello world' includesString:'llo'  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4423
     'hello world' includesString:'LLO'   
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4424
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4425
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4426
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4427
indexOfSubCollection:subString startingAt:index ifAbsent:exceptionBlock
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4428
    "find a substring, starting at index. if found, return the index;
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4429
     if not found, return the result of evaluating exceptionBlock.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4430
     This is a q&d hack - not very efficient"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4431
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4432
    |firstChar found
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4433
     startIndex "{ Class: SmallInteger }"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4434
     subSize    "{ Class: SmallInteger }"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4435
     mySize     "{ Class: SmallInteger }"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4436
     runIdx     "{ Class: SmallInteger }" |
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4437
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4438
    subSize := subString size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4439
    subSize == 0 ifTrue:[^ index]. "empty string matches"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4440
    mySize := self size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4441
    firstChar := subString at:1.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4442
    startIndex := self indexOf:firstChar startingAt:index.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4443
    [startIndex == 0] whileFalse:[
1005
4507487e0be6 more on encoding/decoding (still unfinished)
Claus Gittinger <cg@exept.de>
parents: 997
diff changeset
  4444
        runIdx := startIndex.
4507487e0be6 more on encoding/decoding (still unfinished)
Claus Gittinger <cg@exept.de>
parents: 997
diff changeset
  4445
        found := true.
4507487e0be6 more on encoding/decoding (still unfinished)
Claus Gittinger <cg@exept.de>
parents: 997
diff changeset
  4446
        1 to:subSize do:[:i |
4507487e0be6 more on encoding/decoding (still unfinished)
Claus Gittinger <cg@exept.de>
parents: 997
diff changeset
  4447
            runIdx > mySize ifTrue:[
4507487e0be6 more on encoding/decoding (still unfinished)
Claus Gittinger <cg@exept.de>
parents: 997
diff changeset
  4448
                found := false
4507487e0be6 more on encoding/decoding (still unfinished)
Claus Gittinger <cg@exept.de>
parents: 997
diff changeset
  4449
            ] ifFalse:[
4507487e0be6 more on encoding/decoding (still unfinished)
Claus Gittinger <cg@exept.de>
parents: 997
diff changeset
  4450
                (subString at:i) ~= (self at:runIdx) ifTrue:[
4507487e0be6 more on encoding/decoding (still unfinished)
Claus Gittinger <cg@exept.de>
parents: 997
diff changeset
  4451
                    found := false
4507487e0be6 more on encoding/decoding (still unfinished)
Claus Gittinger <cg@exept.de>
parents: 997
diff changeset
  4452
                ]
4507487e0be6 more on encoding/decoding (still unfinished)
Claus Gittinger <cg@exept.de>
parents: 997
diff changeset
  4453
            ].
4507487e0be6 more on encoding/decoding (still unfinished)
Claus Gittinger <cg@exept.de>
parents: 997
diff changeset
  4454
            runIdx := runIdx + 1
4507487e0be6 more on encoding/decoding (still unfinished)
Claus Gittinger <cg@exept.de>
parents: 997
diff changeset
  4455
        ].
4507487e0be6 more on encoding/decoding (still unfinished)
Claus Gittinger <cg@exept.de>
parents: 997
diff changeset
  4456
        found ifTrue:[
4507487e0be6 more on encoding/decoding (still unfinished)
Claus Gittinger <cg@exept.de>
parents: 997
diff changeset
  4457
            ^ startIndex
4507487e0be6 more on encoding/decoding (still unfinished)
Claus Gittinger <cg@exept.de>
parents: 997
diff changeset
  4458
        ].
4507487e0be6 more on encoding/decoding (still unfinished)
Claus Gittinger <cg@exept.de>
parents: 997
diff changeset
  4459
        startIndex := self indexOf:firstChar startingAt:(startIndex + 1)
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4460
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4461
    ^ exceptionBlock value
1005
4507487e0be6 more on encoding/decoding (still unfinished)
Claus Gittinger <cg@exept.de>
parents: 997
diff changeset
  4462
4507487e0be6 more on encoding/decoding (still unfinished)
Claus Gittinger <cg@exept.de>
parents: 997
diff changeset
  4463
    "Modified: 23.2.1996 / 15:35:15 / cg"
651
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4464
!
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4465
652
Claus Gittinger <cg@exept.de>
parents: 651
diff changeset
  4466
restAfter:keyword withoutSeparators:strip
651
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4467
    "compare the left of the receiver with keyword,
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4468
     if it matches return the right. 
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4469
     Finally, if strip is true, remove whiteSpace.
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4470
     This method is used to match and extract lines of the form:
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4471
	something: rest
759
908363ce8a32 interest is written with one 'r' (shame on me)
Claus Gittinger <cg@exept.de>
parents: 735
diff changeset
  4472
     where we are interested in rest, but only if the receiver string
651
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4473
     begins with something. 
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4474
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4475
     You may wonder why such a specialized method exists here
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4476
     - this is so common when processing mailboxes,
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4477
     rcs files, nntp/pop3 responses, that is was considered worth
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4478
     a special method here to avoid having the code below a hundred
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4479
     times in variuos places."
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4480
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4481
    |rest|
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4482
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4483
    (self startsWith:keyword) ifTrue:[
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4484
	rest := self copyFrom:(keyword size + 1).
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4485
	strip ifTrue:[
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4486
	    rest := rest withoutSeparators
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4487
	].
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4488
	^ rest
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4489
    ].
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4490
    ^ nil
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4491
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4492
    "
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4493
     'foo: hello world' restAfter:'foo:' withoutSeparators:true 
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4494
     'funny: something' restAfter:'foo:' withoutSeparators:true 
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4495
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4496
     'foo:     hello world    ' restAfter:'foo:' withoutSeparators:true 
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4497
     'foo:     hello world    ' restAfter:'foo:' withoutSeparators:false 
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4498
    "
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4499
2093b018dc3a added restAfter:withoutSeparators:
Claus Gittinger <cg@exept.de>
parents: 650
diff changeset
  4500
    "Created: 25.11.1995 / 11:04:18 / cg"
328
claus
parents: 327
diff changeset
  4501
! !
claus
parents: 327
diff changeset
  4502
255
2b2c5c0facab *** empty log message ***
claus
parents: 247
diff changeset
  4503
!CharacterArray methodsFor:'testing'!
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4504
1613
cf7be5555874 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1440
diff changeset
  4505
continuesWith:aString startingAt:startIndex
cf7be5555874 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1440
diff changeset
  4506
    "return true, if the receiver beginning at startIndex
cf7be5555874 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1440
diff changeset
  4507
     contains the characters in aString."
cf7be5555874 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1440
diff changeset
  4508
cf7be5555874 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1440
diff changeset
  4509
    |sz  "{Class: SmallInteger }"
cf7be5555874 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1440
diff changeset
  4510
     idx "{Class: SmallInteger }"|
cf7be5555874 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1440
diff changeset
  4511
cf7be5555874 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1440
diff changeset
  4512
    sz := aString size.
cf7be5555874 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1440
diff changeset
  4513
    idx := startIndex.
cf7be5555874 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1440
diff changeset
  4514
cf7be5555874 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1440
diff changeset
  4515
    1 to:sz do:[:i |
cf7be5555874 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1440
diff changeset
  4516
        (self at:idx) ~~ (aString at:i) ifTrue:[^ false].
cf7be5555874 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1440
diff changeset
  4517
        idx := idx + 1
cf7be5555874 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1440
diff changeset
  4518
    ].
cf7be5555874 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1440
diff changeset
  4519
    ^ true
cf7be5555874 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1440
diff changeset
  4520
cf7be5555874 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1440
diff changeset
  4521
    "
cf7be5555874 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1440
diff changeset
  4522
     'hello world' continuesWith:'world' startingAt:6                
cf7be5555874 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1440
diff changeset
  4523
     'hello world' continuesWith:'world' startingAt:7                
cf7be5555874 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1440
diff changeset
  4524
    "
cf7be5555874 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1440
diff changeset
  4525
cf7be5555874 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1440
diff changeset
  4526
    "Created: 12.5.1996 / 15:46:40 / cg"
cf7be5555874 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1440
diff changeset
  4527
    "Modified: 26.7.1996 / 19:08:36 / cg"
cf7be5555874 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1440
diff changeset
  4528
!
cf7be5555874 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 1440
diff changeset
  4529
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4530
countWords
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4531
    "return the number of words, which are separated by separators"
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4532
92
0c73b48551ac *** empty log message ***
claus
parents: 82
diff changeset
  4533
    |tally "{ Class: SmallInteger }"
0c73b48551ac *** empty log message ***
claus
parents: 82
diff changeset
  4534
     start "{ Class: SmallInteger }"
0c73b48551ac *** empty log message ***
claus
parents: 82
diff changeset
  4535
     mySize "{ Class: SmallInteger }"
0c73b48551ac *** empty log message ***
claus
parents: 82
diff changeset
  4536
     stop ch|
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4537
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4538
    tally := 0.
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4539
    start := 1.
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4540
    mySize := self size.
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4541
    [start <= mySize] whileTrue:[
154
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4542
	ch := self at:start.
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4543
	ch isSeparator ifTrue:[
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4544
	    start := start + 1
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4545
	] ifFalse:[
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4546
	    stop := self indexOfSeparatorStartingAt:start.
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4547
	    (stop == 0) ifTrue:[
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4548
		stop := mySize + 1
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4549
	    ].
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4550
	    tally := tally + 1.
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4551
	    start := stop
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4552
	]
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4553
    ].
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4554
    ^ tally
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4555
92
0c73b48551ac *** empty log message ***
claus
parents: 82
diff changeset
  4556
    "
0c73b48551ac *** empty log message ***
claus
parents: 82
diff changeset
  4557
     'hello world isnt this nice' countWords'
0c73b48551ac *** empty log message ***
claus
parents: 82
diff changeset
  4558
    "
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4559
!
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4560
1381
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4561
endsWith:aString
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4562
    "return true, if the receiver ends with something, aString."
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4563
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4564
    |s|
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4565
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4566
    (s := self string) ~~ self ifTrue:[
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4567
        ^ s endsWith:aString
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4568
    ].
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4569
    ^ super endsWith:aString
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4570
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4571
    "
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4572
     'hello world' endsWith:'world'                 
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4573
     'hello world' asText allBold endsWith:'world'  
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4574
    "
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4575
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4576
    "Modified: 12.5.1996 / 15:49:18 / cg"
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4577
!
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4578
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4579
isAlphaNumeric
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4580
    "return true, if the receiver is some alphanumeric word;
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4581
     i.e. consists of a letter followed by letters or digits."
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4582
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4583
    self size == 0 ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4584
	"mhmh what is this ?"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4585
	^ false
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4586
    ].
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4587
    (self at:1) isLetter ifFalse:[^ false].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4588
    self do:[:char |
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4589
	char isLetterOrDigit ifFalse:[^ false].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4590
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4591
    ^ true
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4592
77
6c38ca59927f *** empty log message ***
claus
parents: 68
diff changeset
  4593
    "
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4594
     'helloWorld' isAlphaNumeric  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4595
     'foo1234' isAlphaNumeric    
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4596
     'f1234' isAlphaNumeric      
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4597
     '1234' isAlphaNumeric       
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4598
     '+' isAlphaNumeric         
77
6c38ca59927f *** empty log message ***
claus
parents: 68
diff changeset
  4599
    "
6c38ca59927f *** empty log message ***
claus
parents: 68
diff changeset
  4600
!
6c38ca59927f *** empty log message ***
claus
parents: 68
diff changeset
  4601
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4602
isBlank
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4603
    "return true, if the receiver contains spaces only"
77
6c38ca59927f *** empty log message ***
claus
parents: 68
diff changeset
  4604
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4605
    self do:[:char |
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4606
	char ~~ Character space ifTrue:[^ false].
77
6c38ca59927f *** empty log message ***
claus
parents: 68
diff changeset
  4607
    ].
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4608
    ^ true
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4609
!
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4610
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4611
levenshteinTo:aString
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4612
    "return the levenshtein distance to the argument, aString;
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4613
     this value corrensponds to the number of replacements that have to be
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4614
     made to get aString from the receiver.
196
ab5727ae9e38 added asUppercaseFirst / asLowercaseFirst
claus
parents: 163
diff changeset
  4615
     See IEEE transactions on Computers 1976 Pg 172 ff."
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4616
327
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  4617
    "
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  4618
     in the following, we assum that ommiting a character
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  4619
     is less of an error than inserting an extra character.
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  4620
     Therefore the different insertion (i) and deletion (d)
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  4621
     values.
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  4622
    "
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  4623
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4624
    ^ self levenshteinTo:aString s:4 c:1 i:2 d:6
154
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4625
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4626
    "
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4627
     'ocmprt' levenshteinTo:'computer'
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4628
     'computer' levenshteinTo:'computer'
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4629
     'ocmputer' levenshteinTo:'computer'
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4630
     'cmputer' levenshteinTo:'computer'
327
183f094cfd72 last with old match code
claus
parents: 326
diff changeset
  4631
     'computer' levenshteinTo:'cmputer'
154
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4632
     'Computer' levenshteinTo:'computer'
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4633
    "
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4634
!
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4635
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4636
levenshteinTo:aString s:substWeight c:caseWeight i:insrtWeight d:deleteWeight
196
ab5727ae9e38 added asUppercaseFirst / asLowercaseFirst
claus
parents: 163
diff changeset
  4637
    "parametrized levenshtein. 
ab5727ae9e38 added asUppercaseFirst / asLowercaseFirst
claus
parents: 163
diff changeset
  4638
     return the levenshtein distance to the argument, aString;
ab5727ae9e38 added asUppercaseFirst / asLowercaseFirst
claus
parents: 163
diff changeset
  4639
     this value corrensponds to the number of replacements that have to be
ab5727ae9e38 added asUppercaseFirst / asLowercaseFirst
claus
parents: 163
diff changeset
  4640
     made to get aString from the receiver.
ab5727ae9e38 added asUppercaseFirst / asLowercaseFirst
claus
parents: 163
diff changeset
  4641
     The arguments are the costs for substitution, case-change, insertion and 
ab5727ae9e38 added asUppercaseFirst / asLowercaseFirst
claus
parents: 163
diff changeset
  4642
     deletion of a character.
ab5727ae9e38 added asUppercaseFirst / asLowercaseFirst
claus
parents: 163
diff changeset
  4643
     See IEEE transactions on Computers 1976 Pg 172 ff"
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4644
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4645
    |d  "delta matrix"
196
ab5727ae9e38 added asUppercaseFirst / asLowercaseFirst
claus
parents: 163
diff changeset
  4646
     len1 "{ Class: SmallInteger }"
ab5727ae9e38 added asUppercaseFirst / asLowercaseFirst
claus
parents: 163
diff changeset
  4647
     len2 "{ Class: SmallInteger }"
ab5727ae9e38 added asUppercaseFirst / asLowercaseFirst
claus
parents: 163
diff changeset
  4648
     dim  "{ Class: SmallInteger }"
ab5727ae9e38 added asUppercaseFirst / asLowercaseFirst
claus
parents: 163
diff changeset
  4649
     prevRow row col 
ab5727ae9e38 added asUppercaseFirst / asLowercaseFirst
claus
parents: 163
diff changeset
  4650
     dimPlus1 "{ Class: SmallInteger }"
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4651
     min pp c1 c2|
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4652
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4653
    len1 := self size.
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4654
    len2 := aString size.
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4655
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4656
    "create the help-matrix"
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4657
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4658
    dim := len1 max:len2.
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4659
    dimPlus1 := dim + 1.
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4660
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4661
    d := Array new:dimPlus1.
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4662
    1 to:dimPlus1 do:[:i |
154
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4663
	d at:i put:(Array new:dimPlus1)
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4664
    ].
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4665
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4666
    "init help-matrix"
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4667
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4668
    (d at:1) at:1 put:0.
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4669
    row := d at:1.
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4670
    1 to:dim do:[:j |
154
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4671
	row at:(j + 1) put:( (row at:j) + insrtWeight )
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4672
    ].
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4673
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4674
    1 to:dim do:[:i |
154
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4675
	 (d at:(i + 1)) at:1 put:(  ((d at:i) at:1) + deleteWeight )
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4676
    ].
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4677
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4678
    1 to:len1 do:[:i |
154
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4679
	c1 := self at:i.
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4680
	1 to:len2 do:[:j |
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4681
	    c2 := aString at:j.
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4682
	    (c1 == c2) ifTrue:[
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4683
		pp := 0
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4684
	    ] ifFalse:[
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4685
		(c1 asLowercase == c2 asLowercase) ifTrue:[
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4686
		    pp := caseWeight
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4687
		] ifFalse:[
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4688
		    pp := substWeight
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4689
		]
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4690
	    ].
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4691
	    prevRow := d at:i.
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4692
	    row := d at:(i + 1).
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4693
	    col := j + 1.
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4694
	    min := (prevRow at:j) + pp.
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4695
	    min := min min:( (row at:j) + insrtWeight).
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4696
	    min := min min:( (prevRow at:col) + deleteWeight).
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4697
	    row at:col put: min
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4698
	]
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4699
    ].
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4700
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4701
    ^ (d at:(len1 + 1)) at:(len2 + 1)
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4702
!
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4703
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4704
numArgs
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4705
    "treating the receiver as a message selector, return how many arguments would it take"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4706
2051
08fd6a693ee3 ask Scanner for valid binarySelector characters in numArgs
Claus Gittinger <cg@exept.de>
parents: 1978
diff changeset
  4707
    |binopChars firstChar|
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4708
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4709
    (self size > 2) ifFalse:[
2051
08fd6a693ee3 ask Scanner for valid binarySelector characters in numArgs
Claus Gittinger <cg@exept.de>
parents: 1978
diff changeset
  4710
        binopChars := Scanner binarySelectorCharacters.
08fd6a693ee3 ask Scanner for valid binarySelector characters in numArgs
Claus Gittinger <cg@exept.de>
parents: 1978
diff changeset
  4711
        firstChar := self at:1.
08fd6a693ee3 ask Scanner for valid binarySelector characters in numArgs
Claus Gittinger <cg@exept.de>
parents: 1978
diff changeset
  4712
08fd6a693ee3 ask Scanner for valid binarySelector characters in numArgs
Claus Gittinger <cg@exept.de>
parents: 1978
diff changeset
  4713
        (self size == 1) ifTrue:[
08fd6a693ee3 ask Scanner for valid binarySelector characters in numArgs
Claus Gittinger <cg@exept.de>
parents: 1978
diff changeset
  4714
            ((binopChars occurrencesOf:firstChar) == 0) ifTrue:[^ 0].
08fd6a693ee3 ask Scanner for valid binarySelector characters in numArgs
Claus Gittinger <cg@exept.de>
parents: 1978
diff changeset
  4715
            ^ 1
08fd6a693ee3 ask Scanner for valid binarySelector characters in numArgs
Claus Gittinger <cg@exept.de>
parents: 1978
diff changeset
  4716
        ].
08fd6a693ee3 ask Scanner for valid binarySelector characters in numArgs
Claus Gittinger <cg@exept.de>
parents: 1978
diff changeset
  4717
        ((binopChars occurrencesOf:firstChar) == 0) ifFalse:[
08fd6a693ee3 ask Scanner for valid binarySelector characters in numArgs
Claus Gittinger <cg@exept.de>
parents: 1978
diff changeset
  4718
            ((binopChars occurrencesOf:(self at:2)) == 0) ifFalse:[^ 1]
08fd6a693ee3 ask Scanner for valid binarySelector characters in numArgs
Claus Gittinger <cg@exept.de>
parents: 1978
diff changeset
  4719
        ]
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4720
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4721
    ^ self occurrencesOf:$:
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4722
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4723
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4724
     'foo:bar:' numArgs  
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4725
     #foo:bar: numArgs    
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4726
     'hello' numArgs       
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4727
     '+' numArgs   
2051
08fd6a693ee3 ask Scanner for valid binarySelector characters in numArgs
Claus Gittinger <cg@exept.de>
parents: 1978
diff changeset
  4728
     '|' numArgs   
08fd6a693ee3 ask Scanner for valid binarySelector characters in numArgs
Claus Gittinger <cg@exept.de>
parents: 1978
diff changeset
  4729
     '?' numArgs   
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4730
    "
2051
08fd6a693ee3 ask Scanner for valid binarySelector characters in numArgs
Claus Gittinger <cg@exept.de>
parents: 1978
diff changeset
  4731
08fd6a693ee3 ask Scanner for valid binarySelector characters in numArgs
Claus Gittinger <cg@exept.de>
parents: 1978
diff changeset
  4732
    "Modified: 4.1.1997 / 14:16:14 / cg"
608
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4733
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4734
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4735
partsIfSelector
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4736
    "treat the receiver as a message selector, return a collection of parts."
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4737
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4738
    |idx1 "{ Class: SmallInteger }"
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4739
     coll idx2 sz|
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4740
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4741
    coll := OrderedCollection new.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4742
    idx1 := 1.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4743
    sz := self size.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4744
    [true] whileTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4745
	idx2 := self indexOf:$: startingAt:idx1 + 1.
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4746
	(idx2 == 0 or:[idx2 == sz]) ifTrue:[
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4747
	    coll add:(self copyFrom:idx1).
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4748
	    ^ coll
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4749
	].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4750
	coll add:(self copyFrom:idx1 to:idx2).
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4751
	idx1 := idx2 + 1
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4752
    ].
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4753
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4754
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4755
     'foo:bar:' partsIfSelector     
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4756
     #foo:bar: partsIfSelector     
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4757
     'hello' partsIfSelector       
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4758
     '+' partsIfSelector           
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4759
    "
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4760
!
cd5ac440fa95 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 581
diff changeset
  4761
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4762
spellAgainst: aString 
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4763
    "return an integer between 0 and 100 indicating how similar 
154
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4764
     the argument is to the receiver.  No case conversion is done.
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4765
     This algorithm is much simpler (but also less exact) than the
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4766
     levenshtein distance. Experiment which is better for your
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4767
     application."
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4768
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4769
    | i1     "{ Class: SmallInteger }"
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4770
      i2     "{ Class: SmallInteger }"
154
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4771
      next1  "{ Class: SmallInteger }"
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4772
      next2  "{ Class: SmallInteger }"
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4773
      size1  "{ Class: SmallInteger }"
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4774
      size2  "{ Class: SmallInteger }"
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4775
      score  "{ Class: SmallInteger }"
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4776
      maxLen "{ Class: SmallInteger }" |
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4777
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4778
    size1 := self size.
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4779
    size2 := aString size.
154
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4780
    maxLen := size1 max:size2.
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4781
    score := 0.
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4782
    i1 := i2 := 1.
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4783
    [i1 <= size1 and: [i2 <= size2]] whileTrue:[
154
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4784
	next1 := i1 + 1.
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4785
	next2 := i2 + 1.
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4786
	(self at:i1) == (aString at:i2) ifTrue: [
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4787
	    score := score+1.             
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4788
	    i1 := next1.                    
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4789
	    i2 := next2
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4790
	] ifFalse: [
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4791
	    (i2 < size2 and: [(self at:i1) == (aString at:next2)]) ifTrue: [
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4792
		i2 := next2
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4793
	    ] ifFalse: [
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4794
		(i1 < size1 and: [(self at:next1) == (aString at:i2)]) ifTrue: [
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4795
		    i1 := next1
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4796
		] ifFalse: [
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4797
		    i1 := next1.
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4798
		    i2 := next2
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4799
		] 
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4800
	    ] 
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4801
	] 
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4802
    ].
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4803
1124
53f6970d9d7d *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 1059
diff changeset
  4804
    score == maxLen ifTrue: [^ 100].
154
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4805
    ^ 100 * score // maxLen
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4806
154
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4807
    " 
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4808
     'Smalltalk' spellAgainst: 'Smalltlak' 
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4809
     'Smalltalk' spellAgainst: 'smalltlak' 
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4810
     'Smalltalk' spellAgainst: 'smalltalk' 
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4811
     'Smalltalk' spellAgainst: 'smalltlk'  
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4812
     'Smalltalk' spellAgainst: 'Smalltolk'   
d4236ec280a6 *** empty log message ***
claus
parents: 138
diff changeset
  4813
    "
1381
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4814
!
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4815
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4816
startsWith:aString
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4817
    "return true, if the receiver starts with something, aString."
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4818
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4819
    |s|
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4820
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4821
    (s := self string) ~~ self ifTrue:[
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4822
        ^ s startsWith:aString
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4823
    ].
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4824
    ^ super startsWith:aString
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4825
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4826
    "
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4827
     'hello world' startsWith:'hello'                 
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4828
     'hello world' asText allBold startsWith:'hello'  
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4829
    "
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4830
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4831
    "Created: 12.5.1996 / 15:46:40 / cg"
c920ec16a98f Text fixes
Claus Gittinger <cg@exept.de>
parents: 1375
diff changeset
  4832
    "Modified: 12.5.1996 / 15:49:24 / cg"
64
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4833
! !
af7aeb79b25e Initial revision
claus
parents:
diff changeset
  4834
1849
9f35f5934ec6 added msdos-latin1 en/decoder (code = #msdos850)
Claus Gittinger <cg@exept.de>
parents: 1669
diff changeset
  4835
!CharacterArray class methodsFor:'documentation'!
629
2ceefe9b5a19 version at the end
Claus Gittinger <cg@exept.de>
parents: 608
diff changeset
  4836
2ceefe9b5a19 version at the end
Claus Gittinger <cg@exept.de>
parents: 608
diff changeset
  4837
version
2748
3002ca130a92 checkin from browser
Claus Gittinger <cg@exept.de>
parents: 2747
diff changeset
  4838
    ^ '$Header: /cvs/stx/stx/libbasic/CharacterArray.st,v 1.126 1997-07-04 09:23:39 cg Exp $'
629
2ceefe9b5a19 version at the end
Claus Gittinger <cg@exept.de>
parents: 608
diff changeset
  4839
! !
2728
7569e45a4d15 raise error signal if anything goes wrong while decoding.
Claus Gittinger <cg@exept.de>
parents: 2721
diff changeset
  4840
CharacterArray initialize!