Unicode32String.st
author Claus Gittinger <cg@exept.de>
Wed, 25 Jan 2012 12:00:10 +0100
changeset 2709 0d45ae840f12
parent 2708 20400249a530
child 2740 d4a984bff73c
permissions -rw-r--r--
backward compatibility storestring
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
1404
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
     1
"
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
     2
 COPYRIGHT (c) 2004 by eXept Software AG 
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
     3
              All Rights Reserved
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
     4
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
     5
 This software is furnished under a license and may be used
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
     6
 only in accordance with the terms of that license and with the
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
     7
 inclusion of the above copyright notice.   This software may not
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
     8
 be provided or otherwise made available to, or used by, any
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
     9
 other person.  No title to or ownership of the software is
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    10
 hereby transferred.
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    11
"
1411
1cb88c3dbd22 encoding now unicode
Claus Gittinger <cg@exept.de>
parents: 1404
diff changeset
    12
"{ Package: 'stx:libbasic2' }"
1404
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    13
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    14
FourByteString variableLongSubclass:#Unicode32String
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    15
	instanceVariableNames:''
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    16
	classVariableNames:''
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    17
	poolDictionaries:''
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    18
	category:'Collections-Text'
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    19
!
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    20
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    21
!Unicode32String class methodsFor:'documentation'!
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    22
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    23
copyright
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    24
"
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    25
 COPYRIGHT (c) 2004 by eXept Software AG 
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    26
              All Rights Reserved
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    27
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    28
 This software is furnished under a license and may be used
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    29
 only in accordance with the terms of that license and with the
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    30
 inclusion of the above copyright notice.   This software may not
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    31
 be provided or otherwise made available to, or used by, any
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    32
 other person.  No title to or ownership of the software is
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    33
 hereby transferred.
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    34
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    35
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    36
"
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    37
!
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    38
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    39
documentation
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    40
"
1573
dec4f78e857b *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 1460
diff changeset
    41
    Not yet fully finished - unicode support is still being implemented.
1404
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    42
"
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    43
! !
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    44
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    45
!Unicode32String class methodsFor:'initialization'!
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    46
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    47
initialize
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    48
    "initialize the class - private"
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    49
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    50
    self flags:(Behavior flagLongs)
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    51
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    52
    "
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    53
     Unicode32String initialize
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    54
    "
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    55
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    56
    "Created: 30.6.1997 / 15:35:52 / cg"
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    57
    "Modified: 30.6.1997 / 15:39:21 / cg"
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    58
! !
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    59
2052
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
    60
!Unicode32String class methodsFor:'reading'!
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
    61
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
    62
readFrom:aStreamOrString onError:exceptionBlock
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
    63
    "read & return the next String from the (character-)stream aStream;
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
    64
     skipping all whitespace first; return the value of exceptionBlock,
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
    65
     if no string can be read. The sequence of characters as read from the
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
    66
     stream must be one as stored via storeOn: or storeString."
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
    67
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
    68
    "
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
    69
     this method is not to be inherited
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
    70
     (i.e. not ok for subclasses; Symbol, for example)
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
    71
    "
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
    72
    self ~~ Unicode32String ifTrue:[
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
    73
        ^ super readFrom:aStreamOrString onError:exceptionBlock
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
    74
    ].
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
    75
    ^ self readSmalltalkStringFrom:aStreamOrString onError:exceptionBlock
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
    76
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
    77
    "
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
    78
        self readFrom:'abcäöü' storeString
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
    79
        String readFrom:'abcäöü' storeString
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
    80
    "
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
    81
! !
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
    82
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
    83
1404
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    84
!Unicode32String methodsFor:'conversion'!
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    85
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    86
asSymbolIfInterned
1605
893af2a9f99e *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 1573
diff changeset
    87
    "If a symbol with the receivers characters is already known, return it. Otherwise, return nil. 
893af2a9f99e *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 1573
diff changeset
    88
     Because ST/X does not support non-8-bit symbols, this method
1573
dec4f78e857b *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 1460
diff changeset
    89
     has been redefined to only return a symbol, if the receiver does NOT contain
dec4f78e857b *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 1460
diff changeset
    90
     any non-8 bit characters."
dec4f78e857b *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 1460
diff changeset
    91
1404
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    92
    |s|
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    93
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    94
    Error catch:[
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    95
        s := self asSingleByteString.
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    96
    ].
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    97
    s isNil ifTrue:[^ s].
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    98
    ^ s asSymbolIfInterned
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
    99
!
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
   100
1443
e971d86d6259 Define #asUnicode16String and #asUnicode32String
Stefan Vogel <sv@exept.de>
parents: 1411
diff changeset
   101
asUnicode32String
1573
dec4f78e857b *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 1460
diff changeset
   102
    "as the receiver already is a unicode-32 string, return it"
dec4f78e857b *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 1460
diff changeset
   103
1443
e971d86d6259 Define #asUnicode16String and #asUnicode32String
Stefan Vogel <sv@exept.de>
parents: 1411
diff changeset
   104
    ^ self
e971d86d6259 Define #asUnicode16String and #asUnicode32String
Stefan Vogel <sv@exept.de>
parents: 1411
diff changeset
   105
!
e971d86d6259 Define #asUnicode16String and #asUnicode32String
Stefan Vogel <sv@exept.de>
parents: 1411
diff changeset
   106
1404
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
   107
asUnicodeString
1573
dec4f78e857b *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 1460
diff changeset
   108
    "as the receiver already is a unicode string, return it"
dec4f78e857b *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 1460
diff changeset
   109
1404
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
   110
    ^ self
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
   111
!
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
   112
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
   113
printOn:aStream
2052
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
   114
    "print the receiver on aStream. 
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
   115
     Let aStream decide how to represent this, wether utf8, ucs16, ..."
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
   116
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
   117
    aStream nextPutAllUnicode:self
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
   118
! !
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
   119
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
   120
!Unicode32String methodsFor:'printing & storing'!
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
   121
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
   122
storeOn:aStream
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
   123
    "put the storeString of myself on aStream"
1573
dec4f78e857b *** empty log message ***
Claus Gittinger <cg@exept.de>
parents: 1460
diff changeset
   124
2664
8c8745eac73d unicode storestring fixed
Claus Gittinger <cg@exept.de>
parents: 2052
diff changeset
   125
    (self utf8Encoded storeOn:aStream).
8c8745eac73d unicode storestring fixed
Claus Gittinger <cg@exept.de>
parents: 2052
diff changeset
   126
    aStream nextPutAll:' utf8Decoded'.
8c8745eac73d unicode storestring fixed
Claus Gittinger <cg@exept.de>
parents: 2052
diff changeset
   127
"/    aStream nextPut:$'.
8c8745eac73d unicode storestring fixed
Claus Gittinger <cg@exept.de>
parents: 2052
diff changeset
   128
"/    (self includes:$') ifTrue:[
8c8745eac73d unicode storestring fixed
Claus Gittinger <cg@exept.de>
parents: 2052
diff changeset
   129
"/        self do:[:thisChar |
8c8745eac73d unicode storestring fixed
Claus Gittinger <cg@exept.de>
parents: 2052
diff changeset
   130
"/            (thisChar == $') ifTrue:[aStream nextPut:thisChar].
8c8745eac73d unicode storestring fixed
Claus Gittinger <cg@exept.de>
parents: 2052
diff changeset
   131
"/            aStream nextPutUnicode:thisChar
8c8745eac73d unicode storestring fixed
Claus Gittinger <cg@exept.de>
parents: 2052
diff changeset
   132
"/        ]
8c8745eac73d unicode storestring fixed
Claus Gittinger <cg@exept.de>
parents: 2052
diff changeset
   133
"/    ] ifFalse:[
8c8745eac73d unicode storestring fixed
Claus Gittinger <cg@exept.de>
parents: 2052
diff changeset
   134
"/        aStream nextPutAllUnicode:self
8c8745eac73d unicode storestring fixed
Claus Gittinger <cg@exept.de>
parents: 2052
diff changeset
   135
"/    ].
8c8745eac73d unicode storestring fixed
Claus Gittinger <cg@exept.de>
parents: 2052
diff changeset
   136
"/    aStream nextPut:$'
8c8745eac73d unicode storestring fixed
Claus Gittinger <cg@exept.de>
parents: 2052
diff changeset
   137
8c8745eac73d unicode storestring fixed
Claus Gittinger <cg@exept.de>
parents: 2052
diff changeset
   138
    "Modified: / 28-09-2011 / 16:18:43 / cg"
2052
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
   139
!
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
   140
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
   141
storeString
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
   142
    "return a String for storing myself"
80b8a0c74063 Unicode handling:
Stefan Vogel <sv@exept.de>
parents: 2048
diff changeset
   143
2664
8c8745eac73d unicode storestring fixed
Claus Gittinger <cg@exept.de>
parents: 2052
diff changeset
   144
"/    ^ self basicStoreString.
2709
0d45ae840f12 backward compatibility storestring
Claus Gittinger <cg@exept.de>
parents: 2708
diff changeset
   145
"/    ^ (self withCEscapes storeString),' withoutCEscapes'.
0d45ae840f12 backward compatibility storestring
Claus Gittinger <cg@exept.de>
parents: 2708
diff changeset
   146
    ^ (self utf8Encoded storeString),' utf8Decoded'.
2664
8c8745eac73d unicode storestring fixed
Claus Gittinger <cg@exept.de>
parents: 2052
diff changeset
   147
2709
0d45ae840f12 backward compatibility storestring
Claus Gittinger <cg@exept.de>
parents: 2708
diff changeset
   148
    "Modified: / 25-01-2012 / 11:59:26 / cg"
1404
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
   149
! !
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
   150
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
   151
!Unicode32String class methodsFor:'documentation'!
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
   152
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
   153
version
2709
0d45ae840f12 backward compatibility storestring
Claus Gittinger <cg@exept.de>
parents: 2708
diff changeset
   154
    ^ '$Header: /cvs/stx/stx/libbasic2/Unicode32String.st,v 1.12 2012-01-25 11:00:10 cg Exp $'
0d45ae840f12 backward compatibility storestring
Claus Gittinger <cg@exept.de>
parents: 2708
diff changeset
   155
!
0d45ae840f12 backward compatibility storestring
Claus Gittinger <cg@exept.de>
parents: 2708
diff changeset
   156
0d45ae840f12 backward compatibility storestring
Claus Gittinger <cg@exept.de>
parents: 2708
diff changeset
   157
version_CVS
0d45ae840f12 backward compatibility storestring
Claus Gittinger <cg@exept.de>
parents: 2708
diff changeset
   158
    ^ '$Header: /cvs/stx/stx/libbasic2/Unicode32String.st,v 1.12 2012-01-25 11:00:10 cg Exp $'
1404
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
   159
! !
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
   160
776eb9467dc3 initial checkin
Claus Gittinger <cg@exept.de>
parents:
diff changeset
   161
Unicode32String initialize!