Class: TwoByteString
Object
|
+--Collection
   |
   +--SequenceableCollection
      |
      +--ArrayedCollection
         |
         +--UninterpretedBytes
            |
            +--CharacterArray
               |
               +--TwoByteString
                  |
                  +--BIG5EncodedString
                  |
                  +--GBEncodedString
                  |
                  +--JISEncodedString
                  |
                  +--KSCEncodedString
                  |
                  +--TwoByteSymbol
                  |
                  +--Unicode16String
- Package: stx:libbasic
- Category: Collections-Text
- Version: rev: 1.48 date: 2019/03/27 13:14:46
- user: cg
- file: TwoByteString.st directory: libbasic
- module: stx stc-classLibrary: libbasic
- Author: Claus Gittinger
TwoByteStrings are like Strings, but store 16 bits per character.
Their integration into the system is not yet complete.
Related information:
Text
JISEncodedString
StringCollection
initialization
-
initialize
-
initialize the class - private
instance creation
-
basicNew: anInteger
-
return a new empty string with anInteger number of characters
-
uninitializedNew: anInteger
-
return a new empty string with anInteger characters
accessing
-
basicAt: index
-
return the character at position index, an Integer
- reimplemented here since we return 16-bit characters
-
basicAt: index put: aCharacter
-
store the argument, aCharacter at position index, an Integer.
Returns aCharacter (sigh).
- reimplemented here since we store 16-bit characters
-
unsignedShortAt: index
-
return the short at position index, an Integer
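A minimal sketch of the accessors above, assuming a workspace in which Transcript is available. The variable names and the code point 16r20AC are illustrative only. It shows that basicAt:put: answers the stored character rather than the receiver, and that a character needing more than 8 bits can be stored and read back.

```smalltalk
| s result |
s := Unicode16String new: 2.
"basicAt:put: answers its argument, not the receiver"
result := s basicAt: 1 put: $a.
"store a character that needs more than 8 bits (illustrative code point)"
s basicAt: 2 put: (Character value: 16r20AC).
Transcript showCR: result printString.
Transcript showCR: (s basicAt: 2) asInteger printString.
```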
filling and replacing
-
from: start to: stop put: aCharacter
-
fill part of the receiver with aCharacter.
- reimplemented here for speed
usage example(s):
(Unicode16String new:10) from:1 to:10 put:$a
(Unicode16String new:20) from:10 to:20 put:$b
(Unicode16String new:20) from:1 to:10 put:$c
(Unicode16String new:100) from:2 to:99 put:$c
(Unicode16String new:10) from:0 to:9 put:$a
(Unicode16String new:10) from:1 to:11 put:$a
-
replaceFrom: start to: stop with: aString startingAt: repStart
-
replace the characters starting at index start, anInteger and ending
at stop, anInteger with characters from aString starting at repStart.
Return the receiver.
- reimplemented here for speed
usage example(s):
'hello world' asUnicode16String replaceFrom:1 to:5 with:'123456' startingAt:2
'hello world' asUnicode16String replaceFrom:1 to:5 with:'123456' asUnicode16String startingAt:2
'hello world' asUnicode16String replaceFrom:1 to:0 with:'123456' startingAt:2
'hello' asUnicode16String replaceFrom:1 to:6 with:'123456' startingAt:2
'hello world' asUnicode16String replaceFrom:1 to:1 with:'123456' startingAt:2
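The first example above, worked through as a sketch (variable names are illustrative). Positions 1 to 5 of the receiver are overwritten with five characters of the replacement string, beginning at its index 2, so 'hello' becomes '23456' and the rest of the receiver is left untouched. The receiver itself is answered, as stated in the description.

```smalltalk
| target result |
target := 'hello world' asUnicode16String.
"copy '23456' (source indices 2..6) into target positions 1..5"
result := target replaceFrom: 1 to: 5 with: '123456' startingAt: 2.
Transcript showCR: result.
"the answer is the receiver itself, modified in place"
Transcript showCR: (result == target) printString.
```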
queries
-
bitsPerCharacter
-
return the number of bits each character has.
Here, 16 is returned (storing double byte characters).
-
bytesPerCharacter
-
return the number of bytes each character has.
Here, 2 is returned (storing double byte characters).
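A short sketch of the two queries above; the variable name and the use of Transcript are assumptions for illustration. Both answers are fixed by the storage format and do not depend on the contents of the string.

```smalltalk
| s |
s := 'hello' asUnicode16String.
Transcript showCR: s bitsPerCharacter printString.    "16, as described above"
Transcript showCR: s bytesPerCharacter printString.   "2, as described above"
"bytes needed for the character data alone: 5 characters * 2 bytes"
Transcript showCR: (s size * s bytesPerCharacter) printString.
```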
-
characterSize
-
answer the size in bits of my largest character (actually only 7, 8 or 16)
usage example(s):
'hello world' asUnicode16String characterSize
'hello worldüäö' asUnicode16String characterSize
'a' asUnicode16String characterSize
'ü' asUnicode16String characterSize
'aa' asUnicode16String characterSize
'aü' asUnicode16String characterSize
'aaa' asUnicode16String characterSize
'aaü' asUnicode16String characterSize
'aaaü' asUnicode16String characterSize
'aaaa' asUnicode16String characterSize
'aaaaü' asUnicode16String characterSize
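Unlike bitsPerCharacter, characterSize depends on the contents. The sketch below, with illustrative variable names and an illustrative code point 16r20AC, builds one string for each of the three cases named in the description (7, 8 or 16 bits).

```smalltalk
| ascii latin1 wide |
"only 7-bit characters"
ascii := 'aaa' asUnicode16String.
"contains one character above 127 but below 256"
latin1 := 'aaü' asUnicode16String.
"contains a character above 255 (illustrative code point)"
wide := Unicode16String new: 1.
wide from: 1 to: 1 put: (Character value: 16r20AC).
Transcript showCR: ascii characterSize printString.
Transcript showCR: latin1 characterSize printString.
Transcript showCR: wide characterSize printString.
```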
-
containsNon7BitAscii
-
return true if the underlying string contains 8-bit characters (or wider),
i.e. if it is not pure 7-bit ASCII
usage example(s):
'hello world' asUnicode16String containsNon7BitAscii
'hello worldüäö' asUnicode16String containsNon7BitAscii
'ü' asUnicode16String containsNon7BitAscii
'aü' asUnicode16String containsNon7BitAscii
'aaü' asUnicode16String containsNon7BitAscii
'aaaü' asUnicode16String containsNon7BitAscii
'aaaaü' asUnicode16String containsNon7BitAscii
'aaaaa' asUnicode16String containsNon7BitAscii
-
isWideString
-
true if I require more than one byte per character
-
occurrencesOf: aCharacter
-
count the occurrences of the argument, aCharacter in myself
- reimplemented here for speed
usage example(s):
'hello world' occurrencesOf:$a
'hello world' occurrencesOf:$w
'hello world' occurrencesOf:$l
'hello world' occurrencesOf:$x
'hello world' occurrencesOf:1
Time millisecondsToRun:[
|s|
s := 'abcdefghijklmn' asUnicode16String.
1000000 timesRepeat:[ s occurrencesOf:$x ]
]. "measured times in ms: 60 60 60 70 (untuned: 670 710 740)"
testing
-
isSingleByteCollection
-
return true if the receiver has access methods for bytes,
i.e. #at: and #at:put: access a byte and are equivalent to #byteAt: and #byteAt:put:,
and #replaceFrom:to: is equivalent to #replaceBytesFrom:to:.
false is returned here, since #at: returns 2-byte characters and not bytes
- the method is redefined from UninterpretedBytes.
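A brief sketch of what this means in practice; the variable name is illustrative. Element access on the receiver yields characters, not bytes, which is why the test answers false.

```smalltalk
| s |
s := 'hello' asUnicode16String.
Transcript showCR: s isSingleByteCollection printString.   "false, as described above"
"at: answers a character, not a byte value"
Transcript showCR: (s at: 1) printString.
```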