eXept Software AG Logo

Smalltalk/X Webserver

Documentation of class 'TwoByteString':

Home

Documentation
www.exept.de
Everywhere
for:
[back]

Class: TwoByteString


Inheritance:

   Object
   |
   +--Collection
      |
      +--SequenceableCollection
         |
         +--ArrayedCollection
            |
            +--UninterpretedBytes
               |
               +--CharacterArray
                  |
                  +--TwoByteString
                     |
                     +--BIG5EncodedString
                     |
                     +--GBEncodedString
                     |
                     +--JISEncodedString
                     |
                     +--KSCEncodedString
                     |
                     +--TwoByteSymbol
                     |
                     +--Unicode16String

Package:
stx:libbasic
Category:
Collections-Text
Version:
rev: 1.48 date: 2019/03/27 13:14:46
user: cg
file: TwoByteString.st directory: libbasic
module: stx stc-classLibrary: libbasic
Author:
Claus Gittinger

Description:


TwoByteStrings are like strings, but storing 16bits per character.
The integration of them into the system is not completed ....


Related information:

    Text
    JISEncodedString
    StringCollection

Class protocol:

initialization
o  initialize
initialize the class - private

usage example(s):

     TwoByteString initialize

instance creation
o  basicNew: anInteger
return a new empty string with anInteger number of characters

o  uninitializedNew: anInteger
return a new empty string with anInteger characters

usage example(s):

        self uninitializedNew:10


Instance protocol:

accessing
o  basicAt: index
return the character at position index, an Integer
- reimplemented here since we return 16-bit characters

o  basicAt: index put: aCharacter
store the argument, aCharacter at position index, an Integer.
Returns aCharacter (sigh).
- reimplemented here since we store 16-bit characters

o  unsignedShortAt: index
return the short at position index, an Integer

filling and replacing
o  from: start to: stop put: aCharacter
fill part of the receiver with aCharacter.
- reimplemented here for speed

usage example(s):

     (Unicode16String new:10) from:1 to:10 put:$a
     (Unicode16String new:20) from:10 to:20 put:$b
     (Unicode16String new:20) from:1 to:10 put:$c
     (Unicode16String new:20) from:1 to:10 put:$c 
     (Unicode16String new:100) from:2 to:99 put:$c 

     (Unicode16String new:10) from:0 to:9 put:$a
     (Unicode16String new:10) from:1 to:11 put:$a

o  replaceFrom: start to: stop with: aString startingAt: repStart
replace the characters starting at index start, anInteger and ending
at stop, anInteger with characters from aString starting at repStart.
Return the receiver.

- reimplemented here for speed

usage example(s):

     'hello world' asUnicode16String replaceFrom:1 to:5 with:'123456' startingAt:2
     'hello world' asUnicode16String replaceFrom:1 to:5 with:'123456' asUnicode16String startingAt:2
     'hello world' asUnicode16String replaceFrom:1 to:0 with:'123456' startingAt:2
     'hello' asUnicode16String replaceFrom:1 to:6 with:'123456' startingAt:2
     'hello world' asUnicode16String replaceFrom:1 to:1 with:'123456' startingAt:2

queries
o  bitsPerCharacter
return the number of bits each character has.
Here, 16 is returned (storing double byte characters).

o  bytesPerCharacter
return the number of bytes each character has.
Here, 2 is returned (storing double byte characters).

o  characterSize
answer the size in bits of my largest character (actually only 7, 8 or 16)

usage example(s):

     'hello world' asUnicode16String characterSize
     'hello worldŁšŲ' asUnicode16String characterSize
     'a' asUnicode16String characterSize
     'Ł' asUnicode16String characterSize
     'aa' asUnicode16String characterSize
     'aŁ' asUnicode16String characterSize
     'aaa' asUnicode16String characterSize
     'aaŁ' asUnicode16String characterSize
     'aaaŁ' asUnicode16String characterSize
     'aaaa' asUnicode16String characterSize
     'aaaaŁ' asUnicode16String characterSize

o  containsNon7BitAscii
return true, if the underlying string contains 8BitCharacters (or widers)
(i.e. if it is non-ascii)

usage example(s):

     'hello world' asUnicode16String containsNon7BitAscii
     'hello worldŁšŲ' asUnicode16String containsNon7BitAscii
     'Ł' asUnicode16String containsNon7BitAscii
     'aŁ' asUnicode16String containsNon7BitAscii
     'aaŁ' asUnicode16String containsNon7BitAscii
     'aaaŁ' asUnicode16String containsNon7BitAscii
     'aaaaŁ' asUnicode16String containsNon7BitAscii
     'aaaaa' asUnicode16String containsNon7BitAscii

o  isWideString
true if I require more than one byte per character

o  occurrencesOf: aCharacter
count the occurrences of the argument, aCharacter in myself
- reimplemented here for speed

usage example(s):

     'hello world' occurrencesOf:$a
     'hello world' occurrencesOf:$w
     'hello world' occurrencesOf:$l
     'hello world' occurrencesOf:$x
     'hello world' occurrencesOf:1
     Time millisecondsToRun:[
        |s|
        
        s := 'abcdefghijklmn' asUnicode16String. 
        1000000 timesRepeat:[ s occurrencesOf:$x ]
     ]. 60 60 60 70 (untuned: 670 710 740)

testing
o  isSingleByteCollection
return true, if the receiver has access methods for bytes;
i.e. #at: and #at:put: accesses a byte and are equivalent to #byteAt: and byteAt:put:
and #replaceFrom:to: is equivalent to #replaceBytesFrom:to:.
false is returned here since at: returns 2-byte characters and not bytes
- the method is redefined from UninterpretedBytes.



ST/X 7.2.0.0; WebServer 1.670 at bd0aa1f87cdd.unknown:8081; Fri, 16 Apr 2021 10:26:52 GMT