eXept Software AG Logo

Smalltalk/X Webserver

Documentation of class 'PhoneticStringUtilities::MiracodeStringComparator':

Home

Documentation
www.exept.de
Everywhere
for:
[back]

Class: MiracodeStringComparator (private in PhoneticStringUtilities

This class is only visible from within PhoneticStringUtilities.

Inheritance:

   Object
   |
   +--PhoneticStringUtilities::PhoneticStringComparator
      |
      +--PhoneticStringUtilities::SingleResultPhoneticStringComparator
         |
         +--PhoneticStringUtilities::SoundexStringComparator
            |
            +--PhoneticStringUtilities::MiracodeStringComparator

Package:
stx:libbasic2
Category:
Collections-Text-Support
Owner:
PhoneticStringUtilities

Description:


Miracode (also called << American Soundex >>) is like Soundex with the 
addition that h and w are discarded if they separate consonants.

These variants may be specifically important because they were used in 
U.S. National Archives. 
Most archive data were encoded with Miracode, 
but there are some (older) entries encoded with Simplified Soundex. 

The HW-rule was documented as a standard in 1910, 
but actually data of 1880, 1900 and 1910 
censuses were encoded with mixed methods.

 self new encode:'washington' -> 'W252'
 self new encode:'lee'        -> 'L000'
 self new encode:'Gutierrez'  -> 'G362'
 self new encode:'Pfister'    -> 'P236'
 self new encode:'Jackson'    -> 'J250'
 self new encode:'Tymczak'    -> 'T522'

notice:
 MiracodeStringComparator new encode:'Ashcraft' -> 'A261'
 SoundexStringComparator new encode:'Ashcraft'  -> 'A226'

see also:            
    https://www.archives.gov/research/census/soundex.html


Instance protocol:

private
o  encode: word
same as inherited, but cares for W and H



ST/X 7.2.0.0; WebServer 1.670 at bd0aa1f87cdd.unknown:8081; Fri, 26 Apr 2024 22:17:15 GMT