Revision as of 12:10, 8 September 2020, by rosettacode>Gerard Schildberger (minor edit; → {{header|REXX}}: added/changed whitespace and comments, used a template for the output section.)
Revision as of 12:15, 8 September 2020, by rosettacode>Gerard Schildberger (minor edit; added whitespace.)
{{draft task|text processing}}
{{wikipedia}}

The [[wp:New York State Identification and Intelligence System|New York State Identification and Intelligence System phonetic code]], commonly known as NYSIIS, is a phonetic algorithm for creating indices for words based on their pronunciation.

The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.

;Task:
Implement the original NYSIIS algorithm, shown in Wikipedia, rather than any other subsequent modification.

Also, before the algorithm is applied, the input string should be converted to upper case with all white space removed.

An optional step is to handle multiple names, including double-barrelled names or double surnames (e.g. 'Hoyle-Johnson' or 'Vaughan Williams') and unnecessary suffixes/honours that are not required for indexing purposes (e.g. 'Jnr', 'Sr', 'III', etc.); a small selection will suffice.

The original implementation is also restricted to six characters, but this is not a requirement.

;See also
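
The preprocessing described in the task (upper-casing, white-space removal, the optional handling of multiple names and suffixes, and the optional six-character limit) could be sketched roughly as follows. This is only an illustration of the steps around the encoder, not the NYSIIS algorithm itself. The helper names <code>normalize</code>, <code>split_names</code> and <code>truncate</code>, and the contents of <code>SUFFIXES</code>, are assumptions made for this sketch and do not come from the task or from Wikipedia.

```python
import re

# Hypothetical "small selection" of suffixes/honours to discard (an assumption).
SUFFIXES = frozenset({"JNR", "JR", "SNR", "SR", "II", "III", "IV"})


def normalize(name):
    """Upper-case the input and remove all white space, as the task requires."""
    return "".join(name.upper().split())


def split_names(raw):
    """Optional step: split an entry into separate names and drop suffixes.

    Both white space and hyphens are treated as separators here, so each
    part of a double-barrelled name or double surname is returned on its
    own. That choice is an assumption of this sketch.
    """
    words = [w.rstrip(".") for w in re.split(r"[\s-]+", raw.upper()) if w]
    return [w for w in words if w and w not in SUFFIXES]


def truncate(code, limit=6):
    """Apply the original six-character limit; pass limit=None to skip it."""
    return code if limit is None else code[:limit]
```

Each name returned by <code>split_names</code> would then be passed to the encoder separately, with <code>truncate</code> applied to each resulting code only if the six-character restriction is wanted.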

NYSIIS: Difference between revisions
