MSW:Spelling Normalization

12. Spelling Normalization
Normalization is the process of bringing something into line with a standard that a community has accepted.

SignWriting raises two issues for spelling normalization: the visual appearance, which is a 2-dimensional arrangement, and the formal string, which is a 1-dimensional sequence. With SignWriting it is possible for several different strings to have exactly the same 2-dimensional visual appearance. For complete normalization, a spelling standard must therefore specify both the visual appearance and the formal string.
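As a minimal sketch of how two strings can share one appearance, the Python below compares the placed symbols of two signs while ignoring the order in which they were written. The string layout it assumes (a box marker and coordinate, followed by 6-character symbol keys each with an "XXXxYYY" position) and the helper names `layout` and `same_appearance` are assumptions made for illustration, and the sample strings are illustrative rather than taken from a dictionary. The sketch also ignores the box marker and the drawing order of overlapping symbols.

```python
import re

# Assumed layout of a placed symbol: a 6-character key ("S" plus five
# characters) followed directly by its position written as "XXXxYYY".
SYMBOL = re.compile(r"(S[123][0-9a-f]{2}[0-5][0-9a-f])(\d{3})x(\d{3})")

def layout(sign: str) -> list:
    """Return the placed symbols as a sorted list of (key, x, y) tuples."""
    return sorted((key, int(x), int(y)) for key, x, y in SYMBOL.findall(sign))

def same_appearance(a: str, b: str) -> bool:
    """True when two strings place the same symbols at the same positions."""
    return layout(a) == layout(b)

# Illustrative strings: the same four symbols, written in opposite order.
first = "M518x533S1870a489x515S18701482x490S20500508x496S2e734500x468"
second = "M518x533S2e734500x468S20500508x496S18701482x490S1870a489x515"

print(first == second)                 # False: the 1-dimensional strings differ
print(same_appearance(first, second))  # True: the 2-dimensional layouts agree
```

Sorting the symbols here is only a device for comparison. A spelling standard would still have to state which symbol order the formal string should use.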

12.A. Unordered and Inexact
Proper spelling with SignWriting is primarily concerned with the 2-dimensional visual appearance of a written sign. Because the writer can start with any symbol and can position each symbol precisely, spelling is considered unordered and inexact: the symbol order is free, and small differences in position are expected.

Artificial rules can be created that limit the variability, but none are universal or inherent in the SignWriting script.

12.B. Exact Spelling
Unless two writers use the same editor with restricted rules, it is unlikely that they will produce exactly the same spelling for any sign. The more complicated the sign, the lower the chance of an exact match.

Exact spellings are only possible if two writers normalize to the same standard.

12.C. Reflected Statistics
There are two ways to arrive at a standard of normal spelling: a dictate from an accepted authority, or an analysis of a large and coherent body of writing.

When considering any spelling, three statistics can be helpful: strength, association, and displacement.

The strength of a spelling is evident in the number of times the exact spelling has been used in a body of writing. A query search with a variance of “0” can be used to determine the strength of a spelling.

The association of a spelling is evident in the number of approximate spelling matches. A standard query search can be used to determine the association of a spelling.
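Strength and association can be illustrated with a small Python sketch that counts matches in a list of sign strings. This is not the query search itself: the matching rule (same number of symbols, each paired with a symbol of the same key within `variance` units on both axes), the variance of 5 used for the approximate search, the function names, and the sample corpus are all assumptions made for illustration.

```python
import re

# Assumed layout of a placed symbol: 6-character key, then "XXXxYYY".
SYMBOL = re.compile(r"(S[123][0-9a-f]{2}[0-5][0-9a-f])(\d{3})x(\d{3})")

def symbols(sign):
    """Parse a sign string into a list of (key, x, y) tuples."""
    return [(k, int(x), int(y)) for k, x, y in SYMBOL.findall(sign)]

def matches(query, candidate, variance):
    """True if every query symbol pairs one-to-one with a candidate symbol
    of the same key whose x and y each differ by at most `variance`."""
    q, c = symbols(query), symbols(candidate)
    if len(q) != len(c):
        return False

    def pair(i, used):
        # Backtracking search so that each candidate symbol is used once.
        if i == len(q):
            return True
        key, x, y = q[i]
        for j, (ck, cx, cy) in enumerate(c):
            if j in used or ck != key:
                continue
            if abs(cx - x) <= variance and abs(cy - y) <= variance:
                if pair(i + 1, used | {j}):
                    return True
        return False

    return pair(0, frozenset())

def count_matches(spelling, corpus, variance):
    """Number of signs in the corpus that match the spelling."""
    return sum(matches(spelling, sign, variance) for sign in corpus)

# Illustrative corpus of four written signs.
corpus = [
    "M518x529S14c20481x471S27106503x489",
    "M518x529S27106503x489S14c20481x471",  # same layout, different order
    "M519x530S14c20483x472S27106504x490",  # each symbol shifted slightly
    "M518x529S14c20481x471",               # one symbol missing
]
spelling = corpus[0]

strength = count_matches(spelling, corpus, variance=0)     # exact matches
association = count_matches(spelling, corpus, variance=5)  # approximate matches
print(strength, association)  # 2 3
```

In this sketch the association count includes the exact matches, so association is never smaller than strength.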

The displacement of a spelling is found with the displacement search method. A displacement search reports the number of times a spelling was used with a different relation to the center of the signbox. Displacement usually arises with combination signs, where two signs are written together.
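One way to picture displacement is as a uniform shift: every symbol of the spelling appears in another sign, moved by the same amount. The Python sketch below counts such signs under that reading. It is an assumption-laden illustration, not the displacement search method itself: the string layout, the function names `offsets` and `displacement`, and the sample corpus are hypothetical, and the candidate sign is allowed to contain extra symbols so that combination signs are covered.

```python
import re

# Assumed layout of a placed symbol: 6-character key, then "XXXxYYY".
SYMBOL = re.compile(r"(S[123][0-9a-f]{2}[0-5][0-9a-f])(\d{3})x(\d{3})")

def symbols(sign):
    """Parse a sign string into a list of (key, x, y) tuples."""
    return [(k, int(x), int(y)) for k, x, y in SYMBOL.findall(sign)]

def offsets(spelling, candidate):
    """Every (dx, dy) shift that maps all symbols of the spelling onto
    symbols of the candidate. The candidate may hold extra symbols."""
    s, c = symbols(spelling), set(symbols(candidate))
    if not s:
        return set()
    first_key, x0, y0 = s[0]
    found = set()
    for key, cx, cy in c:
        if key != first_key:
            continue
        dx, dy = cx - x0, cy - y0
        if all((k, x + dx, y + dy) in c for k, x, y in s):
            found.add((dx, dy))
    return found

def displacement(spelling, corpus):
    """Number of signs that contain the spelling at a nonzero shift."""
    return sum(1 for sign in corpus if offsets(spelling, sign) - {(0, 0)})

# Illustrative corpus.
corpus = [
    "M518x529S14c20481x471S27106503x489",                # not shifted
    "M530x529S14c20493x471S27106515x489",                # shifted right by 12
    "M540x545S14c20461x456S27106483x474S15a10500x500",   # shifted, extra symbol
    "M518x529S14c20481x471S27106510x489",                # one symbol moved alone
]
spelling = corpus[0]

print(displacement(spelling, corpus))  # 2
```

The last sign is not counted because only one of its symbols moved, which changes the arrangement itself rather than its relation to the center.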

For spelling normalization, the reflected statistics can provide a guide when choosing one spelling over another.

12.D. Symbol Subsets
The ISWA is a huge set of symbols, and no language will use every one of them. As with reflected spelling statistics, a body of writing can be analyzed for the symbols that have been used. Reflected symbol statistics can provide a guide to the norms within a community. If the writer is offered a symbol subset rather than the entire ISWA, the symbol subset can become self-reinforcing and aid in spelling normalization.
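Reflected symbol statistics can be sketched as a simple frequency count over a body of writing. The Python below assumes each placed symbol is written as a 6-character key followed by an "XXXxYYY" position; the function names, the `min_count` threshold, and the sample corpus are hypothetical choices for illustration.

```python
import re
from collections import Counter

# Match a symbol key only where a position follows it, so that only
# symbols placed in the signbox are counted.
PLACED_KEY = re.compile(r"S[123][0-9a-f]{2}[0-5][0-9a-f](?=\d{3}x\d{3})")

def symbol_counts(corpus):
    """Count how often each symbol key is used across the corpus."""
    counts = Counter()
    for sign in corpus:
        counts.update(PLACED_KEY.findall(sign))
    return counts

def symbol_subset(corpus, min_count=1):
    """Keys used at least `min_count` times: a candidate symbol subset."""
    return {k for k, n in symbol_counts(corpus).items() if n >= min_count}

# Illustrative corpus of three written signs.
corpus = [
    "M518x529S14c20481x471S27106503x489",
    "M518x533S1870a489x515S18701482x490S20500508x496S2e734500x468",
    "M518x529S14c20481x471S20500508x496",
]

counts = symbol_counts(corpus)
print(counts.most_common(2))
print(sorted(symbol_subset(corpus, min_count=2)))  # ['S14c20', 'S20500']
```

Raising `min_count` narrows the subset to symbols the community uses often, while a value of 1 keeps every symbol that has appeared at all.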