Unicode
From PuddleNet
The following is the beginning of a proposal for the inclusion of SignWriting in Unicode.
Contents |
[edit] Introduction
Provide an introduction with background on the script, including
- history
- current usage
- number of users
- number of publications
- number of countries in which it is used
[edit] How SignWriting Works
[edit] Symbols
The latest symbol set of SignWriting (the ISWA) defines 37,811 symbols. These symbols are divided into three main types: writing symbols, sequence symbols, and punctuation symbols. Writing symbols are combined on a canvas to create signs. Punctuation symbols are always used alone. Sequence symbols are special use symbols for advance collation. All of the symbols can be accessed via a 6 by 16 symbol palette.
[edit] Canvas
The main structure of SignWriting is the sign: a spatial combination of symbols on a 2 dimensional canvas.
Here's the name for SignWriting in American Sign Language using seven symbols.
Here's another version for SignWriting in American Sign Language using thirteen symbols.
The writer positions the symbols directly on the canvas to create a meaningful whole. No meta data is used to construct the signs. The writer chooses the symbols and the symbol placement, creating a whole that is greater than the sum of it's parts.
[edit] Spelling
There are two general rules for proper spelling. Rule #1: Write the position of contact. Rule #2: Every sign has a center.
Some details of a spelling are assumed or implied. A sign, correctly written, is able to be understood by a human knowledgeable in the target sign language. A written sign does not contain the linguistic information required to create an avatar animation solely based on a computerized analysis.
While the writing system is stable, neither the vocabulary nor the spelling of any sign language is standardized. It will take years and decades to document and evaluate the signs of the various sign languages. The standardization of the script is an important milestone.
[edit] Lanes
SignWriting uses signs in lanes and punctuation. A sign can be placed in any lane: left, middle, or right. A punctuation symbol is always used by itself and always in the middle lane.
This example uses two simple signs in each lane separated by punctuation. The first 2 signs are in the middle lane. The second 2 signs are in the left lane. The last 2 signs are in the right lane.
The signs are aligned based on their centers. The center is determined by one of two methods. The first method is used for signs without head or trunk symbols. The sign are enclosed by a smallest possible rectangle called a bounding box. The center of the sign is the center of the bounding box.
The second method is used for signs with head and trunk symbols. A centering bounding box is defined which only encloses the head and trunk symbols of the sign. The center for such a sign is the center of its centering bounding box.
[edit] Characters
674 characters required for symbol standardization are proposed. These characters break down into two types: base symbol characters and modifier characters. View or download the specific proposed characters.
[edit] BaseSymbol Characters
- 652 BaseSymbol characters
[edit] Modifier Characters
[edit] Symbol Access
To access any of the symbols in the ISWA, a combination of 3 characters is required. Every BaseSymbol has a defined list of valid fills, and rotations. Every combination of a BaseSymbol with a valid fill and rotation modifiers equates to a valid symbol.
[edit] Additional Characters
Additional structural characters should be common to all encoding models; however, since these characters are used for layout, they are not proposed for addition to Unicode at this time.
- Left Lane SignBox
- Middle Lane SignBox
- Right Lane SignBox
- Sequence Marker
[edit] Encoding Model Issues
The symbol set is stable and widely accepted. The main structures of signs and punctuations in lanes are understood. The major unresolved issue with encoding SignWriting deals with layout. While every sign has a defined center, how to define the sign itself is still unresolved.
The main issue of layout is how the writer will use the system. The balance between complexity and usability from the writer's perspective is of primary importance. It must always be remembered that the writer decides the symbols and their placement. Anything else is a different writing system.
The second issue of layout involve comparison. Signs can quickly be scanned for the symbols used; however, the relative position of the symbols require an analysis of the layout. The different layouts offer different approaches for evaluation.
The third issues of layout involves variability. There are two types of variability. The first, inter-personal variability, occurs when writers pick different symbols and different details. Inter-personal variability is part of the writing system that layout can not resolve. The second, intra-personal variability, occurs when writers use the same symbols, but in slightly different positions. With layout choices, it is possible to reduce the intra-personal variability, but this reduction may harm the writing system by imposing too many restrictions on the writer.
A fourth issues of layout involves elegance and beauty. Some may consider one type of layout to be superior to another based on subjective personal opinions. SignWriting is a unique script. The ultimate choice of layout should be based on the writer's experience, comparison, and variability.
Below, the various layout options are identified, with examples provided (where available) for the sign sentence "hello world." which contains 2 signs and 1 punctuation.
[edit] Freeform Layout
With freeform layout, the writer decides what symbols to use and the exact symbol position. The freeform layout offers the greatest flexibility for the writer and the greatest intra-personal variability.
[edit] Cartesian Coordinates
Cartesian coordinates specify X and Y coordinates for the top, left of the symbol glyph. The coordinates of the symbols relate to the center of the canvas. This is the current method for writing. The writer is presented with a canvas and positions each symbol independently. A Unicode Technical Note is planned that will define a SignWriting Cartesian Markup.
Example:B๐ก๐ช๐ช-19,-29๐ฅท๐ช๐ช3,-11 B๐ข๐ช๐ชข-11,12๐ข๐ช๐ช-18,-10๐ค๐ช๐ช6,-7๐งต๐ช๐ช-2,-30 ๐ช๐ช๐ช
[edit] Polar Coordinates
Polar coordinates specify an angle and a distance from the center of the sign to the center of each symbol. The current cartesian data can be exported to this format. A Unicode Technical Note is planned that will define a SignWriting Polar Markup.
Example:B๐ก๐ช๐ช330ยฐ16๐ฅท๐ช๐ช129ยฐ14 B๐ข๐ช๐ชข169ยฐ21๐ข๐ช๐ช252ยฐ6๐ค๐ช๐ช80ยฐ11๐งต๐ช๐ช19ยฐ21 ๐ช๐ช๐ช
[edit] Restricted Layout
It is possible to impose restrictions on symbol placement thereby limiting the intra-personal variability of sign spellings.
[edit] Generic Restrictions
Instead of allowing any coordinates, it may be possible to limit the options. For example, with polar coordinates, only allow specific angles and specific distances. This has not been evaluated.
[edit] Specific Restrictions
It may be possible to perform a statistical analysis of the symbols used to come up with a limited number of attachment points around each symbol and a small list of predefined distances between symbols. This information would be symbol specific and could greatly reduce the intra-personal variability if successfully implemented.
[edit] Non-form Layout
Some would argue that the writer should not determine the form of a sign, but should input linguistic analysis and let the layout/font manager determine the best representation for the written sign. This would change the script from a writing system into computer aided design, requiring concepts that are not part of the script and are not part of the writer's thought processes. The idea would make for an interesting project, but it is not about encoding SignWriting.
[edit] Absolute versus Relative Positioning
Any of the above layout options have two choices for positioning. The freeform layouts defined above use absolute positioning, where the position of each symbol relates to the center of the sign. It may be possible to interrelate the position of the symbols. Relative positioning would create some type of linked list for symbol positions. The position of each symbol would be defined based on the position of the previous symbol in the list. The viability and usability of relative positioning is unknown and has not been investigated.
[edit] Collation
SignWriting performs collation using the SignSpelling Sequence. The SignSpelling Sequence is extra data not contained in the layout data. Each sign can have an optional sequential list of symbols. The symbols in the SignSpelling Sequence are ordered based on a theory of ordering. The predominant theory of ordering uses syllable. The first syllable represents the starting handshapes. The second syllable represents the movement between the first and third syllable. The third syllable represents the ending handshapes. The fourth syllable represents the location. The SignSpelling Sequence can use either writing symbols or sequence symbols, but not punctuation.
The signs are sorted based on the symbol IDs of the symbols in the SignSpelling Sequence. SignSpelling Guidelines 2008.
[edit] Punctuation
Punctuation symbols are always used alone and always in the middle lane. Punctuation BaseSymbols require a fill and rotation modifier to uniquely identify a symbol.
[edit] Line Breaking
Line breaks do not occur in the middle of a sign. Line breaks do not occur before punctuation.
[edit] References
- List of references (handbooks, etc.)
[edit] Examples
- Examples of the script

