Unicode

From PuddleNet

Jump to: navigation, search

The following is the beginning of a proposal for the inclusion of SignWriting in Unicode.

Contents

[edit] Introduction

Provide an introduction with background on the script, including

  • history
  • current usage
    • number of users
    • number of publications
    • number of countries in which it is used


[edit] How SignWriting Works

[edit] Symbols

The latest symbol set of SignWriting (the ISWA) defines 37,811 symbols. These symbols are divided into three main types: writing symbols, sequence symbols, and punctuation symbols. Writing symbols are combined on a canvas to create signs. Punctuation symbols are always used alone. Sequence symbols are special use symbols for advance collation. All of the symbols can be accessed via a 6 by 16 symbol palette.

[edit] Canvas

The main structure of SignWriting is the sign: a spatial combination of symbols on a 2 dimensional canvas.

Here's the name for SignWriting in American Sign Language using seven symbols.

Here's another version for SignWriting in American Sign Language using thirteen symbols.

The writer positions the symbols directly on the canvas to create a meaningful whole. No meta data is used to construct the signs. The writer chooses the symbols and the symbol placement, creating a whole that is greater than the sum of it's parts.

[edit] Spelling

There are two general rules for proper spelling. Rule #1: Write the position of contact. Rule #2: Every sign has a center.

Some details of a spelling are assumed or implied. A sign, correctly written, is able to be understood by a human knowledgeable in the target sign language. A written sign does not contain the linguistic information required to create an avatar animation solely based on a computerized analysis.

While the writing system is stable, neither the vocabulary nor the spelling of any sign language is standardized. It will take years and decades to document and evaluate the signs of the various sign languages. The standardization of the script is an important milestone.

[edit] Lanes

SignWriting uses signs in lanes and punctuation. A sign can be placed in any lane: left, middle, or right. A punctuation symbol is always used by itself and always in the middle lane.

This example uses two simple signs in each lane separated by punctuation. The first 2 signs are in the middle lane. The second 2 signs are in the left lane. The last 2 signs are in the right lane.

The signs are aligned based on their centers. The center is determined by one of two methods. The first method is used for signs without head or trunk symbols. The sign are enclosed by a smallest possible rectangle called a bounding box. The center of the sign is the center of the bounding box.

The second method is used for signs with head and trunk symbols. A centering bounding box is defined which only encloses the head and trunk symbols of the sign. The center for such a sign is the center of its centering bounding box.

[edit] Characters

674 characters required for symbol standardization are proposed. These characters break down into two types: base symbol characters and modifier characters. View or download the specific proposed characters.

[edit] BaseSymbol Characters

[edit] Modifier Characters


[edit] Symbol Access

To access any of the symbols in the ISWA, a combination of 3 characters is required. Every BaseSymbol has a defined list of valid fills, and rotations. Every combination of a BaseSymbol with a valid fill and rotation modifiers equates to a valid symbol.


[edit] Additional Characters

Additional structural characters should be common to all encoding models; however, since these characters are used for layout, they are not proposed for addition to Unicode at this time.

  • Left Lane SignBox
  • Middle Lane SignBox
  • Right Lane SignBox
  • Sequence Marker

[edit] Encoding Model Issues

The symbol set is stable and widely accepted. The main structures of signs and punctuations in lanes are understood. The major unresolved issue with encoding SignWriting deals with layout. While every sign has a defined center, how to define the sign itself is still unresolved.

The main issue of layout is how the writer will use the system. The balance between complexity and usability from the writer's perspective is of primary importance. It must always be remembered that the writer decides the symbols and their placement. Anything else is a different writing system.

The second issue of layout involve comparison. Signs can quickly be scanned for the symbols used; however, the relative position of the symbols require an analysis of the layout. The different layouts offer different approaches for evaluation.

The third issues of layout involves variability. There are two types of variability. The first, inter-personal variability, occurs when writers pick different symbols and different details. Inter-personal variability is part of the writing system that layout can not resolve. The second, intra-personal variability, occurs when writers use the same symbols, but in slightly different positions. With layout choices, it is possible to reduce the intra-personal variability, but this reduction may harm the writing system by imposing too many restrictions on the writer.

A fourth issues of layout involves elegance and beauty. Some may consider one type of layout to be superior to another based on subjective personal opinions. SignWriting is a unique script. The ultimate choice of layout should be based on the writer's experience, comparison, and variability.

Below, the various layout options are identified, with examples provided (where available) for the sign sentence "hello world." which contains 2 signs and 1 punctuation.

[edit] Freeform Layout

With freeform layout, the writer decides what symbols to use and the exact symbol position. The freeform layout offers the greatest flexibility for the writer and the greatest intra-personal variability.

[edit] Cartesian Coordinates

Cartesian coordinates specify X and Y coordinates for the top, left of the symbol glyph. The coordinates of the symbols relate to the center of the canvas. This is the current method for writing. The writer is presented with a canvas and positions each symbol independently. A Unicode Technical Note is planned that will define a SignWriting Cartesian Markup.

Example:B๐ก’๐ช”๐ช˜-19,-29๐ฅท๐ช’๐ชž3,-11 B๐ข๐ช’๐ชข-11,12๐ข๐ช’๐ช™-18,-10๐ค‹๐ช’๐ช˜6,-7๐งต๐ช’๐ช˜-2,-30 ๐ชŽ๐ช’๐ช˜

[edit] Polar Coordinates

Polar coordinates specify an angle and a distance from the center of the sign to the center of each symbol. The current cartesian data can be exported to this format. A Unicode Technical Note is planned that will define a SignWriting Polar Markup.

Example:B๐ก’๐ช”๐ช˜330ยฐ16๐ฅท๐ช’๐ชž129ยฐ14 B๐ข๐ช’๐ชข169ยฐ21๐ข๐ช’๐ช™252ยฐ6๐ค‹๐ช’๐ช˜80ยฐ11๐งต๐ช’๐ช˜19ยฐ21 ๐ชŽ๐ช’๐ช˜


[edit] Restricted Layout

It is possible to impose restrictions on symbol placement thereby limiting the intra-personal variability of sign spellings.

[edit] Generic Restrictions

Instead of allowing any coordinates, it may be possible to limit the options. For example, with polar coordinates, only allow specific angles and specific distances. This has not been evaluated.

[edit] Specific Restrictions

It may be possible to perform a statistical analysis of the symbols used to come up with a limited number of attachment points around each symbol and a small list of predefined distances between symbols. This information would be symbol specific and could greatly reduce the intra-personal variability if successfully implemented.

[edit] Non-form Layout

Some would argue that the writer should not determine the form of a sign, but should input linguistic analysis and let the layout/font manager determine the best representation for the written sign. This would change the script from a writing system into computer aided design, requiring concepts that are not part of the script and are not part of the writer's thought processes. The idea would make for an interesting project, but it is not about encoding SignWriting.


[edit] Absolute versus Relative Positioning

Any of the above layout options have two choices for positioning. The freeform layouts defined above use absolute positioning, where the position of each symbol relates to the center of the sign. It may be possible to interrelate the position of the symbols. Relative positioning would create some type of linked list for symbol positions. The position of each symbol would be defined based on the position of the previous symbol in the list. The viability and usability of relative positioning is unknown and has not been investigated.


[edit] Collation

SignWriting performs collation using the SignSpelling Sequence. The SignSpelling Sequence is extra data not contained in the layout data. Each sign can have an optional sequential list of symbols. The symbols in the SignSpelling Sequence are ordered based on a theory of ordering. The predominant theory of ordering uses syllable. The first syllable represents the starting handshapes. The second syllable represents the movement between the first and third syllable. The third syllable represents the ending handshapes. The fourth syllable represents the location. The SignSpelling Sequence can use either writing symbols or sequence symbols, but not punctuation.

The signs are sorted based on the symbol IDs of the symbols in the SignSpelling Sequence. SignSpelling Guidelines 2008.


[edit] Punctuation

Punctuation symbols are always used alone and always in the middle lane. Punctuation BaseSymbols require a fill and rotation modifier to uniquely identify a symbol.


[edit] Line Breaking

Line breaks do not occur in the middle of a sign. Line breaks do not occur before punctuation.


[edit] References

  • List of references (handbooks, etc.)

[edit] Examples

  • Examples of the script
Personal tools