Binary SignWriting

From PuddleNet

Revision as of 11:02, 18 January 2010 by 207.32.43.33 (Talk)
Jump to: navigation, search

Binary SignWriting is an encoding model for sign language data. It handles the specific requirements of a spatial script combined with the unique features of SignWriting.


Contents

Status of the work

This document is a prerelease of version 2 of Binary SignWriting. Stable release with 2 version of character encoding: BSW and UTF-8.


Repertoire

Name Tokens BSW Range UTF-8 Range Notes
7 bit ASCII A 0000 - 00f7 not currently used, but compatibility offers many advantages and no disadvantages.
x-ISWA Control Characters BQLR 0080 - 00ff Special control characters for SignWriting structures
ISWA 2008 wcPs 0100 - f0ff Valerie Sutton hand crafted, organized and named the ISWA 2008 as part of the Sutton-Slevinski collaboration.
x-ISWA Number Characters n f100 - fffe Number range -1919 thru 1919 as characters to avoid collision when parsing


x-ISWA Control Characters

Name Token BSW UTF Notes
SignBox B 0080 A marker for a new canvas in the middle lane
Sequence Marker Q 0081 A marker for a sequence of writing and sorting symbols
Left Lane SignBox L 0082 A marker for a new canvas in the left lane
Right Lane SignBox R 0083 A marker for a new canvas in the right lane

ISWA Token Ranges

Name Token BSW Range UTF Range Notes
Writing symbols, non-centering w 0100-EBBF wo/B980-E61F; A symbol used for spatial writing that is non-centering
Centering symbols c B980-E61F A type of writing symbol that takes precedence for the center of a spatial cluster
Punctuation Symbols P EBC0-ED9F A non-spatial symbol always used by itself in the middle lane
Sorting symbols s EDA0-F09F A non-spatial symbols that can only be used after the Sequence marker


Structures

Name Description Regular expression for token analysis
Cluster SignBox marker with repeating spatial symbols, either writing or centering [LBR]([wc]nn)*
Sequence Sequence maker followed by one or more symbols, either writing, centering, or sorting Q[wcs]+
Punctuation Punctuation is always used by itself P
Sign A cluster with an optional sequence, either left, middle, or right lane [LBR]([wc]nn)*[Q[wcs]+]
SignText A list of signs and punctuation ([LBR]([wc]nn)*(Q[wcs]+)?|P)+

Other

Language Requirements

  • Two way conversion between character code and symbol id.
  • Access to glyphs - symbol image
  • Creation of glyphograms - visual unit of spatially written glyphs
  • Center estimate for glyphs and glyphograms
  • Horizontal stacking of glyphs and glyphograms based on center
  • Proper spacing between writing and punctuation
  • Horizontal off center alignment for lanes
  • Sorting based on sequence data, including special sorting symbols
  • Searching for symbol, BaseSymbol, symbol combination, spatial arrangement, or exact sign match
  • Drag and drop user interface
  • Keyboarding user interface
  • Special Commands for text entry