Hello world

From PuddleNet

Jump to: navigation, search

Contents

[edit] Live parsing

Hello world. consists of 2 signs and one punctuation. As sign language data, Hello world can be encoded as 35 characters.

0fb 14c 38e 392 4a5 4bd 271 38c 398 4bb 4cf 0fb 187 38c 39c 4c0 4c6 187 38c 393 4b9 4b0 205 38c 392 4d1 4b3 2ef 38c 392 4c9 49c 388 38c 392

[edit] Encodings

4 flavors of the same data for Hello world

[edit] XML

<sign lane="0">
  <sym x="-40" y="-16">14c20</sym>
  <sym x="-18" y="2">27106</sym>
</sign>
<sign lane="0">
  <sym x="-13" y="-7">1870a</sym>
  <sym x="-20" y="-29">18701</sym>
  <sym x="4" y="-26">20500</sym>
  <sym x="-4" y="-49">2ef00</sym>
</sign>
<punc>38800<punc>


This is verbose and requires 270 bytes to store.


[edit] Compact

B,14c,3,1,-40,-16,271,1,7,-18,2
B,187,1,11,-13,-7,187,1,2,-20,-29,205,1,1,4,-26,2ef,1,1,-4,-49
388,1,1


The compact is a smaller representation of the same data and requires 103 bytes to store.


[edit] Binary SignWriting

0fb14c38e3924a54bd27138c3984bb4cf0fb18738c39c4c04c618738c3934b94b020538c3924d14b32ef38c3924c949c38838c392

Binary SignWriting uses 12 bit character codes from the x-iswa-2010 as 3 bytes of hexadecimal characters to represent the same data. This requires 106 bytes to store.


[edit] UTF-8


UTF-8 is an encoding for Unicode that uses a string of single byte character codes. The UTF-8 characters above require 4 bytes per character. This requires 148 bytes to store. Unfortunately, the UTF-8 string requires a tripling of the size to 12 bytes per character when encoded for the URL. So the above example requires 444 bytes when used in the url.




Sign Language Data
Language Script Alphabet User interface Rendering Model
spatial
written
varied
symbol
cluster
center
column
lane
sort
search
name
structure
type
category
token
style
palette
canvas
cursor
transformation
sequence
text
glyph
glyphogram
strip
page
requirements
use cases
repertoire
character
encoding
Personal tools