Hello world
From PuddleNet
Contents |
[edit] Live parsing
Hello world. consists of 2 signs and one punctuation. As sign language data, Hello world can be encoded as 21 characters.
- 0080 1ce0 f8d4 f8ec 8746 f8ea f8fe 0080 32ea f89e f914 32e1 f897 f8fb 5ec0 f8b1 f901 b3b4 f8a9 f8e5 ec20
[edit] Encodings
4 flavors of the same data for Hello world
[edit] XML
- <sign lane="0">
- <sym x="85" y="109">01-05-001-01-03-01</sym>
- <sym x="107" y="127">02-05-006-01-01-07</sym>
- </sign>
- <sign lane="0">
- <sym x="31" y="149">01-06-002-01-01-11</sym>
- <sym x="24" y="124">01-06-002-01-01-02</sym>
- <sym x="50" y="130">02-01-001-01-01-01</sym>
- <sym x="42" y="102">02-10-003-01-04-05</sym>
- </sign>
- <punc>06-01-001-02-01-01</punc>
This is verbose and requires 357 bytes to store.
[edit] Compact
7392,85,109,34630,107,127
13034,31,149,13025,24,124,24256,50,130,46004,42,102
60448
The compact is a smaller representation of the same data and requires 84 bytes to store.
[edit] Binary
00801ce0f8d4f8ec8746f8eaf8fe 008032eaf89ef91432e1f897f8fb5ec0f8b1f901b3b4f8a9f8e5 ec20
Binary SignWriting uses 2 byte character codes from the x-iswa-2008 coded character set to represent the same data. This requires 42 bytes to store.
[edit] UTF-8
UTF-8 is an encoding for Unicode that uses a string of single byte character codes. The UTF-8 characters above require 4 bytes per character, the same as BSW. Unfortunately, the UTF-8 string requires a tripling of the size to 12 bytes per character when encoded for the URL. The number 127 becomes BSW character "0080", while the UTF-8 becomes "%ED%A3%80%ED".


