EDIFACT Character Encoding

Encoding Allows
UNOA As defined in ISO 646 (with the exception of letters, lower case a to z).

  • A to Z
  • 0 to 9
  • . , – ( ) / = (space)
UNOB As defined in ISO 646

  • All of UNOA
  • a to z
  • ‘ + : ? ! ” % & * ; < >
UNOC As defined in ISO 8859-1 : Information processing – Part 1: Latin alphabet No. 1.
UNOD As defined in ISO 8859-2 : Information processing – Part 2: Latin alphabet No. 2.
UNOE As defined in ISO 8859-5 : Information processing – Part 5: Latin/Cyrillic alphabet.
UNOF As defined in ISO 8859-7 : Information processing – Part 7: Latin/Greek alphabet.
UNOG As defined in ISO 8859-3 : Information processing – Part 3: Latin alphabet.
UNOH As defined in ISO 8859-4 : Information processing – Part 4: Latin alphabet.
UNOI As defined in ISO 8859-6 : Information processing – Part 6: Latin/Arabic alphabet.
UNOJ As defined in ISO 8859-8 : Information processing – Part 8: Latin/Hebrew alphabet.
UNOK As defined in ISO 8859-9 : Information processing – Part 9: Latin alphabet.
UNOX Code extension technique as defined by ISO 2022 utilizing the escape techniques in accordance with ISO 2375.ISO-2022-JP character set

  • This code page allows the escape techniques in accordance with ISO 2375. The text starts in ASCII and switches to Japanese characters through an escape sequence. The bytes following the escape sequence are encoded in two bytes each
UNOY ISO 10646-1 octet without code extension technique.
KECA A to Z
0 to 9
. , – ( ) / = ! ” % & * ; < >Windows 949 code page

  • Korean Syllables (2350 characters)
  • Korean Hanja (4888 characters)
  • Korean Alphabets
  • Characters and numbers enclosed in a circle
  • The length of the strings are counted by byte instead of characters. So if you have a data element of length 3, you can have 3 latin characters, 1 Korean character or 1 Korean and 1 Latin character!