<?xml version="1.0" encoding="UTF-8"?>

<ValueSet xmlns="http://hl7.org/fhir">
  <id value="v2-0211"/>
  <meta>
    <profile value="http://hl7.org/fhir/StructureDefinition/shareablevalueset"/>
  </meta>
  <language value="en"/>
  <text>
    <status value="additional"/>
    <div xmlns="http://www.w3.org/1999/xhtml">
      <p>Alternate Character Sets</p>

      <table class="grid">
        <tr>
          <td>
            <b>Code</b>
          </td>
          <td>
            <b>Description</b>
          </td>
          <td>
            <b>Comment</b>
          </td>
          <td>
            <b>Version</b>
          </td>
        </tr>
        <tr>
          <td>8859/1
            <a name="8859.471"> </a>
          </td>
          <td>The printable characters from the ISO 8859/1 Character set</td>
          <td/>
          <td>added v2.3</td>
        </tr>
        <tr>
          <td>8859/15
            <a name="8859.4715"> </a>
          </td>
          <td>The printable characters from the ISO 8859/15 (Latin-15)</td>
          <td/>
          <td>added v2.6</td>
        </tr>
        <tr>
          <td>8859/2
            <a name="8859.472"> </a>
          </td>
          <td>The printable characters from the ISO 8859/2 Character set</td>
          <td/>
          <td>added v2.3</td>
        </tr>
        <tr>
          <td>8859/3
            <a name="8859.473"> </a>
          </td>
          <td>The printable characters from the ISO 8859/3 Character set</td>
          <td/>
          <td>added v2.3</td>
        </tr>
        <tr>
          <td>8859/4
            <a name="8859.474"> </a>
          </td>
          <td>The printable characters from the ISO 8859/4 Character set</td>
          <td/>
          <td>added v2.3</td>
        </tr>
        <tr>
          <td>8859/5
            <a name="8859.475"> </a>
          </td>
          <td>The printable characters from the ISO 8859/5 Character set</td>
          <td/>
          <td>added v2.3</td>
        </tr>
        <tr>
          <td>8859/6
            <a name="8859.476"> </a>
          </td>
          <td>The printable characters from the ISO 8859/6 Character set</td>
          <td/>
          <td>added v2.3</td>
        </tr>
        <tr>
          <td>8859/7
            <a name="8859.477"> </a>
          </td>
          <td>The printable characters from the ISO 8859/7 Character set</td>
          <td/>
          <td>added v2.3</td>
        </tr>
        <tr>
          <td>8859/8
            <a name="8859.478"> </a>
          </td>
          <td>The printable characters from the ISO 8859/8 Character set</td>
          <td/>
          <td>added v2.3</td>
        </tr>
        <tr>
          <td>8859/9
            <a name="8859.479"> </a>
          </td>
          <td>The printable characters from the ISO 8859/9 Character set</td>
          <td/>
          <td>added v2.3</td>
        </tr>
        <tr>
          <td>ASCII
            <a name="ASCII"> </a>
          </td>
          <td>The printable 7-bit ASCII character set.</td>
          <td>(This is the default if this field is omitted)</td>
          <td>added v2.3</td>
        </tr>
        <tr>
          <td>BIG-5
            <a name="BIG-5"> </a>
          </td>
          <td>Code for Taiwanese Character Set (BIG-5)</td>
          <td>Does not need an escape sequence.

BIG-5 does not need an escape sequence. ASCII is a 7 bit character set, which means that the top bit of the byte is &#x201C;0&#x201D;. The parser knows that when the top bit of the byte is &#x201C;0&#x201D;, the character set is ASCII. When it is &#x201C;1&#x201D;, the following bytes should be handled as 2 bytes (or more). No escape technique is needed. However, since some servers do not correctly interpret when they receive a top bit &#x201C;1&#x201D;, it is advised, in internet RFC, to not use these kind of non-safe non-escape extension.</td>
          <td>added v2.5</td>
        </tr>
        <tr>
          <td>CNS 11643-1992
            <a name="CNS11643-1992"> </a>
          </td>
          <td>Code for Taiwanese Character Set (CNS 11643-1992)</td>
          <td>Does not need an escape sequence.</td>
          <td>added v2.5</td>
        </tr>
        <tr>
          <td>GB 18030-2000
            <a name="GB18030-2000"> </a>
          </td>
          <td>Code for Chinese Character Set (GB 18030-2000)</td>
          <td>Does not need an escape sequence.</td>
          <td>added v2.5</td>
        </tr>
        <tr>
          <td>ISO IR14
            <a name="ISOIR14"> </a>
          </td>
          <td>Code for Information Exchange (one byte)(JIS X 0201-1976).</td>
          <td>Note that the code contains a space, i.e., &quot;ISO IR14&quot;.</td>
          <td>added v2.3.1</td>
        </tr>
        <tr>
          <td>ISO IR159
            <a name="ISOIR159"> </a>
          </td>
          <td>Code of the supplementary Japanese Graphic Character set for information interchange (JIS X 0212-1990).</td>
          <td>Note that the code contains a space, i.e., &quot;ISO IR159&quot;.</td>
          <td>added v2.3.1</td>
        </tr>
        <tr>
          <td>ISO IR6
            <a name="ISOIR6"> </a>
          </td>
          <td>ASCII graphic character set consisting of 94 characters.</td>
          <td>http://www.itscj.ipsj.or.jp/ISO-IR/006.pdf</td>
          <td>added v2.7</td>
        </tr>
        <tr>
          <td>ISO IR87
            <a name="ISOIR87"> </a>
          </td>
          <td>Code for the Japanese Graphic Character set for information interchange (JIS X 0208-1990),</td>
          <td>Note that the code contains a space, i.e., &#x201C;ISO IR87&#x201D;.

The JIS X 0208 needs an escape sequence. In Japan, the escape technique is ISO 2022. From basic ASCII, escape sequence &#x201C;escape&#x201D; $ B (in HEX, 1B 24 42) lets the parser know that following bytes should be handled 2-byte wise. Back to ASCII is 1B 28 42.</td>
          <td>added v2.3.1</td>
        </tr>
        <tr>
          <td>JAS2020
            <a name="JAS2020"> </a>
          </td>
          <td>A subset of ISO2020 used for most Kanjii transmissions</td>
          <td>deprecated</td>
          <td>added v2.3, removed after v2.3</td>
        </tr>
        <tr>
          <td>JIS X 0202
            <a name="JISX0202"> </a>
          </td>
          <td>ISO 2022 with escape sequences for Kanjii</td>
          <td>deprecated</td>
          <td>added v2.3, removed after v2.3</td>
        </tr>
        <tr>
          <td>KS X 1001
            <a name="KSX1001"> </a>
          </td>
          <td>Code for Korean Character Set (KS X 1001)</td>
          <td/>
          <td>added v2.5</td>
        </tr>
        <tr>
          <td>UNICODE
            <a name="UNICODE"> </a>
          </td>
          <td>The world wide character standard from ISO/IEC 10646-1-1993</td>
          <td>Deprecated. Retained for backward compatibility only as v 2.5. Replaced by specific Unicode encoding codes.</td>
          <td>added v2.3</td>
        </tr>
        <tr>
          <td>UNICODE UTF-16
            <a name="UNICODEUTF-16"> </a>
          </td>
          <td>UCS Transformation Format, 16-bit form</td>
          <td>UTF-16 is identical to ISO/IEC 10646 UCS-2. Note that the code contains a space before UTF but not before and after the hyphen.</td>
          <td>added v2.5, removed after v2.7.1</td>
        </tr>
        <tr>
          <td>UNICODE UTF-32
            <a name="UNICODEUTF-32"> </a>
          </td>
          <td>UCS Transformation Format, 32-bit form</td>
          <td>UTF-32 is defined by Unicode Technical Report #19, and is an officially recognized encoding as of Unicode Version 3.1. UTF-32 is a proper subset of ISO/IEC 10646 UCS-4. Note that the code contains a space before UTF but not before and after the hyphen.</td>
          <td>added v2.5, removed after v2.7.1</td>
        </tr>
        <tr>
          <td>UNICODE UTF-8
            <a name="UNICODEUTF-8"> </a>
          </td>
          <td>UCS Transformation Format, 8-bit form</td>
          <td>UTF-8 is a variable-length encoding, each code value is represented by 1,2 or 3 bytes, depending on the code value. 7 bit ASCII is a proper subset of UTF-8. Note that the code contains a space before UTF but not before and after the hyphen. Since UTF-8 represents the full UNICODE character set, the following restriction apply to its use: 
1. UTF-8 must be the default encoding of the message, UTF-8 cannot be specified as an additional character set in MSH-18
2. There are no other character sets allowed in a message where UTF-8 is the default encoding in the message.
In other words, UNICODE UTF-8 can only be specified as a single value in MSH-18
3. A message encoded in UTF-8 must not use a Byte Order Mark (BOM).</td>
          <td>added v2.5</td>
        </tr>
      </table>

    </div>
  </text>
  <extension url="http://hl7.org/fhir/StructureDefinition/structuredefinition-standards-status">
    <valueString value="External"/>
  </extension>
  <extension url="http://hl7.org/fhir/StructureDefinition/structuredefinition-fmm">
    <valueInteger value="0"/>
  </extension>
  <url value="http://terminology.hl7.org/ValueSet/v2-0211"/>
  <version value="2.9"/>
  <name value="v2.0211"/>
  <title value="v2 Alternate Character Sets"/>
  <status value="active"/>
  <experimental value="false"/>
  <publisher value="HL7, Inc"/>
  <contact>
    <telecom>
      <system value="url"/>
      <value value="http://hl7.org"/>
    </telecom>
  </contact>
  <description value="FHIR Value set/code system definition for HL7 v2 table 0211 ( Alternate Character Sets)"/>
  <immutable value="true"/>
  <compose>
    <include>
      <system value="http://terminology.hl7.org/CodeSystem/v2-0211"/>
    </include>
  </compose>
</ValueSet>