4 NAME "KSC5601HANGUL/UCS"
5 SRC_ZONE 0x24-0x48 / 0x21-0x7E / 8
12 # This mapping data is made from the mapping data provided by Unicode, Inc.
15 # Name: Unified Hangul (KS X 1001) to Unicode table
16 # Unicode version: 2.0
18 # Table format: Format A
20 # Authors: Jungshik Shin at jshin@pantheon.yale.edu
23 # This file is provided as-is by Unicode, Inc. (The Unicode Consortium).
24 # No claims are made as to fitness for any particular purpose. No
25 # warranties of any kind are expressed or implied. The recipient
26 # agrees to determine applicability of information provided. If this
27 # file has been provided on magnetic media by Unicode, Inc., the sole
28 # remedy for any claim will be exchange of defective media within 90
31 # Recipient is granted the right to make copies in any form for
32 # internal distribution and to freely use the information supplied
33 # in the creation of products supporting Unicode. Unicode, Inc.
34 # specifically excludes the right to re-distribute this file directly
35 # to third parties or other organizations whether for profit or not.
37 # What is enclosed below is the mapping between KS X 1001(KS C 5601-1987
38 # and Unicode 2.0. It's automatically generated from KSC5601.TXT
39 # (at ftp://ftp.unicode.org/Public/MAPPING/EASTASIA/KSC) which is
40 # actually NOT the mapping between KS X 1001(KS C 5601-1992) and Unicode 2.0
41 # BUT the mapping table between UHC (Microsoft Unified Hangul Code)
42 # and Unicode 2.0. Hence, in this pacakge, I renamed it as UHC.TXT
44 # Please, note that there was a change in naming scheme of
45 # Korean standard for information exchange.
46 # What used to be in KS C 5[6-8]xx are now in KS X xxxx.
47 # See http://pantheon.yale.edu/~jshin/faq/qa8.html for more details.
49 # The Unix command used is
50 # egrep '^0x' < KSC5601.TXT | \
51 # egrep -v '^0x([8-9]...|A0..|..[4-9].|..A0)' | perl tab.pl
53 # where tab.pl is as following
57 # local($euck, $ucs4, @rest) = split;
58 # local($u)=hex($ucs4);
59 # local($k)=hex($euck);
60 # printf ("0x%04X 0x%04X %s\n",$k-0x8080, $u,join(' ',@rest));
63 # Column #1 : KS X 1001(KS C 5601-1992 excluding addtional Hangul
64 # syllables defined for Johab encoding in Annex 3)
66 # Column #2 : the Unicode (in hex as 0xXXXX)
67 # Column #3 : the Unicode name (following a comment sign, '#')
68 # The number of characters enumerated in this table is 8824,
69 # as listed in KS X 1001
71 # The entries are in KS X 1001 order
72 # You can use the following algorithms to convert the hex form
73 # of KS X 1001 to other forms
74 # To get EUC Korean(EUC-KR) code points, add 0x8080.
75 # To get row(Hang) and column(Yol) as used in KS X 1001 manual,
76 # first subtract 0x2020. Then
77 # the high and low bytes correspond to the row(Hang) and the column(Yol),