# $NetBSD: CNS11643-1%UCS.src,v 1.1 2005/03/27 22:30:06 tnozaki Exp $
SRC_ZONE 0x21-0x7E / 0x21-0x7E / 8
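# The SRC_ZONE line above can be read as a validity check on source codes.
# The Python sketch below assumes the three fields are the row range, the
# column range, and the number of bits reserved for the column, so that a
# source code is (row << 8) | column. That reading and the helper name
# in_src_zone are assumptions made for illustration, not part of this file.

```python
# Assumed interpretation of: SRC_ZONE 0x21-0x7E / 0x21-0x7E / 8
ROW_RANGE = (0x21, 0x7E)   # first field: allowed row values
COL_RANGE = (0x21, 0x7E)   # second field: allowed column values
COL_BITS = 8               # third field: bits used by the column


def in_src_zone(code: int) -> bool:
    """Return True if a row/column code lies inside the assumed source zone."""
    row = code >> COL_BITS
    col = code & ((1 << COL_BITS) - 1)
    return (ROW_RANGE[0] <= row <= ROW_RANGE[1]
            and COL_RANGE[0] <= col <= COL_RANGE[1])
```

# Under these assumptions, in_src_zone(0x2721) is true, while a code whose
# column byte is 0x20 or whose row byte is 0x7F falls outside the zone.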
# Name: CNS 11643-1986 to Unicode table (complete)
# Unicode version: 1.1
# Table version: 0.0d1
# Table format: Format A
# Date: 21 October 1994
# Copyright (c) 1991-1994 Unicode, Inc. All Rights reserved.
# This file is provided as-is by Unicode, Inc. (The Unicode Consortium).
# No claims are made as to fitness for any particular purpose. No
# warranties of any kind are expressed or implied. The recipient
# agrees to determine applicability of information provided. If this
# file has been provided on magnetic media by Unicode, Inc., the sole
# remedy for any claim will be exchange of defective media within 90
# days of receipt.
# Recipient is granted the right to make copies in any form for
# internal distribution and to freely use the information supplied
# in the creation of products supporting Unicode. Unicode, Inc.
# specifically excludes the right to re-distribute this file directly
# to third parties or other organizations whether for profit or not.
# This table contains one set of mappings from CNS 11643-1986 into Unicode.
# Note that these data are *possible* mappings only and may not be the
# same as those used by actual products, nor may they be the best suited
# for all uses. For more information on the mappings between various code
# pages incorporating the repertoire of CNS 11643-1986 and Unicode, consult the
# VENDORS mapping data. Normative information on the mapping between
# CNS 11643-1986 and Unicode may be found in the Unihan.txt file in the
# latest Unicode Character Database.
# If you have carefully considered the fact that the mappings in
# this table are only one possible set of mappings between CNS 11643-1986 and
# Unicode and have no normative status, but still feel that you
# have located an error in the table that requires fixing, you may
# report any such error to errata@unicode.org.
# WARNING! It is currently impossible to provide round-trip compatibility
# between CNS 11643-1986 and Unicode.
# (1) Some characters are not currently mapped because
# of conflicts with other mappings. They include the following:
#     CNS       Description                 Comments
#     0x12224   SPACING HEAVY OVERSCORE     not in Unicode
#     0x12226   SPACING HEAVY UNDERSCORE    not in Unicode
# (2) Some characters are not currently mapped because CNS 11643-1986 includes
# duplicate versions of some ideographs which are included in Unicode only
# once. They include the following:
#     0x1243E through 0x12440, HANGZHOU NUMERAL TEN, TWENTY, and THIRTY
#     0x12721 through 0x12939, KangXi radical set
# Entries for these characters are not included in this table.
# 1. There is an uncertainty in the mapping of the CNS 11643-1986 character
# 0x1256D. This character occurs within the CNS 11643-1986 block of tone marks
# for bopomofo and is intended to be the tone mark for the first tone in
# Mandarin Chinese. We have selected the mapping U+02C9 MODIFIER LETTER
# MACRON (Mandarin Chinese first tone) to reflect this semantic.
# However, because bopomofo uses the absence of a tone mark to indicate
# the first Mandarin tone, most implementations of CNS 11643-1986
# represent this character with a blank space, and so a mapping such as
# U+2003 EM SPACE might be preferred.
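# A converter built on this table could make the choice described in the
# note above configurable. The Python sketch below is only an illustration:
# the policy names "semantic" and "visual" and the function
# first_tone_target are hypothetical and do not come from this file.

```python
FIRST_TONE_CNS = 0x1256D  # CNS 11643-1986 first-tone mark discussed above

# The two candidate targets named in the note (policy names are hypothetical).
FIRST_TONE_CHOICES = {
    "semantic": 0x02C9,  # U+02C9 MODIFIER LETTER MACRON, the table's choice
    "visual": 0x2003,    # U+2003 EM SPACE, the blank-space alternative
}


def first_tone_target(policy: str = "semantic") -> int:
    """Return the Unicode code point to use for CNS 0x1256D."""
    return FIRST_TONE_CHOICES[policy]
```

# Defaulting to "semantic" keeps the behaviour of the table as published;
# "visual" follows implementations that render the mark as a blank.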
# Format: Three tab-separated columns
#     Column #1 is the CNS 11643-1986 code (in hex as 0xXXXXX)
#     Column #2 is the Unicode (in hex as 0xXXXX)
#     Column #3 is the Unicode name (follows a comment sign, '#')
#         The official names for Unicode characters U+4E00
#         to U+9FA5, inclusive, are "CJK UNIFIED IDEOGRAPH-XXXX",
#         where XXXX is the code point. Including all these
#         names in this file increases its size substantially
#         and needlessly. The token "<CJK>" is used for the
#         name of these characters. If necessary, it can be
#         expanded algorithmically by a parser or editor.
# The entries are in CNS 11643-1986 order
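# The three-column format and the "<CJK>" expansion described above can be
# sketched in Python as follows. The function names parse_entry and
# split_cns are hypothetical. split_cns assumes that the leading hex digit
# of the 0xXXXXX code is the plane number, followed by a row byte and a
# cell byte. The second sample line in the usage note is illustrative
# only and is not quoted from the table.

```python
def parse_entry(line: str):
    """Split one tab-separated table line into (cns, ucs, name)."""
    cns_hex, ucs_hex, comment = line.rstrip("\n").split("\t", 2)
    cns = int(cns_hex, 16)   # int() accepts the "0x" prefix when base is 16
    ucs = int(ucs_hex, 16)
    name = comment.lstrip("#").strip()
    # Expand the "<CJK>" token for code points U+4E00 to U+9FA5, inclusive.
    if name == "<CJK>" and 0x4E00 <= ucs <= 0x9FA5:
        name = f"CJK UNIFIED IDEOGRAPH-{ucs:04X}"
    return cns, ucs, name


def split_cns(code: int):
    """Split a 0xXXXXX code into (plane, row, cell) under the stated assumption."""
    return code >> 16, (code >> 8) & 0xFF, code & 0xFF
```

# For example, parse_entry("0x1256D\t0x02C9\t# MODIFIER LETTER MACRON")
# yields the pair 0x1256D and 0x02C9 with the name unchanged, and an
# illustrative line such as "0x14421\t0x4E00\t# <CJK>" yields the name
# "CJK UNIFIED IDEOGRAPH-4E00".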
# Name: Unihan database
# Unicode version: 4.0.1
# Date: 31 October 2003
# Copyright (c) 1996-2003 Unicode, Inc. All Rights reserved.
# The Unicode Character Database (including the Unihan database) is provided
# as is by Unicode, Inc. No claims are made as to fitness for any particular
# purpose. No warranties of any kind are expressed or implied. The recipient agrees
# to determine applicability of information provided. If this file has been
# purchased on magnetic or optical media from Unicode, Inc., the sole remedy for
# any claim will be exchange of defective media within 90 days of receipt.
# This disclaimer is applicable for all other data files accompanying the Unicode
# Character Database, some of which have been compiled by the Unicode Consortium,
# and some of which have been supplied by other sources.
# Limitations on Rights to Redistribute This Data
# Recipient is granted the right to make copies in any form for internal
# distribution and to freely use the information supplied in the creation
# of products supporting the Unicode(TM) Standard. The files in the Unicode
# Character Database can be redistributed to third parties or other
# organizations (whether for profit or not) as long as this notice and
# the disclaimer notice are retained. Information can be extracted from
# these files and used in documentation or programs, as long as there is
# an accompanying notice indicating the source.
0x2721 - 0x277E = 0x2F00 -
0x2821 - 0x287E = 0x2F5F -
0x2921 - 0x2939 = 0x2FBD -
0x293A - 0x2951 = 0x31A0 -
0x2952 - 0x297E = 0x3041 -
0x2A21 - 0x2A47 = 0x306E -
0x2A48 - 0x2A4D = 0x3099 -
0x2A4E - 0x2A7E = 0x30A1 -
0x2B21 - 0x2B4D = 0x30D2 -
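# The range lines above can be expanded into individual mappings. The
# Python sketch below assumes that a trailing "-" after the destination
# means consecutive destination values, so the first source code maps to
# the stated destination and each following code maps to the next value.
# The names RANGE_RE and expand_range are hypothetical.

```python
import re

# Matches lines of the form "0xAAAA - 0xBBBB = 0xCCCC -".
RANGE_RE = re.compile(
    r"^\s*(0x[0-9A-Fa-f]+)\s*-\s*(0x[0-9A-Fa-f]+)\s*=\s*(0x[0-9A-Fa-f]+)\s*-\s*$"
)


def expand_range(line: str) -> dict:
    """Expand one range line into a {source: destination} dictionary."""
    match = RANGE_RE.match(line)
    if match is None:
        raise ValueError(f"not a range mapping line: {line!r}")
    src_begin, src_end, dst_begin = (int(g, 16) for g in match.groups())
    # Assumed semantics: the destination advances in step with the source.
    return {
        src: dst_begin + (src - src_begin)
        for src in range(src_begin, src_end + 1)
    }
```

# With this reading, expand_range("0x2721 - 0x277E = 0x2F00 -") produces
# 94 entries running from 0x2721 -> 0x2F00 to 0x277E -> 0x2F5D.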