1 # $NetBSD: UCS%CNS11643-1.src,v 1.3 2006/06/22 16:18:45 tnozaki Exp $
5 SRC_ZONE 0x00A2 - 0xFFE5
12 # Name: CNS 11643-1986 to Unicode table (complete)
13 # Unicode version: 1.1
14 # Table version: 0.0d1
15 # Table format: Format A
16 # Date: 21 October 1994
18 # Copyright (c) 1991-1994 Unicode, Inc. All Rights reserved.
20 # This file is provided as-is by Unicode, Inc. (The Unicode Consortium).
21 # No claims are made as to fitness for any particular purpose. No
22 # warranties of any kind are expressed or implied. The recipient
23 # agrees to determine applicability of information provided. If this
24 # file has been provided on magnetic media by Unicode, Inc., the sole
25 # remedy for any claim will be exchange of defective media within 90
28 # Recipient is granted the right to make copies in any form for
29 # internal distribution and to freely use the information supplied
30 # in the creation of products supporting Unicode. Unicode, Inc.
31 # specifically excludes the right to re-distribute this file directly
32 # to third parties or other organizations whether for profit or not.
37 # This table contains one set of mappings from CNS 11643-1986 into Unicode.
38 # Note that these data are *possible* mappings only and may not be the
39 # same as those used by actual products, nor may they be the best suited
40 # for all uses. For more information on the mappings between various code
41 # pages incorporating the repertoire of CNS 11643-1986 and Unicode, consult the
42 # VENDORS mapping data. Normative information on the mapping between
43 # CNS 11643-1986 and Unicode may be found in the Unihan.txt file in the
44 # latest Unicode Character Database.
46 # If you have carefully considered the fact that the mappings in
47 # this table are only one possible set of mappings between CNS 11643-1986 and
48 # Unicode and have no normative status, but still feel that you
49 # have located an error in the table that requires fixing, you may
50 # report any such error to errata@unicode.org.
53 # WARNING! It is currently impossible to provide round-trip compatibility
54 # between CNS 11643-1986 and Unicode.
56 # (1) Some characters are not currently mapped because
57 # of conflicts with other mappings. They include the following:
59 # CNS Description Comments
61 # 0x12224 SPACING HEAVY OVERSCORE not in Unicode
62 # 0x12226 SPACING HEAVY UNDERSCORE not in Unicode
64 # (2) Some characters are not currently mapped because CNS 11643-1986 includes
65 # duplicate versions of some ideographs which are included in Unicode only
66 # once. They include the following:
68 # 0x1243E through 0x12440, HANGZHOU NUMERAL TEN, TWENTY, and THIRTY
69 # 0x12721 through 0x12939, KangXi radical set
71 # Entries for these characters are not included in this table.
75 # 1. There is an uncertainty in the mapping of the CNS 11643-1986 character
76 # 0x1256D. This character occurs within the CNS 11643-1986 block of tone marks
77 # for bopomofo and is intended to be the tone mark for the first tone in
78 # Mandarin Chinese. We have selected the mapping U+02C9 MODIFIER LETTER
79 # MACRON (Mandarin Chinese first tone) to reflect this semantic.
80 # However, because bopomofo uses the absense of a tone mark to indicate
81 # the first Mandarin tone, most implementations of CNS 11643-1986
82 # represent this character with a blank space, and so a mapping such as
83 # U+2003 EM SPACE might be preferred.
85 # Format: Three tab-separated columns
86 # Column #1 is the CNS 11643-1986 code (in hex as 0xXXXXX)
87 # Column #2 is the Unicode (in hex as 0xXXXX)
88 # Column #3 is the Unicode name (follows a comment sign, '#')
89 # The official names for Unicode characters U+4E00
90 # to U+9FA5, inclusive, is "CJK UNIFIED IDEOGRAPH-XXXX",
91 # where XXXX is the code point. Including all these
92 # names in this file increases its size substantially
93 # and needlessly. The token "<CJK>" is used for the
94 # name of these characters. If necessary, it can be
95 # expanded algorithmically by a parser or editor.
97 # The entries are in CNS 11643-1986 order
102 # Name: Unihan database
103 # Unicode version: 4.0.1
105 # Date: 31 October 2003
107 # Copyright (c) 1996-2003 Unicode, Inc. All Rights reserved.
113 # The Unicode Character Database (including the Unihan database) is provided
114 # as is by Unicode, Inc. No claims are made as to fitness for any particular
115 # purpose. No warranties of any kind are expressed or implied. The recipient agrees
116 # to determine applicability of information provided. If this file has been
117 # purchased on magnetic or optical media from Unicode, Inc., the sole remedy for
118 # any claim will be exchange of defective media within 90 days of receipt.
120 # This disclaimer is applicable for all other data files accompanying the Unicode
121 # Character Database, some of which have been compiled by the Unicode Consortium,
122 # and some of which have been supplied by other sources.
124 # Limitations on Rights to Redistribute This Data
126 # Recipient is granted the right to make copies in any form for internal
127 # distribution and to freely use the information supplied in the creation
128 # of products supporting the UnicodeTM Standard. The files in the Unicode
129 # Character Database can be redistributed to third parties or other
130 # organizations (whether for profit or not) as long as this notice and
131 # the disclaimer notice are retained. Information can be extracted from
132 # these files and used in documentation or programs, as long as there is
133 # an accompanying notice indicating the source.
136 0x00A2 = 0x2266 # fallback -> 0xFFE0
137 0x00A3 = 0x2267 # fallback -> 0xFFE1
138 0x00A5 = 0x2264 # fallback -> 0xFFE5
201 0x2016 = 0x225D # fallback -> 0x2225
206 0x2022 = 0x2126 # fallback -> 0x2027
259 0x223C = 0x2244 # fallback -> 0xFF5E
379 0x2609 = 0x2254 # fallback -> 0x2299
381 0x2641 = 0x2253 # fallback -> 0x2295
383 0x2F00 - 0x2F5D = 0x2721 -
384 0x2F5F - 0x2FBC = 0x2821 -
385 0x2FBD - 0x2FD5 = 0x2921 -
418 0x3041 - 0x306D = 0x2952 -
419 0x306E - 0x3094 = 0x2A21 -
420 0x3099 - 0x309E = 0x2A48 -
421 0x30A1 - 0x30D1 = 0x2A4E -
422 0x30D2 - 0x30FE = 0x2B21 -
423 0x30FB = 0x2126 # fallback -> 0x2027
461 0x31A0 - 0x31B7 = 0x293A -
942 0x5344 = 0x243F # fallback -> 0x3039
5887 0xFE32 = 0x2138 # fallback -> 0xFE31
6026 0xFF64 = 0x212F # fallback -> 0xFE51