# $NetBSD: UCS%JISX0208@1990.src,v 1.6 2006/07/05 15:27:53 tnozaki Exp $
NAME "UCS/JISX0208:1990"
SRC_ZONE 0x005C - 0xFFE5
# This mapping data is made from the mapping data provided by Unicode, Inc.
# Name: JIS X 0208 (1990) to Unicode
# Unicode version: 1.1
# Table format: Format A
# Copyright (c) 1991-1994 Unicode, Inc. All Rights reserved.
# This file is provided as-is by Unicode, Inc. (The Unicode Consortium).
# No claims are made as to fitness for any particular purpose. No
# warranties of any kind are expressed or implied. The recipient
# agrees to determine applicability of information provided. If this
# file has been provided on magnetic media by Unicode, Inc., the sole
# remedy for any claim will be exchange of defective media within 90
# days of receipt.
#
# Recipient is granted the right to make copies in any form for
# internal distribution and to freely use the information supplied
# in the creation of products supporting Unicode. Unicode, Inc.
# specifically excludes the right to re-distribute this file directly
# to third parties or other organizations whether for profit or not.
# This table contains one set of mappings from JIS X 0208 (1990) into Unicode.
# Note that these data are *possible* mappings only and may not be the
# same as those used by actual products, nor may they be the best suited
# for all uses. For more information on the mappings between various code
# pages incorporating the repertoire of JIS X 0208 (1990) and Unicode, consult the
# VENDORS mapping data. Normative information on the mapping between
# JIS X 0208 (1990) and Unicode may be found in the Unihan.txt file in the
# latest Unicode Character Database.
#
# If you have carefully considered the fact that the mappings in
# this table are only one possible set of mappings between JIS X 0208 (1990)
# and Unicode and have no normative status, but still feel that you
# have located an error in the table that requires fixing, you may
# report any such error to errata@unicode.org.
# Format: Four tab-separated columns
#   Column #1 is the shift-JIS code (in hex)
#   Column #2 is the JIS X 0208 code (in hex as 0xXXXX)
#   Column #3 is the Unicode (in hex as 0xXXXX)
#   Column #4 is the Unicode name (follows a comment sign, '#')
#     The official names for Unicode characters U+4E00
#     to U+9FA5, inclusive, are "CJK UNIFIED IDEOGRAPH-XXXX",
#     where XXXX is the code point. Including all these
#     names in this file increases its size substantially
#     and needlessly. The token "<CJK>" is used for the
#     name of these characters. If necessary, it can be
#     expanded algorithmically by a parser or editor.
#
# The entries are in JIS X 0208 order.
# The following algorithms can be used to change the hex form
# of JIS X 0208 to other standard forms:
#
#   To change hex to EUC form, add 0x8080.
#   To change hex to kuten form, first subtract 0x2020. Then
#   the high and low bytes correspond to the ku and ten of
#   the kuten form. For example, 0x2121 -> 0x0101 -> 0101;
#   0x7426 -> 0x5406 -> 8406
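#
# As a worked illustration of the two conversions above, here is a minimal
# sketch in C. It is kept inside comments so that this file remains valid
# mapping source. The helper names jis_to_euc and jis_to_kuten are
# hypothetical and are not part of this file or of any NetBSD API.
#
#   /* JIS X 0208 hex form -> EUC form: add 0x80 to each byte. */
#   unsigned int
#   jis_to_euc(unsigned int jis)
#   {
#           return jis + 0x8080;
#   }
#
#   /* JIS X 0208 hex form -> kuten: each byte minus 0x20, read as decimal. */
#   void
#   jis_to_kuten(unsigned int jis, int *ku, int *ten)
#   {
#           unsigned int v = jis - 0x2020;
#
#           *ku = (int)(v >> 8);     /* high byte is the ku (row) */
#           *ten = (int)(v & 0xFF);  /* low byte is the ten (cell) */
#   }
#
# Under these assumptions, 0x7426 yields EUC 0xF4A6 and kuten 84-06,
# which agrees with the example given above.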
# The kanji mappings are a normative part of ISO/IEC 10646. The
# non-kanji mappings are provisional, pending definition of
# official mappings by Japanese standards bodies.
#
# For any comments or problems, contact <John_Jenkins@taligent.com>
# NetBSD specific modification:
# 2003-08-18 : add U+FF3C -> 0x2140 conversion.
# see UCS%JISX0208@1990.src.