Manual Page Result
0
Command: deckanji | Section: 5 | Source: Digital UNIX | File: deckanji.5.gz
deckanji(5) File Formats Manual deckanji(5)
NAME
deckanji - A character encoding system (codeset) for Japanese
DESCRIPTION
The DEC Kanji codeset consists of the following characters: ASCII or
JIS X0201 Roman letters Katakana characters specified by JIS X0201 JIS
X0208 characters User-defined characters (UDC)
DEC Kanji uses a combination of single-byte data and two-byte data to
represent these characters.
JIS X0201 is a single-byte character set and consists of Roman letters
and Katakana characters, which are Japanese phonetic symbols. The Roman
letters defined in the JIS X0201-1976 standard are the same as ASCII
letters. (For information on the ASCII character set, refer to
ascii(5).) However, JIS X0201 replaces the ASCII tilde (~) character
with a horizontal bar (located at the upper part of a character cell),
and replaces the backslash (\) with the Japanese currency sign (Yen).
The JIS XO208 standard specifies two-byte character values that repre-
sent a variety of characters, including ideographic symbols.
DEC Kanji Encoding
All ASCII characters are represented by single-byte 7-bit values in DEC
Kanji. That is, the most significant bit (MSB) is always set off in the
byte that represents an ASCII character. The Roman letters and the
Katakana characters specified by JIS X0201 are also single-byte values
in which the most significant bit (MSB) is set off and on, respec-
tively.
The code table for JIS X0208 characters is divided into 94 rows, num-
bered from 1 to 94. Each row has 94 columns, also numbered from 1 to
94. JIS X0208 defines a total of 6877 characters which include: Special
symbols, in rows 1 and 2 Numerals and Roman letters, in row 3 Hiragana
characters, in row 4 Katakana characters, in row 5 Greek letters, in
row 6 Russian letters, in row 7 Symbols for drawing graphs, diagrams,
and lines, in row 8 First level Kanji characters, in rows 16 to 47 Sec-
ond level Kanji characters, in rows 48 to 84
To comply with the JIS X0208 standard, each JIS X0208 character is a
two-byte value in the DEC Kanji codeset. The MSB of both the first and
second bytes is always set on to distinguish JIS X0208 characters from
ASCII/JIS Roman or user-defined characters.
For each JIS X0208 character, The first byte of the two-byte value de-
termines the row number and the second determines the column number in
the JIS X0208 code table. The following formula shows the code value
for a JIS X0208 character in relation to its row and column numbers:
1st byte = A0 + Row number
2nd byte = A0 + Column number
For example, if a character is positioned at the first column of the
36th row, its code value is C4A1, which is calculated as follows:
1st byte = A0 (hex) + 36 = C4 (hex)
2nd byte = A0 (hex) + 01 = A1 (hex)
For user-defined character (UDC) definitions, DEC Kanji provides an
area of 2914 positions (from row 1 to row 31). Each UDC is represented
by a two-byte value, just like a JIS X0208 character value. However,
the MSB of the second byte of a UDC is set off to distinguish it from a
JIS X0208 character. The code range of the UDC area is A121 to BF7E.
The following formula calculates the code of a UDC in relation to its
row and column numbers:
1st byte = A0 + Row number
2nd byte = 20 + Column number
For example, if a UDC is positioned at the first column of the 16th
row, its code value is B021, which is calculated as follows:
1st byte = A0 (hex) + 16 = B0 (hex)
2nd byte = 20 (hex) + 01 = 21 (hex)
Codeset Conversion
The following codeset converter pairs are available for converting
Japanese characters between deckanji and other encoding formats. Refer
to iconv_intro(5) for an introduction to codeset conversion. For more
information about the other codeset for which deckanji is the input or
output, see the reference page specified in the list item. eucJP_deck-
anji, deckanji_eucJP
Converting from and to Japanese Extended UNIX Code: eucJP(5).
iso-2022-jp_deckanji, deckanji_iso-2022-jp
Converting from and to the ISO 2022-JP codeset: iso2022jp(5).
iso-2022-jpext_deckanji, deckanji_iso-2022-jpext
Converting from and to the ISO 2022-JPexp codeset: iso2022jp(5).
JIS7_deckanji, deckanji_JIS7
Converting from and to the JIS7 codeset: jiskanji(5). sdeck-
anji_deckanji, deckanji_sdeckanji
Converting from and to the Super DEC Kanji codeset: sdeck-
anji(5). SJIS_deckanji, deckanji_SJIS
Converting from and to the Shift JIS codeset: shiftjis(5).
Note that Shift JIS encoding is equivalent to the Microsoft
code-page format used on PCs for Japanese. Therefore, you can
use these converters to convert data between DEC Kanji and PC
code-page format. For information on PC code pages, see
code_page(5) UCS-2_deckanji, deckanji_UCS-2
Converting from and to UCS-2 format: Unicode(5). UCS-4_deck-
anji, deckanji_UCS-4
Converting from and to UCS-4 format: Unicode(5). UTF-8_deck-
anji, deckanji_UTF-8
Converting from and to UTF--8 format: Unicode(5).
Japanese Fonts
The operating system provides the following Japanese bitmap fonts in
various sizes and typefaces for 75dpi and 100dpi (dot-per-inch) display
devices: JIS X0201-1976 characters (Gothic family):
-jdecw-gothic-medium-r-normal--8-80-75-75-m-40-jisx0201.1976-0
-jdecw-gothic-medium-r-normal--14-140-75-75-m-70-jisx0201.1976-0
-jdecw-gothic-medium-r-normal--12-120-75-75-m-60-jisx0201.1976-0
-jdecw-gothic-medium-r-nor-
mal--24-240-75-75-m-120-jisx0201.1976-0 -jdecw-gothic-medium-r-
normal--10-100-75-75-m-50-jisx0201.1976-0 -jdecw-gothic-medium-
r-normal--18-180-75-75-m-90-jisx0201.1976-0 -jdecw-gothic-
medium-r-normal--17-120-100-100-m-85-jisx0201.1976-0 -jdecw-
gothic-medium-r-normal--34-240-100-100-m-170-jisx0201.1976-0
-jdecw-gothic-medium-r-nor-
mal--14-100-100-100-m-70-jisx0201.1976-0 -jdecw-gothic-medium-r-
normal--25-180-100-100-m-125-jisx0201.1976-0 -jdecw-gothic-
medium-r-normal--20-140-100-100-m-100-jisx0201.1976-0 -jdecw-
gothic-medium-r-normal--11-80-100-100-m-55-jisx0201.1976-0 JIS
X0201-1976 characters (Kmenu family)
-jdecw-kmenu-medium-r-normal--12-120-75-75-p-70-jisx0201.1976-0
-jdecw-kmenu-medium-r-nor-
mal--17-120-100-100-p-85-jisx0201.1976-0 JIS X0201-1976 charac-
ters (Mincho family)
-jdecw-mincho-medium-r-normal--8-80-75-75-m-40-jisx0201.1976-0
-jdecw-mincho-medium-r-normal--14-140-75-75-m-70-jisx0201.1976-0
-jdecw-mincho-medium-r-nor-
mal--24-240-75-75-m-120-jisx0201.1976-0 -jdecw-mincho-medium-r-
normal--10-100-75-75-m-50-jisx0201.1976-0 -jdecw-mincho-medium-
r-normal--18-180-75-75-m-90-jisx0201.1976-0 -jdecw-mincho-
medium-r-normal--17-120-100-100-m-85-jisx0201.1976-0 -jdecw-min-
cho-medium-r-normal--34-240-100-100-m-170-jisx0201.1976-0
-jdecw-mincho-medium-r-nor-
mal--14-100-100-100-m-70-jisx0201.1976-0 -jdecw-mincho-medium-r-
normal--25-180-100-100-m-125-jisx0201.1976-0 -jdecw-mincho-
medium-r-normal--20-140-100-100-m-100-jisx0201.1976-0 -jdecw-
mincho-medium-r-normal--11-80-100-100-m-55-jisx0201.1976-0 JIS
X0201-1976 characters (Screen family)
-jdecw-screen-medium-r-normal--24-240-75-75-m-120-jisx0201-ro-
mankana -jdecw-screen-medium-r-nor-
mal--18-180-75-75-m-80-jisx0201-romankana -jdecw-screen-medium-
r-normal--14-140-75-75-m-70-jisx0201-romankana -jdecw-screen-
medium-r-normal--10-100-75-75-m-50-jisx0201-romankana JIS X0208
characters (Gothic family)
-jdecw-gothic-medium-r-nor-
mal--14-140-75-75-m-140-jisx0208.1983-1 -jdecw-gothic-medium-r-
normal--12-120-75-75-m-120-jisx0208.1983-1 -jdecw-gothic-medium-
r-normal--24-240-75-75-m-240-jisx0208.1983-1 -jdecw-gothic-
medium-r-normal--10-100-75-75-m-100-jisx0208.1983-1 -jdecw-
gothic-medium-r-normal--18-180-75-75-m-180-jisx0208.1983-1
-jdecw-gothic-medium-r-normal--8-80-75-75-m-80-jisx0208.1983-1
-jdecw-gothic-medium-r-nor-
mal--17-120-100-100-m-170-jisx0208.1983-1 -jdecw-gothic-medium-
r-normal--34-240-100-100-m-340-jisx0208.1983-1 -jdecw-gothic-
medium-r-normal--14-100-100-100-m-140-jisx0208.1983-1 -jdecw-
gothic-medium-r-normal--25-180-100-100-m-250-jisx0208.1983-1
-jdecw-gothic-medium-r-nor-
mal--20-140-100-100-m-200-jisx0208.1983-1 -jdecw-gothic-medium-
r-normal--11-80-100-100-m-110-jisx0208.1983-1 JIS X0208 charac-
ters (Mincho family)
-jdecw-mincho-medium-r-nor-
mal--14-140-75-75-m-140-jisx0208.1983-1 jdecw-mincho-medium-r-
normal--12-120-75-75-m-120-jisx0208.1983-1 -jdecw-mincho-medium-
r-normal--24-240-75-75-m-240-jisx0208.1983-1 -jdecw-mincho-
medium-r-normal--10-100-75-75-m-100-jisx0208.1983-1 -jdecw-min-
cho-medium-r-normal--18-180-75-75-m-180-jisx0208.1983-1 -jdecw-
mincho-medium-r-normal--8-80-75-75-m-80-jisx0208.1983-1 -jdecw-
mincho-medium-r-normal--17-120-100-100-m-170-jisx0208.1983-1
-jdecw-mincho-medium-r-nor-
mal--34-240-100-100-m-340-jisx0208.1983-1 -jdecw-mincho-medium-
r-normal--14-100-100-100-m-140-jisx0208.1983-1 -jdecw-mincho-
medium-r-normal--25-180-100-100-m-250-jisx0208.1983-1 -jdecw-
mincho-medium-r-normal--20-140-100-100-m-200-jisx0208.1983-1
-jdecw-mincho-medium-r-nor-
mal--11-80-100-100-m-110-jisx0208.1983-1 JIS X0208 characters
(Screen family)
-jdecw-screen-medium-r-nor-
mal--24-240-75-75-m-240-jisx0208-kanji00 -jdecw-screen-medium-r-
normal--10-100-75-75-m-100-jisx0208-kanji00 -jdecw-screen-
medium-r-normal--18-180-75-75-m-160-jisx0208-kanji00 -jdecw-
screen-medium-r-normal--16-160-75-75-m-160-jisx0208-kanji00
-jdecw-screen-medium-r-nor-
mal--14-140-75-75-m-140-jisx0208-kanji00 -jdecw-screen-medium-r-
normal--24-240-75-75-m-240-jisx0208-kanji11 -jdecw-screen-
medium-r-normal--10-100-75-75-m-100-jisx0208-kanji11 -jdecw-
screen-medium-r-normal--18-180-75-75-m-160-jisx0208-kanji11
-jdecw-screen-medium-r-nor-
mal--14-140-75-75-m-140-jisx0208-kanji11
For printers, the operating system provides only Japanese fonts that
are printer-resident; that is, there are no Japanese fonts that can be
dynamically down-loaded to the printer. Refer to i18n_printing(5) for
general information on printing non-English text.
SEE ALSO
Commands: locale(1)
Others: ascii(5), code_page(5), eucJP(5), i18n_intro(5), i18n_print-
ing(5), iconv_intro(5), iso2022jp(5), Japanese(5), jiskanji(5), sdeck-
anji(5), shiftjis(5), Unicode(5)
deckanji(5)