GNU Info

Info Node: (mtools.info)explicit tables

(mtools.info)explicit tables


Prev: country Up: character sets
Enter node , (file) or (file)node

Configuration using explicit translation tables
-----------------------------------------------

   Translation tables may be described in line in the configuration
file. Two tables are needed: first the DOS-to-Unix table, and then the
Lowercase-to-Uppercase table. A DOS-to-Unix table starts with the
`tounix' keyword, followed by a colon, and 128 hexadecimal numbers.  A
lower-to-upper table starts with the `fucase' keyword, followed by a
colon, and 128 hexadecimal numbers.

   The tables only show the translations for characters whose codes is
greater than 128, because translation for lower codes is trivial.

   Example:

      tounix:
        0xc7 0xfc 0xe9 0xe2 0xe4 0xe0 0xe5 0xe7
        0xea 0xeb 0xe8 0xef 0xee 0xec 0xc4 0xc5
        0xc9 0xe6 0xc6 0xf4 0xf6 0xf2 0xfb 0xf9
        0xff 0xd6 0xdc 0xf8 0xa3 0xd8 0xd7 0x5f
        0xe1 0xed 0xf3 0xfa 0xf1 0xd1 0xaa 0xba
        0xbf 0xae 0xac 0xbd 0xbc 0xa1 0xab 0xbb
        0x5f 0x5f 0x5f 0x5f 0x5f 0xc1 0xc2 0xc0
        0xa9 0x5f 0x5f 0x5f 0x5f 0xa2 0xa5 0xac
        0x5f 0x5f 0x5f 0x5f 0x5f 0x5f 0xe3 0xc3
        0x5f 0x5f 0x5f 0x5f 0x5f 0x5f 0x5f 0xa4
        0xf0 0xd0 0xc9 0xcb 0xc8 0x69 0xcd 0xce
        0xcf 0x5f 0x5f 0x5f 0x5f 0x7c 0x49 0x5f
        0xd3 0xdf 0xd4 0xd2 0xf5 0xd5 0xb5 0xfe
        0xde 0xda 0xd9 0xfd 0xdd 0xde 0xaf 0xb4
        0xad 0xb1 0x5f 0xbe 0xb6 0xa7 0xf7 0xb8
        0xb0 0xa8 0xb7 0xb9 0xb3 0xb2 0x5f 0x5f
     
      fucase:
        0x80 0x9a 0x90 0xb6 0x8e 0xb7 0x8f 0x80
        0xd2 0xd3 0xd4 0xd8 0xd7 0xde 0x8e 0x8f
        0x90 0x92 0x92 0xe2 0x99 0xe3 0xea 0xeb
        0x59 0x99 0x9a 0x9d 0x9c 0x9d 0x9e 0x9f
        0xb5 0xd6 0xe0 0xe9 0xa5 0xa5 0xa6 0xa7
        0xa8 0xa9 0xaa 0xab 0xac 0xad 0xae 0xaf
        0xb0 0xb1 0xb2 0xb3 0xb4 0xb5 0xb6 0xb7
        0xb8 0xb9 0xba 0xbb 0xbc 0xbd 0xbe 0xbf
        0xc0 0xc1 0xc2 0xc3 0xc4 0xc5 0xc7 0xc7
        0xc8 0xc9 0xca 0xcb 0xcc 0xcd 0xce 0xcf
        0xd1 0xd1 0xd2 0xd3 0xd4 0x49 0xd6 0xd7
        0xd8 0xd9 0xda 0xdb 0xdc 0xdd 0xde 0xdf
        0xe0 0xe1 0xe2 0xe3 0xe5 0xe5 0xe6 0xe8
        0xe8 0xe9 0xea 0xeb 0xed 0xed 0xee 0xef
        0xf0 0xf1 0xf2 0xf3 0xf4 0xf5 0xf6 0xf7
        0xf8 0xf9 0xfa 0xfb 0xfc 0xfd 0xfe 0xff

   The first table maps DOS character codes to Unix character codes. For
example, the DOS character number 129. This is a u with to dots on top
of it. To translate it into Unix, we look at the character number 1 in
the first table (1 = 129 - 128). This is 0xfc. (Beware, numbering
starts at 0).  The second table maps lower case DOS characters to upper
case DOS characters. The same lower case u with dots maps to character
0x9a, which is an uppercase U with dots in DOS.

Unicode characters greater than 256
-----------------------------------

   If an existing MS-DOS name contains Unicode character greater than
256, these are translated to underscores or to characters which are
close in visual appearance. For example, accented consonants are
translated into their unaccented counterparts. This translation is used
for mdir and for the Unix filenames generated by mcopy. Linux does
support Unicode too, but unfortunately too few applications support it
yet to bother with it in mtools. Most importantly, xterm can't display
Unicode yet. If there is sufficient demand, I might include support for
Unicode in the Unix filenames as well.

   *Caution:* When deleting files with mtools, the underscore matches
all characters which can't be represented in Unix. Be careful with mdel!


automatically generated by info2www version 1.2.2.9