Codebase list eo-spell / HEAD
HEAD

Tree @HEAD (Download .tar.gz)

# Name:         README
# Content:      Information about Esperanto dictionary for the Ispell speller
# Created:      1997-08-30 de Sergio Pokrovskij <pok@nbsp.nsk.su>
# Versio 2.0.
# Lasta sxangxo:
#               Time-stamp: <98/09/18 11:33:17 pok>
#
# Copyright 1997, 1998 Sergio Pokrovskij
#
# This dictionary  is available on the terms of GNU General Public License
# (Free Software Foundation, 675 Mass Ave, Cambridge, MA 02139, USA).

Here is the Esperanto dictionary, compiled by Sergio Pokrovskij for
the version 3 of ispell.

You will first need to get ispell 3.0+ it is available from
ftp://ftp.cs.ucla.edu/pub/ispell-3.1; make sure that in its local.h
NO8BIT is commented out.

There are several encodings used with Esperanto text:

1. The very best is the Unicode or its subset, which contains the
   esperantic letters, like the Microsoft's WGL4; unfortunately, most
   of the present-day editors do not support 2-byte characters. I have
   supplied Unicode code elements for the UTF-8 encoding, but I have no
   editor to make use of it. Please tell me if this could make sense.
   
2. The second best choice is the Latin-3 encoding (ISO-8859-3); this
   is the encoding I currently use.

3. For the sake of the ASCII-impaired (and ANSI-impaired), there are
   several surrogates. I offer a support for three of them:

3.1. The TeX-like ^cirka^u-style;

3.2. The most popular cxirkaux-style, which is also convenient for
     lexicographical ordering and thus is used in the dictionaries;

3.3. The unambiguous cxirkavx-style, which creates no problems with
     proper names like "Michaux", which the Westerners often smuggle into
     Esperantic texts.
     
4. There is a quick hack to spellcheck MS-Word documents; it relies on
   the RTF representation.

So, the eo.aff file from this package supports Latin-3 and the
ASCIIzations according to the following styles:
     
        ^cirka^u        (turned on with C-c i ^)
        cxirkaux        (turned on with C-c i u)
        cxirkavx        (turned on with C-c i v)
        \'e6irka\'fd    (turned on with C-c i w)

     
SIMPLE INSTALL

If you do not care about customizing the dictionary to your
speciality, copy the files eo.aff and eo.hash to the folder where
ispell expects to find them (LIBDIR in local.h or config.h; probably
you already have your american.hash there).

After that you can call
                  ispell -d eo FILENAME
(see TO USE below).

NOTE. There is an Ispell tradition to name languages by their own
names rather than according to the ISO standard (the ISO name for
Esperanto is "eo"). If you prefer this Ispell custom you may rename
the resulting dictionary to "Esperanto.hash" or similarly.


CUSTOMIZED BUILDING

In order to enable selective construction of dictionaries, there are
separate files:

kerno.txt       the lexical kernel
komputo.txt     some computer-science terminology according to the
                "Komputika Leksikono"
                ( http://www.cwi.nl/~fagot/eo/KompLeks/
                  or
                  http://www.esperanto.mv.ru/KompLeks/UTF8/DEFAULT.html
                )
movado.txt      vocabulary of the Esperanto Movement
etnoland.txt    countries and ethnography
kancero.txt     redundant words, which are used by some esperantists,
                though they are less precise and unnecessarily
                complicate the language; e.g. olda (maljuna or
                malnova)
personoj.txt    given names and assimilated names of important personalities
                (Petro, Zamenhof, Noa ...)


More files should be added, but are not ready yet:

tero.txt        (geography, geology, astronomy, meteorology)
medicino.txt    (anatomy, medicine, biology)
hxemio.txt      (chemistry, pharmacology)
botaniko.txt
pok.txt         the words specific to my idiolect
...

If you can assist in compiling them please let me know.

To make a customized dictionary, merge the desired files and sort
them, e.g.

     sort -u -f -o eo.txt kerno.txt komputo.txt

Use buildhash to create the hashed dictionary:

     buildhash eo.txt eo.aff eo.hash

(A model command file `kunmetu' and the resulting eo.hash are included
in the package.)


TO USE:

You can use ispell in a stand-alone mode; type

    ispell -d eo FILENAME

or you may prefer to customize your emacs. I have the following in my
.emacs file:

;-------- ISPELL:
(setq ispell-dictionary-alist
      (append '(
               ("eo"
               "[A-Za-z\246\254\266\274\306\330\335\336\346\370\375\376]"
               "[^A-Za-z\246\254\266\274\306\330\335\336\346\370\375\376]"
               "[-']" t ("-C") "~latin3")
               ("eo-rtf"
				 "[A-Za-z^\\'68]"
				 "[^A-Za-z^\\'68]"
				 "[-'\\]" t ("-C" "-d" "eo") "~rtf")
               ("eo-tex"
                "[A-Za-z^\\]"
                "[^A-Za-z^\\]"
                "[-'`\"]" t ("-C" "-d" "eo") "~tex")
               ("eo-ux"
                "[A-Za-z\\]"
                "[^A-Za-z\\]"
                "[-'`\"]" t ("-C" "-d" "eo") "~cxirkaux")
               ("eo-vx"
                "[A-Za-z\\]"
                "[^A-Za-z\\]"
                "[-'`\"]" t ("-C" "-d" "eo") "~cxirkavx")
              )
              ispell-dictionary-alist)
)
(setq  ispell-menu-map-needed t
       ispell-menu-map nil)
(load "ispell")  ; reload ISPELL 


;(add-hook 'text-mode-hook   ;  to run ispell interactively in text-mode
;          (function 
;           (lambda () 
;             (ispell-minor-mode)
;             )
;           ))
;
;
(global-set-key "\C-ci3" '(lambda () "set Esperanto dictionary in Latin-3"
                            (interactive) 
                            (ispell-change-dictionary "eo")
                            ))
(global-set-key "\C-ciu" '(lambda () "set Esperanto dictionary (cxirkaux)"
                            (interactive) 
                            (ispell-change-dictionary "eo-ux")
                            ))
(global-set-key "\C-civ" '(lambda () "set Esperanto dictionary (cxirkaux)"
                            (interactive) 
                            (ispell-change-dictionary "eo-vx")
                            ))
(global-set-key "\C-ciw" '(lambda () "set Esperanto dictionary (RTF)"
                            (interactive)
                            (ispell-change-dictionary "eo-rtf")
                            ))
(global-set-key "\C-cia" '(lambda () "set english dictionary"
                            (interactive)
                            (ispell-change-dictionary "english")
                            ))
;for easy switching between English and Esperanto.

;And some other short-cuts:
(global-set-key "\M-#" 'ispell-word)      ; easier to type than \M-$
(global-set-key "\C-cim" 'ispell-message)
(global-set-key "\C-cib" 'ispell-buffer)
(global-set-key "\C-cir" 'ispell-region)
(global-set-key "\C-cic" 'ispell-change-dictionary)


Some more comments are in legumin.l3 (in esperanto).