%% LyX 1.1 created this file. For more info, see http://www.lyx.org/. %% Do not edit unless you really know what you are doing. \documentclass[11pt,oneside,english]{book} \usepackage[T1]{fontenc} \usepackage[latin1]{inputenc} \usepackage{geometry} \geometry{verbose,letterpaper,tmargin=1in,bmargin=1in,lmargin=1in,rmargin=1in} \usepackage{babel} \setcounter{secnumdepth}{3} \setlength\parskip{\medskipamount} \setlength\parindent{0pt} \usepackage{url} \makeatletter %%%%%%%%%%%%%%%%%%%%%%%%%%%%%% LyX specific LaTeX commands. \providecommand{\LyX}{L\kern-.1667em\lower.25em\hbox{Y}\kern-.125emX\@} \newenvironment{LyXParagraphIndent}[1]% { \begin{list}{}{% \setlength\topsep{0pt}% \addtolength{\leftmargin}{#1} \setlength\parsep{0pt plus 1pt}% } \item[] } {\end{list}} %%%%%%%%%%%%%%%%%%%%%%%%%%%%%% Textclass specific LaTeX commands. \newenvironment{lyxcode} {\begin{list}{}{ \setlength{\rightmargin}{\leftmargin} \raggedright \setlength{\itemsep}{0pt} \setlength{\parsep}{0pt} \normalfont\ttfamily}% \item[]} {\end{list}} %%%%%%%%%%%%%%%%%%%%%%%%%%%%%% User specified LaTeX commands. \usepackage[T1]{fontenc} \usepackage{xspace} \newcommand{\nach}{$\to$\xspace} \newcommand{\hoch}{\texttt{$^\wedge$}} \usepackage{html} \newcommand{\doubledash}{-\hspace{0.1em}-} \newcommand{\doubledashb}{-\/-} \newcommand{\dlt}{{\footnotesize$\ll$}} \newcommand{\dgt}{{\footnotesize$\gg$}} \begin{htmlonly} \renewenvironment{lyxcode} {\begin{list}{}{ \setlength{\rightmargin}{\leftmargin} \raggedright \setlength{\itemsep}{0pt} \setlength{\parsep}{0pt} \ttfamily}% \item[] \begin{ttfamily}} {\end{ttfamily} \end{list} } \newenvironment{LyXParagraphIndent}[1]% {\begin{quote}} {\end{quote}} \renewcommand{\LyX}{LyX} \renewcommand{\doubledash}{\rawhtml --\endrawhtml} \renewcommand{\doubledashb}{\rawhtml --\endrawhtml} \renewcommand{\dlt}{«} \renewcommand{\dgt}{»} \renewcommand{\nach}{\rawhtml to \endrawhtml} \renewcommand{\hoch}{\rawhtml ^\endrawhtml} \end{htmlonly} \makeatother \begin{document} \title{Aspell .33.7.1 alpha{\Large }\\ \textbf{\large A More Intelligent Ispell}} \author{Kevin Atkinson\\ kevina at users sourceforge net} \maketitle \tableofcontents{} \chapter{Introduction} Aspell is an Open Source spell checker designed to eventually replace Ispell. Its main feature is that it does a much better job of coming up with possible suggestions than Ispell does. In fact recent tests shows that it even does better than Microsoft Word 97's spell checker in some cases. In addition it has both compile time and run time support for other non-English languages. Aspell is also a library however the recommend way to use aspell is through the Pspell library as the actual interface to the aspell library is constantly changing. \section{The Future of Aspell\label{future}} Aspell .33.7 is most likely going to be the last official version of Aspell as in the near future Aspell is going to be merged into Pspell: \textbf{\large From:} {\large Kevin Atkinson}\\ \textbf{\large To:} {\large aspell-announce, pspell-announce}\\ \textbf{\large Date:} {\large 08/01/2001}\\ \textbf{\large Subject:} {\large Aspell and Pspell will be merged.}{\large \par} In the near future Aspell is going to be merged into Pspell. This will happen with the next major release of Pspell. If everything goes as planed, in the next major version of Pspell: \begin{itemize} \item Aspell will be included as part of Pspell as a module \item The Ispell module will also be included \item Most of the functionally of the Aspell command will be replaced with a new Pspell command which will work for any spell checker \item The English dictionary will no longer be included with Aspell. Instead it will be part of the Aspell-dicts package \item The manuals will be switched over from \LyX{}/\LaTeX{} to \LyX{}/DocBook so that in can easily be converted to other formats such as info and man. \item The current C++ interface may disappear and be replaced with a nicer C++ interface which will act as a wrapper for the C interface as I discussed earlier. If you are currently using the C++ interface and this will create a major problem for you please let me know. \end{itemize} This will require major reorganization of just about anything associated with Aspell and Pspell which will probably require packagers to redo there packaging scheme for Aspell and Pspell as some libraries and data files will disappear, other will be created, and the location of some of the data files will be moved. Also, as mentioned above Aspell, as you know it know, is going to disappear and be replaced with a generic utility. Some Aspell specific things, such as creating dictionaries, will remain in the Aspell utility however the executable will be renamed to aspell-util to avoid any confusion. Also, the Aspell sourceforge project might disappear, however I am not sure about this. At very least the Aspell mailing lists and bug tracker will disappear to avoid confusion on where question and bug reports should be posted. All of this \emph{should} happen around the end of August. \section{Comparison to other spell checker engines} \vspace{0.3cm} {\centering \begin{tabular}{|r|l|c|c|c|} \hline & Aspell& Ispell& Netscape 4.0& Microsoft Word 97\\ \hline Open Source& x& x& & \\ \hline Suggestion Intelligence& 88-98& 54& 55-70?& 71\\ \hline Personal part of Suggestions& x& x& x& \\ \hline Alternate Dictionaries& x& x& ?& ?\\ \hline International Support& x& x& ?& ?\\ \hline \end{tabular}\par} \vspace{0.3cm} The suggestion Intelligence is based on a small test kernel of misspelled/correct word pairs. Go to \url{http://aspell.sourceforge.net/test} for more info and how you can help contribute to the test kernel. The current scores for aspell are 88 in \textbf{fast} mode, 93 in \textbf{normal} mode, and 98 in \textbf{bad spellers} mode see section \ref{suggestion} for more information about the various suggestion modes. If you have any other information you would like to add to this chart please contact me at kevina at users sourceforge net. \subsection{Comparison to Ispell} \subsubsection{Features that only Aspell has} \begin{itemize} \item Does a much better job with coming up with suggestions than Ispell does or for that matter any other spell checker I have seen. If you know a spell checker that does a better job please let me know. \item Can learn from users misspellings. \item Is an actual library that others programs can link to instead of having to use it through a pipe. \item Is multiprocess intelligent. When a personal dictionary (or replacement list) is saved it will now first update the list against the dictionary on disk in case another process modified it. \item Can share the memory used in the main word list between processes. \item Support for detachable dictionaries \textbf{}so that more than one aspell class can use the same dictionary. \item Support for multiple personal dictionaries as well as support for special auxiliary dictionaries. \item Better support for run-together words. \item Ability to use multiple dictionaries by simply specifying it on the command line or in the configuration files. \end{itemize} \subsubsection{Things that, currently, only Ispell have} \begin{itemize} \item Lower memory footprint \item Support for affix compression \item Perhaps better support for spell checking (La)\TeX{} files. \item Support for spell checking Nroff files. \end{itemize} \chapter{Getting Started} \section{Requirements\label{reqs}} Aspell requires gcc 2.95 (or better) as the C++ compiler. Other C++ compilers should work with some effort. Other C++ compilers for mostly POSIX compliant (Unix, Linux, BeOS, CygWin) systems should work with out any major problems provided that the compile can handle all of the advanced C++ features Aspell uses. C++ compilers for non-Unix systems might work but it will take some work. Aspell at very least requires a Unix-like environment (sh, grep, sed, tr, etc...) in order to build. Aspell also uses a few POSIX functions when necessary. Nevertheless, Aspell will compile and run using the MinGW version of gcc provided that the CygWin environment is used to to build it. \textbf{Aspell also requires the Portable Spell Checker Interface Library}, otherwise known as Pspell, to be installed on your system, in the same location as Aspell will be installed in, before it will compile. Aspell requires version .12.2 or better. You can obtain the latest version of Pspell from \url{http://pspell.sourceforge.net/} \section{Obtaining} The latest version can always be found at Aspell's home page at \url{http://aspell.sourceforge.net}. \section{Support} Support for Aspell can be found on the Aspell mailing lists. Instructions for joining the various mailing lists (and an archive of them) can be found off the Aspell home page at \url{http://aspell.sourceforge.net}. Please use aspell-help for problems compiling and installing aspell, and aspell-user for general questions. \section{Helping Out} The easiest thing you can do to help out it is too send me your \textbf{.aspell.\dlt{}lang\dgt{}.prepl} file located in your home directory every so often. (Email kevina at users sourceforge net) The file contains data on which word pairs aspell is unable to come up with the proper suggestion. If the file does not exist is simply means that you were only using aspell in a program such as emacs which does not communicate the replacement pairs back to aspell. Sending me the file would help me improve aspell accuracy as right now I don't have very much real data to work with. If you are a good programmer and really want to help me consider doing one of the tasks listed in a recent post: \textbf{\large From}{\large : Kevin Atkinson}\\ \textbf{\large Date}{\large : 07/18/2001}\\ \textbf{\large Subject}{\large : Serious Help Needed for Aspell and Pspell}{\large \par} In the past I have asked for help, but in specific areas and not very forcefully. Well, now I \emph{really} would appreciate some help with developing Aspell and Pspell. I would like for Aspell to go to beta by the end of this summer but I really don't see that happening unless I get some help. Some of the areas I could use assistance in: \begin{itemize} \item Adding Affix compression support for Aspell \item Adding gettext support to Aspell and Pspell \item Making Aspell and Pspell thread safe \end{itemize} And lots of other smaller areas. I am looking for people who are competent programmers and have some experience with C++. I am also looking for people which have a good deal of shell programming, Perl, and autotools experience to help me clean up my build system. Some areas such as thread safety will simply not happen until I get some help because I really do not know enough about it. I hate to whine. However, I have put an extremely large about of unpaid time into Aspell and would appreciate some giving back from those who use Aspell in there distribution or have used the Pspell library as part of there applications, especially with those with the resources to do so such as Open Source companies. I release that my code, especially Aspell's, can be a bit scary with all the complex C++, however I will be glade to help any one with this part and for most of the tasks I need help with, one really don't even need to know C++ that well. Here is a breakdown of the tasks I would like help with and the skills required. \textbf{\textit{Task}}\textit{: Adding Affix compression support for Aspell} \textbf{Skills}: Competent programmer with some C++ knowledge. I know exactly what needs to be done in this area so it will require very little knowledge of how Aspell works. \textbf{\textit{Task}}\textit{: Adding gettext support to Aspell and Pspell} \textbf{Skills}: Competent programmer with C++ knowledge and gettext knowledge. I don't know what needs to be done in this area however they shouldn't have to study Aspell that intensely as most of my text strings are concentrated together. They may also need to rework some of my strings into format strings (ie convert cout < \char`\"{}Bla..\char`\"{} < name < \char`\"{}..bla..\char`\"{} to something like \char`\"{}Bla.. \$1 ..bla..\char`\"{}) so that it can be translated correctly. However I have an idea how this should be done. \textbf{\textit{Task}}\textit{: Making Aspell and Pspell thread safe.} \textbf{Skills}: Competent programmer with good C++ and thread safety knowledge. This is the area where the C++ skills will probably be needed the most. Even though Aspell itself is not multi-threaded I would like it to be thread safe so that it can be used by multi-threaded programs. I have several areas that are potently thread unsafe (such as accessing a global pool) and need some one to give me advice on how to best do this. I also have several classes which have the potential of being used by more than one thread (such as the personal dictionary) and needs to be made thread safe. \textbf{\textit{Task}}\textit{: Cleaning up the build system with Aspell and Pspell and similar tasks} \textbf{Skills}: Competent shell programmer (preferably someone with more knowledge than I have) and a good knowledge of how GNU autoconf, automake, and libtool work. \textbf{\textit{Task}}\textit{: Help with making the Aspell Dicts build system more robust} \textbf{Skills}: Competent shell programmer (preferably someone with more knowledge than I have) and a good enough knowledge of Perl to understand everything in in my \char`\"{}proc\char`\"{} script (\url{http://aspell.sourceforge.net/ aspell-gen-0.9.tar.bz2}). \textbf{\textit{Task}}\textit{: Miscellaneous other programming Tasks as I (or someone else) thinks of them} \textbf{Skills}: Competent programmer with good C++ knowledge. Once Aspell is complete (ie gets into a beta state) you are really going to appreciate it. It will be able to do everything ispell can do and a \emph{lot} more. However, I need some help in getting there. With out some serious help it is highly likely that Aspell will not be able to be complete (ie reach a beta state) till next summer as I won't be able to work on Aspell a great deal once School starts up again. Thanks in advance for anyone who can offer be some help. ---\\ Kevin Atkinson\\ kevina at users sourceforge net\\ \url{http://www.ibiblio.org/kevina/} \section{Compiling \& Installing} \subsection{Generic Install Instructions} Before Aspell is compiled Pspell must be installed. You can obtain the latest version of Pspell from \url{http://pspell.sourceforge.net/}. Both Pspell and Aspell must have the same prefix directory in order to function correctly. Once Pspell is installed and you have read the sections below to take care of any special requirements for you system simply type \begin{quote} ./configure \&\& make \end{quote} or \begin{quote} ./configure \doubledash{}disable-static \&\& make \end{quote} to avoid making the static libraries on a system that supports shared libraries. For additional configure options type ./configure \doubledash{}help. You can control what C++ compiler is used by setting the environmental variable CXX before running configure and you can control what flags are passed to the C++ compile via the environmental variable CXXFLAGS. Aspell should then compile with out any additional user intervention. If you run into problems please first check the sections below as that might solve your problem. If it doesn't please post a message to the aspell-help mailing list with the compiler, system you are using and any error messages that were produced. You can find more info on the aspell-help mailing list info page at \url{http://lists.sourceforge.net/lists/listinfo/aspell-help}. To install the program simply type \begin{quote} make install \end{quote} And that's all there is too it for a basic installation. If you do not have Ispell or the traditional Unix {}``spell'' utility installed on your system than you should also copy the compatibly scripts {}``ispell'' and {}``spell'' located in the scripts/ directory into your binary directory which is usually /usr/local/bin so that programs that expect ispell or spell command will work correctly. \subsection{General Problems} Aspell requires a specific version of Pspell. If the wrong version of Pspell is installed Aspell will not compile correctly. So before you try anything else make sure you are using the correct version of Pspell as stated in requirements (\ref{reqs}) section. Aspell does not use a released version of GNU Libtool. In previous versions of aspell this will often create problems if you inadvertently modify a file which causes Libtool to be called. However, as of Aspell .33.6.1 this should no longer be a problem and automake, autoconf, or libtool should not be called unless you specifically call them or if you configure Aspell with \doubledash{}enable-maintainer-mode. If you do notice any of these programs being called (and you did not configure with \doubledash{}enable-maintainer-mode) please let me know about it. If you have a need to modify configure.in or any of the Makefile.am's you should install the multi-language-branch of the CVS version of libtool. \subsection{Curses Notes} If you are having problems compiling termios.cc than the most likely reason is due to incompatibilities with the curses implementation on your system. If this is the case than you can explicitly disable the curses library with \doubledash{}disable-curses. By doing this you will lose the nice full screen interface but hopefully you will be able to at least get Aspell to compile correctly. If the curses library is installed in a non-standard location than you can specify the library and include directory with \doubledash{}enable-curses=\dlt{}lib\dgt{} and \doubledash{}enable-curses-include=\dlt{}dir\dgt{}. \textbf{Lib} can either be the complete path of the library (for example {}``/usr/local/curses/libcurses.a''), the name of the library (for example {}``ncurses'') or a combined location and library in the form {}``-L\dlt{}lib dir\dgt{} -l\dlt{}lib\dgt{}'' (for example {}``-L/usr/local/ncurses/lib -lncurses''). \textbf{Dir} is the location of the curses header files (for example {}``/usr/local/ncurses/include''). \subsection{Win32 Notes} Aspell is now able to compile on Win32 platforms using the Win32 version of gcc. Aspell .30.1 can either be compiled with the Cgiwin or the Mingw version of Gcc 2.95 using the Cgiwin development environment. The Mingw version of Aspell will have slightly less functionality, but none of which is noticeable to the end user. In order to get the nice full screen interface with Mingw when spell checking files a curses implementation that does not require Cygwin is required. The PDCurses (\url{http://www.lightlink.com/hessling/PDCurses/}) implementation is known to work, other implementations may work however they have not been tested. See the previous section for information on specifying the location of the curses library and include file. When compiling Pspell I recommend you configure with \doubledash{}disable-shared and \doubledash{}disable-ltdl. Shared libraries won't work correctly anyway on Win32 and trying to compile the ltdl library can lead to unnecessary complications. When compiling Aspell I recommend you configure with \doubledash{}disable-shared. If you are planning to use Aspell outside if the Cygwin environment I strongly recommend you install Aspell in its own location (ie prefix is not /usr/local/) and compile it with \doubledash{}enable-win32-relocatable. Please note that Pspell and Aspell must be installed in the same location. (ie don't install Pspell in c:/pspell and Aspell in c:/aspell. Instead install them both in c:/aspell) If Aspell is compiled with \doubledash{}enable-win32-relocatable and the \textbf{bindir} is set to the same value as \textbf{prefix} (ie not \dlt{}prefix\dgt{}/bin) then the Aspell directory (what prefix is set to) can be relocated anywhere provided that none of the data files are moved around within the Aspell directory. The default paths for Aspell are designed for a Unix system and not a Win32 system so you might want to specify different ones when compiling Aspell. Also if the HOME environmental variable is not set Aspell will assume it is the current working directly. This may lead to your personal word lists being saved in unpredictable locations. To solve this either compile with \doubledash{}enable-win32-relocatable (see above) or specify the complete path of the personal and replacement word lists in aspell.conf. If Aspell is compiled with \doubledash{}enable-win32-relocatable than the personal word lists are saved in the \textbf{prefix} directory and the name is changed from {}``\texttt{.aspell..{*}}'' to'' \texttt{.{*}}''. \subsection{Egcs 1.1 Notes} Aspell should now be able work with Egcs 1.1 but I have not been able to actually test it. If you have any luck one way or the other please let me know. \subsection{Upgrading from version .33} Even though .33.5 is a minor release it will unfortunately break binary compatibility with Aspell .33 due to the extensive changes needed to make Aspell better C++ compliant. This means applications such as Gaspell will need to be recompiled. \subsection{Upgrading from version .32.6} I have expanded the medium ({*}-med) word lists and decided to eliminate the large word lists ({*}-lrg) for now. However, the installing process will not automatically remove the large word lists so if you don't want them hanging around you should delete them your self. To remove all the files remove the following files from \dlt{}library dir\dgt{}/aspell: \begin{quote} {\raggedright american-lrg-only british-lrg-only canadian-lrg-only english-lrg-only american-lrg.multi british-lrg.multi canadian-lrg.multi \par} \end{quote} and the following files from \dlt{}share dir\dgt{}/pspell: \begin{quote} {\raggedright en-american-lrg-aspell.pwli en-canadian-lrg-aspell.pwli en-british-lrg-aspell.pwli\par} \end{quote} \subsection{Upgrading from version .32.1} Even though .32.5 is a minor release it breaks binary compatibly which means applications such as Gaspell will need to be recompiled. \subsection{Upgrading from version .31.1} The format and name of the main dictionary has changed yet again. The install process will over write the old version, so unless you are using dictionaries other than the one provided with aspell or want to have multiple versions of aspell installed you should not have to worry about this. The apostrophe (') is no longer considered part of the word if it appears at the end of a word. This means that you may have to manually remove words from your personal word list if you get a message similar to: \begin{quote} Invalid word \char`\"{}dogs'\char`\"{}: The character ''' may not appear at the end of a word. \end{quote} To remove the word simply delete the line containing the word form the personal word list ( normally called {}``.aspell.english.pws'' ). Aspell now uses a completely new word list. This means that same words that were in the original word list may no longer appear in the current one. You may now also chose from American, British, and Canadian spelling and from two sizes medium and large. See section \ref{dict-opts} for more information on choosing among the different choices. The original source that the word lists were created from is now found under the scowl/ directory. \subsection{Upgrading from version .30} The format of the main dictionary file has changed a bit. If you were able to use Aspell .30 then the old format should work. The only time the old format will NOT work is in the rare case the mmap fails. Previous version of aspell will just abort with an error when the mmap fails but the new version will attempt to read in the file using fread. Fread will fail with the old version of the main word list. \subsection{Upgrading from version .29.1} The format (but not the name) of the main dictionary has changed yet again. The install process will over write the old version, so unless you are using dictionaries other than the one provided with aspell or want to have multiple versions of aspell installed you should not have to worry about this. Aspell also now depends on Portable Spell Checker Interface Library otherwise known as Pspell. Pspell must be installed before aspell will compile, you can find it at \url{http://pspell.sourceforge.net/}. \subsection{Upgrading from version .29} The format and name of the main dictionary has changed which means it will need to be recompiled. The install process will remove the old files for you, so unless you are using dictionaries other than the one provided with aspell or want to have multiple versions of aspell installed you should not have to worry about this. \subsection{Upgrading from version .28.3} Aspell now uses namespaces which means egcs 1.0 and gcc 2.8 will no longer cut it. If this becomes a serious problem let me know as it should not be to difficult to get it working again with egcs 1.0 and gcc 2.8. Due to the new soundslike code the the main dictionary will need to be recompiled. The build process does this automatically so unless you want to have more than one version of aspell around you should not need to worry about this. The format and file name of the personal dictionaries has also changed. In most cases aspell will automatically detect this and convert it for you by using the following algorithm. \begin{enumerate} \item If no file exist of {*}.pws (for the personal word list) or {*}.prepl (for the personal replacement list) aspell will look for {*}.per or {*}.rpl respectfully. If that file is found it will read in the data using the old format. \item When saving the dictionary it will save is as {*}.pws or {*}.prepl respectfully. \item Once saved as the new format it will delete the old file. \end{enumerate} If you have an older version of aspell around you can restore the old dictionaries by using these command \begin{quote} aspell.new dump personal | aspell.old create personal aspell.new dump repl | aspell.old create repl \end{quote} The new version of aspell will then leave the old files alone as long as {*}.pws and {*}.prepl exist. Also, if the file does not end in .pws or .prepl it will try to read it in using the new format and if that fails it will read in the old format. When the file is saved it will be saved as the new format. I am hopping I will not have to change the format of the personal dictionaries again. However, the main word list however is very likely to change in format. \subsection{Upgrading from version .28.2.1} The behavior of {}``aspell check'' changed so that it will now over right the original file as creating new file was creating too many problems when used with programs like pine and vi. \subsection{Upgrading from version .27.2} The name of the personal word lists have changes from .aspell.per and .aspell.rpl to .aspell.\dlt{}lang\dgt{}.per and .aspell.\dlt{}lang\dgt{}.rpl respectively. \dlt{}lang\dgt{} is is the language name which will generally be {}``english''. If you wish to use your old word lists you will need to rename those files. \subsection{Upgrading from version .25} The format of the personal replacement dictionary has changed. So, you will either need to rename or remove the file \textbf{.aspell.rpl} located in your home directory. If you have information in this file you would like to preserve please send me an email. \subsection{Upgrading from version .24} Because the location of the main word list moved you should probably do a \textbf{make uninstall} (with the old version of a Aspell) before upgrading to remove the old word lists. A make uninstall will not remove any personal word lists. \chapter{Basic Usage} For a quick reference on the Aspell utility use the command {}``aspell \doubledash{}help''. \section{Spellchecking Individual Files\label{check}} To use Aspell to spellcheck a file type in \begin{quote} aspell check {[}\dlt{}options\dgt{}{]} \dlt{}filename\dgt{} \end{quote} at the command line where \dlt{}filename\dgt{} is the file you want to check and \dlt{}options\dgt{} is any number of optional options. Some of the more useful ones include: \begin{description} \item [\doubledash{}mode=\dlt{}mode>]the mode to use when checking files. The available modes are none, url, email, sgml, or tex. See section \ref{filter} for more informations on the various modes. \item [\doubledash{}dont-backup]don't create a backup file. \item [\doubledash{}sug-mode=\dlt{}mode\dgt{}]the suggestion mode to use where mode is one of ultra, fast, normal, or bad-spellers. See section \ref{suggestion} for more information on these modes. \item [\doubledash{}master=\dlt{}name\dgt{}]the main dictionary to use. The default aspell installation provided the following dictionaries: american, british, and canadian. \end{description} Please see Chapter \ref{customizing} for more information on the available options. For example to check the file foo.txt: \begin{quote} aspell check foo.txt \end{quote} and to check the file foo.txt using the bad-spellers suggestion mode and the large American English dictionary: \begin{quote} aspell check \doubledash{}sug-mode=bad-spellers \doubledash{}master=american-lrg foo.txt \end{quote} If the \textbf{mode} option is not given then aspell will use the extension of the file to determine the current mode. If the extension is .tex, then \TeX{} mode will be uses, if the extension is .html, .htm, .php, or .sgml it will check the file in sgml mode, otherwise it will use url mode. Please note that the \textbf{sgml-options} can be used to change what extension are used for the sgml mode. See chapter \ref{filter} for more information on the various modes that can be used. If Aspell was compiled with curses support and the TERM environmental variable is set to a capable terminal type than Aspell will use a nice full screen interface. Otherwise it will use a simpler {}``dumb'' terminal interface where the misspelled word is surrounded by two '{*}'. In either case the interface should be self explanatory. \section{Using Aspell with other Applications} \subsection{With Applications that Expect Ispell} Aspell can be used as a drop in replacement for Ispell for programs that use Ispell through a pipe such as Emacs and \LyX{}. It can also be used with programs that use simple call the ispell command and expect the original file to be overwritten with the corrected version. It support the basic features of Ispell, however it does not currently have a Nroff mode so there may be situations in which you still wish to use Ispell. Nevertheless, I have been using Aspell for Xemacs and \LyX{} since the middle of September of 1998 with out any problems. If you do not have Ispell installed on your system and have installed the Ispell compatibly script than you should not need to do anything as most applications will that expect Ispell will work as expected with Aspell via the Ispell compatibility script. Otherwise, the recommended way to use Aspell as a replacement for ispell is to change the Ispell command from within the program being used. If the program uses ispell in pipe mode simple change ispell to aspell. If the program calls the \textbf{}ispell command to check the file change {}``ispell'' \textbf{}with {}``aspell check''. If that is impossible and the program uses ispell through a pipe than the run-with-aspell script can be used for programs using ispell in pipe mode. The format of the script is: \begin{quote} run-with-aspell \dlt{}command\dgt{} \end{quote} where \dlt{}command\dgt{} is the name of the program with any optional arguments. The old method of mapping Ispell to Aspell is discouraged because it can create compatibility problems with programs that actually require Ispell such as Ispell's own scripts. \subsection{With Emacs and Xemacs} The easiest way to use Aspell with Emacs or Xemacs is to add this line: \begin{lyxcode} (setq-default~ispell-program-name~\char`\"{}aspell\char`\"{}) \end{lyxcode} to the end of your .emacs file. For some reason version 3.0 of ispell.el (the lisp program that (x)emacs uses) want to reverse the suggestion list. To fix this add this line: \begin{lyxcode} (setq-default~ispell-extra-args~'(\char`\"{}\doubledashb{}reverse\char`\"{})) \end{lyxcode} after the previous line in your .emacs file and it should solve the problem. Ispell.el, version 3.1 (December 1, 1998) and better, has the list reversing problem fixed. You can find it at \url{http://www.kdstevens.com/~stevens/ispell-page.html}. \subsection{With \LyX{}} Version 1.0 of \LyX{} provides support for Aspell learning for users mistake feature. To use aspell with \LyX{} 1.0 either change the \textbf{spell\_command} option in the lyxrc file or use the run-with-aspell utility. \subsection{With VIM} \textit{(The following section was written by \char`\"{}R. Marc\char`\"{}, rmarc at copacetic net.) } To use aspell in vim you simply need to add the following line to your .vimrc file: \begin{lyxcode} map~\^{}T~:w!:!aspell~check~\%:e!~\% \end{lyxcode} I use since that's the way you spell check in pico. In order to add a control character to your .vimrc you must type first. In this case . A more useful way to use Aspell, IMHO, is in combination with newsbody (\url{http://www.image.dk/~byrial/newsbody/}) which is how I use it since vim is my editor for my mailer and my news reader. \begin{lyxcode} map~\^{}T~\textbackslash{}1\textbackslash{}2:e!~\% map~\textbackslash{}1~:w! map~\textbackslash{}2~:!newsbody~-qs~-n~\%~-p~aspell~check~\textbackslash{}\%f \end{lyxcode} \subsection{With Pine} To use aspell in pine simply change the option \textbf{speller} to \begin{quote} aspell \doubledash{}mode=email check \end{quote} To change the speller option go to the main menu. Type \textbf{S} for \emph{setup}, \textbf{C} for \emph{config}, then W for \emph{where is}. Type in \textbf{speller} as the word to find. The speller option should be highlighted now. Hit enter, type in the above line, and hit enter again. Then type \textbf{E} for \emph{exit setup} and \textbf{Y} to save the change. If you have a strong desire to check other peoples comments change \textbf{speller} to \begin{quote} aspell check \end{quote} instead which will avoid switching aspell into email mode. \chapter{Managing Word Lists\label{manage}} \section{Creating an Individual Word List} To create an individual main word list from a list of words use the command \begin{quote} aspell \doubledash{}lang=\dlt{}lang\dgt{} create master ./\dlt{}base\dgt{} < \dlt{}wordlist\dgt{} \end{quote} where \dlt{}base\dgt{} is the name of the word list and \dlt{}word list\dgt{} is the list of words separated by white space. The name of the word list will automatically be converted to all lowercase. The {}``./'' is important because without it aspell will create the word list in the normal word list directory. If you are trying to create a word list in a language other than english check the aspell data-dir (usually /usr/share/aspell, use {}``aspell dump config'' to find out what it is on your system) to see if a language data file exists for your language. If not you will need to create one. See chapter \ref{inter} for more information on using Aspell with other languages. This will create the file \dlt{}base\dgt{} in the current directory. To use the new word list copy the file to the normal word list directory (use {}``aspell config'' to find out what it is) and use the option \doubledash{}master=\dlt{}base\dgt{}. The compiled dictionary file is machine dependent. It is dependent on endian order, and the page size for the machine because they are mmaped in. Please do not distribute the compiled dictionaries unless you are only distributing them for a particular platform such as you would a binary. That is why is normally installed in {}``lib/aspell'' instead of {}``share/aspell''. Aspell is now also able to use special {}``multi'' dictionaries. See section \ref{dict-opts} form more information. A personal and replacement word list can be created in a similar fashion. Because Aspell does not support any sort of affix compression like Ispell does Ispell word lists will not work as is. In order to use Ispell's word lists simply pipe the word list through ``ispell -e'' to expand the munched word lists. \subsection{Format of the Replacement Word List} The replacement word has each replacement pair on its own line in the following format \begin{lyxcode} \dlt{}misspelled~word\dgt{}:~\dlt{}correction\dgt{} \end{lyxcode} \section{The PWLI file} In order for Aspell to be able to correctly recognize a dictionary based on the setting of the LANG environmental variable and for Pspell to be able to find your word lists each main word list installed should have at least one PWLI file associated with it in the Pspell data directory. This is normally /usr/local/share/pspell/. You can use {}``pspell-config pkgdatadir'' to find out what it is on your system. Each PWLI has the the following name: \begin{quote} \dlt{}language\dgt{}{[}-{[}\dlt{}spelling\dgt{}{]}{[}-\dlt{}jargon\dgt{}{]}{]}-\dlt{}module\dgt{}.pwli \end{quote} Where \dlt{}language\dgt{} is the two letter language code, \dlt{}spelling\dgt{} is the particular spelling your interested in if the languages has multiple spelling in different parts of the world such as English, \dlt{}jargon\dgt{} is any extra informations to distinguish the word list from other ones with the same language and spelling, and \dlt{}module\dgt{} is the pspell module the main word list is for. For example: \begin{quote} en-aspell.pwli\\ en-american-aspell.pwli\\ en-american-medical-ispell.pwli\\ en-american-xlg-ispell.pwli\\ de\doubledash{}medical-ispell.pwli \end{quote} Notice how if the spelling is left out but the jargon is not there needs to be two dashes between the language and the jargon. Each PWLI file for an Aspell word list should then contain exactly one line which contains the full path of the main word list. \section{Dumping the contents of the word list} The dump command will simply dump the contents of a word list to stdout in a format than can be read back in with \textbf{aspell create}. If no word list is specified the command will act on the default one. For example the command \begin{quote} aspell dump personal \end{quote} will simply dump the contents of the current personal word list to stdout. \chapter{Customizing Aspell\label{customizing}} The behavior of Aspell can be changed by any number of options which can be specified at either the command line, the environmental variable ASPELL\_CONF, a personal configuration file, or a global configuration file. Options specified on the command line override options specified by the environmental variable. Options specified by the environmental variable override options specified by either of the configurations files. Finally options specified by the personal configuration file override options specified in the global configuration file. Options specified in the environmental variable ASPELL\_CONF, a personal configuration file, or a global configuration file will take effect no matter how Aspell is used which includes being used by other applications. Aspell has three basic type of options: \textbf{boolean}, \textbf{value}, and \textbf{list}. \textbf{Boolean} options are either enabled or disabled, \textbf{value} options take a specific value, and \textbf{list} options can either have entries added or removed from the list. \section{Specifying Options} \subsection{At the Command Line} All options specified at the command line have the following basic format: \begin{quote} \doubledash{}\dlt{}option\dgt{}{[}=\dlt{}value\dgt{}{]} \end{quote} where the '=' can be replaced by whitespace. However some options also have single letter abbreviations of the form: \begin{quote} -\dlt{}letter\dgt{}{[}\dlt{}optional whitespace\dgt{}\dlt{}value\dgt{}{]} \end{quote} \subsubsection{Boolean} To enable a boolean option simply special the option with out any corresponding value. For example to ignore accents when checking words use {}``\doubledash{}ignore-accents''. To disable a boolean option prefix the option name with a {}``dont-''. For example to not ignore accents when checking words use {}``\doubledash{}dont-ignore-accents''. If a boolean option has a single letter abbreviation simply give the letter corresponding to either enabling or disabling the option with out any corresponding value. For example to consider run-together words legal use {}``-C'' or to consider them illegal use {}``-B'' \subsubsection{Value} To specify a value option simply specify the option with its corresponding value. For example to set the filter mode to Tex use {}``\doubledash{}mode=tex''. If a value option has a single letter shortcut simply specify the single letter short cut with its corresponding value. For example to use a large american dictionary use {}``-d american-lrg''. \subsubsection{List} To add a value to the list prefix the option name with a {}``add-'' and then specify the value to add. For example to add the URL filter use {}``\doubledash{}add-filter url''. To remove a value from a list option prefix the option name with a {}``rem-'' and then specify the value to remove. For example to remove the URL filter use {}``\doubledash{}rem-filter url''. To remove all items from a list prefix the option name with a {}``rem-all'' without specify any value. For example to remove all filters use {}``\doubledash{}rem-all-filter''. \subsection{Via a Configuration File} Aspell can also accept options via a personal or global configuration file. The exact files to used are specified by the options \textbf{per-conf} and \textbf{conf} respectfully but the personal configuration file is normally {}``.aspell.conf'' located in the HOME directory and the global one is normally {}``aspell.conf'' which is located in the etc directory which is normally {}``/usr/etc'' or {}``/usr/local/etc''. To find out the particular values for your particular system use {}``aspell dump config''. Each line of the configuration file has the format: \begin{lyxcode} \dlt{}option\dgt{}~{[}\dlt{}value\dgt{}{]} \end{lyxcode} There may any number of spaces between the option and the value however it can only be spaces, ie there is no '=' between the option name and the value. Comments may also be included by preceding them with a '\#' as anything from a {}``\#'' to a newline is ignored. Blank lines are also allowed. Values set in the personal configuration file override those in the global file. Options specified at either the command line or via an environmental variable override those specified by either configuration file. \subsubsection{Boolean} To specify a boolean option simply include the option followed by a {}``true'' to enable it or a {}``false'' to disable it. For example to allow run-together words use {}``run-together true''. \subsubsection{Value} To specify a value option simply include the option followed by the corresponding option. For example to set the default language to german use {}``lang german''. \subsubsection{List} To add a value to the list prefix the option name with a {}``add-'' and then specify the value to add. For example to add the URL filter use {}``add-filter url''. To remove a value from a list option prefix the option name with a {}``rem-'' and then specify the value to remove. For example to remove the URL filter use {}``rem-filter url''. To remove all items from a list prefix the option name with a {}``rem-all'' without specify any value. For example to remove all filters use {}``rem-all-filter''. \subsection{Via an Environmental Variable} The environmental variable ASPELL\_CONF may also be used and it overrides any options set in the configuration file. The format of the string is exactly the same as the configuration file except that semicolons ( ; ) are used instead of newlines. \section{The Options} The following is a list of available options broken down by category. Each entry has the following format: \begin{quote}\begin{description} \item [\dlt{}option\dgt{}[,\dlt{}single letter abbreviations\dgt{}]] \textit{(\dlt{}type\dgt{})} \dlt{}description\dgt{} \end{description}\end{quote} Where single letter options are specified as they would appear at the command line, ie with the preceding dash. Boolean single letter options are specified in the following format: \begin{quote} -\dlt{}abbreviation to enable\dgt{}|-\dlt{}abbreviation to disable\dgt{} \end{quote} \dlt{}Option\dgt{} is one of the following: \textbf{boolean}, \textbf{string}, \textbf{file}, \textbf{dir}, \textbf{integer}, or \textbf{list}. \textbf{String}, \textbf{file}, \textbf{dir}, and \textbf{integer} types are all value options which can only take a specific type of value. \subsection{Basic Options} \begin{description} \item [conf]\textit{(file)} main configuration file \item [conf-dir]\textit{(dir)} location of main configuration file \item [data-dir]\textit{(dir}) location of language data files \item [local-data-dir]\textit{(dir)} alternative location of language data files. This directory is searched before \textbf{data-dir}. It defaults to the same directory the actual main word list is in (which is not necessarily dict-dir). \item [filter]\textit{(list)} add or removes a filter \item [home-dir](\textit{dir}) location for personal files \item [ignore,-W](\textit{integer}) ignore words <= n chars \item [ignore-case]\textit{(boolean)} ignore case when checking words \item [ignore-accents]\textit{(boolean)}ignore accents when checking words \item [ignore-repl]\textit{(boolean)} ignore commands to store replacement pairs \item [save-repl]\textit{(boolean)} save the replacement word list on save all \item [lang]\textit{(string)} default language to use when creating a dictionary or all else fails \item [language-tag]\textit{(string)} language code to use when selecting a dictionary, it follows the same format of the LANG environmental variable on most systems. In fact it defaults to the value of the LANG environmental variable if it is set. \item [mode]\textit{(string)} sets the filter mode. Mode is one if none, url, email, sgml, or tex. (The short cut options '-e' may be used for email, '-H' for Html/Sgml, or '-t' for Tex) \item [per-conf]\textit{(file)} personal configuration file \item [personal,-p]\textit{(file)} personal word list file name \item [prefix]\textit{(dir)} prefix directory \item [set-prefix]\textit{(boolean)} set the prefix based on executable location (only works on Win32 and when compiled with \doubledash{}enable-win32-relocatable) \item [repl]\textit{(file)} replacements list file name \item [keyboard]\textit{(file)} the base name of the keyboard definition file to use (see section \ref{typo}) \item [sug-mode]\textit{(mode)} suggestion mode = ultra | fast | normal | bad-spellers (see section \ref{suggestion}) \end{description} \subsection{Dictionary Options} The following options may be used to control which dictionaries to use and how they behave (see section \ref{dict-opts} for more information): \begin{description} \item [master,-d]\textit{(string)} base name of the main dictionary to use. The default Aspell installation provided the following dictionaries: american, british, and canadian. \item [dict-dir]\textit{(dir)} location of the main word list \item [extra-dicts]\textit{(list)} extra dictionaries to use \item [strip-accents]\textit{(boolean)} strip accents from all words in the dictionary \end{description} \subsection{Run-together Word Options} These may be used to control the behavior of run-together words (see section \ref{run-together} for more information): \begin{description} \item [run-together,-C|-B]\textit{(boolean)} consider run-together words legal \item [run-together-limit]\textit{(integer)} maximum numbers that can be strung together \item [run-together-min]\textit{(integer)} minimal length of interior words \end{description} \subsection{Filter Options} These options modify the behavior of the various filters (see section \ref{filter} for more information): \begin{description} \item [add|rem-email-quote]\textit{(list)} email quote characters \item [email-margin]\textit{(integer)} num chars that can appear before the quote char \item [sgml-check]\textit{(list)} sgml tags to always check. \item [sgml-extension]\textit{(list)} sgml file extensions. \item [tex-command]\textit{(list)} \TeX{} commands \item [tex-check-comments]\textit{(boolean)} check \TeX{} comments \end{description} \subsection{Aspell Utility Options} These options are may only be specified at the command line as there are aspell utility specific: \begin{description} \item [backup,-b|-x]\textit{(boolean)} create a backup file by appending {}``.bak'' to the file name. (Only applies when the command is \textbf{check}) \item [time]\textit{(boolean)} time load time and suggest time in pipe mode. \item [reverse]\textit{(boolean)} reverse the order of the suggestions list. \end{description} \section{Dumping Configuration Values} To find out the current value of all the options use the command {}``aspell dump config''. This will dump the current configuration to standard output. The format of the contents dumped is such that it can be used as either the global or personal configuration file. \section{Notes on various Options} \subsection{Pertaining to which word lists to use \label{dict-opts}} \subsubsection{How Aspell Selects an Appropriate Dictionary} Aspell will go through the following steps to find an appropriate dictionary: \begin{enumerate} \item If the \textbf{master} options is set in any fashion (via the command line, the ASPELL\_CONF environmental variable, or a configuration file) look for a dictionary of that name. If one could not be found complain. \item If the \textbf{language-tag} (\emph{not} \textbf{lang}) option or LANG environmental variable is set and master option is not then use it (giving preference to \textbf{language-tag} over LANG) to search for an appropriate dictionary. Aspell will use the same strategy that Pspell does, which is based on the installed .pwli files, to find an appropriate word list. For more information of how this is done see the Pspell manual. \item If 2 fails than look for a dictionary of the same name of current setting of the \textbf{lang} options. This will currently work even if the language name is invalid, but this fact should not be relied upon as it is an implementation detail. \item Finally, if all else fails, complain. \end{enumerate} \subsubsection{About Multi Dictionaries} As with precious versions of aspell you can specify the main dictionary to use via the -d or \doubledash{}master option. However as of Aspell .32 you can now also: \begin{enumerate} \item Specify more than word list to use with the \textbf{extra-dicts} option. \item Optionally have all accents striped form the word lists using \textbf{strip-accents} option. This is \emph{not} the same thing as the \textbf{ignore-accents} option. As enabling the \textbf{ignore-accents} would accept both cafe and café (notice the accent on the e), but only enabling strip-accents would only accent cafe, even if café is in the original dictionary. Specify \textbf{strip-accents} is just like using a word list with out the accents. \item Specify special {}``multi'' dictionaries. \end{enumerate} A {}``multi'' dictionary is a special file which basically a list of dictionary files to use. A multi dictionary must end is \textbf{.multi} and has roughly the same format of a configuration file where the two valid keys are \textbf{add} and \textbf{strip-accents}. The \textbf{add} key is used for adding individual word lists, or other {}``multi'' files. The \textbf{strip-accents} key is used to control if accents are striped from the dictionaries. Unlike the global strip-accent option this option only effects word lists that came after the option. For example: \begin{quote} strip-accents yes\\ add english\\ strip-accents no\\ add must-accent \end{quote} will strip accents from the english word list but not the must-accent word list. If the global strip-accents option is specified the local strip-accents options are ignored. \subsubsection{Provided Word Lists} Aspell now provides multi dictionaries for three variates of english: \textbf{american-med}, \textbf{british-med}, and \textbf{canadian-med}. The word lists themselves all contain accented words however the \textbf{strip-accents} option is enabled by default for all the individual word lists. If you wish to use the accented words you can set the global \textbf{strip-accents} option to false or create a new multi word list. Great care has been taken so that that only one spelling for any particular word is included in the main list. When two variants were considered equal I randomly picked one for inclusion in the main word list. Unfortunately this means that my choice in how to spell a word may not match your choice. If this is the case you can try to include one of the special variant dictionaries with the \textbf{add-extra-dict} option. You can chose from english-variant-0, english-variant-1, and english-variant-2. Each of these word lists included all the others from the previous variant level, thus there is no need to include more than one. English-variant-0 includes most variants which are considered almost equal, english-variant-1 include variants which are also generally considered acceptable, and english-variant-2 contains variants which are seldom used. These special variant dictionaries are an experimental feature so please let me know if you take advantage of them. If no one seams to be using them I may no longer provide them in a future release of Aspell. Many other dictionary sizes and varieties can be created. See the scowl/ directory in the source distribution for information on the different varieties you can create and section \ref{manage} for how to create an individual dictionary. \subsection{Notes on Various Filters and Filter Modes\label{filter}} Aspell now has rudimentary filter support. You can either select from individual filters or chose a filter mode. To select a filter mode use the \textbf{mode} option. You may chose from \textbf{none}, \textbf{url}, \textbf{email}, \textbf{sgml}, and \textbf{tex}. The default mode is \textbf{url}. Individual filters can be added with the option \textbf{add-filter} and remove with the \textbf{rem-filter} option. The currently available filters are \textbf{url}, \textbf{email}, \textbf{sgml}, \textbf{tex} as well as a bunch of filters which translate the text from one format to another. \subsubsection{None Mode} This mode is exactly what it says. It turns off all filters. \subsubsection{Url Filter/Mode} The \textbf{url} filter/mode skips over URL's, host names, and email addresses. Because this filter is almost always useful and rarely does any harm it is enabled in all modes except \textbf{none}. To turn it off either select the \textbf{none} mode or use \textbf{rem-filter} option \emph{after} the desired mode is selected. \subsubsection{Email Filter/Mode} The \textbf{email} filter/mode skips over quoted text. It currently does not support skipping over headers however a future version should. In the mean time I suggest you use Aspell with Newsbody which can be found at \url{http://home.worldonline.dk/~byrial/newsbody/}. The option \textbf{email-skip} controls the number of characters that can appear before the email quote char, the default is 10. The option \textbf{add|rem-email-quote} controls the characters that are considered quote characters, the default is {}``>' and '|'. \subsubsection{SGML Filter/Mode\label{sgml}} The \textbf{sgml} filter/mode will skip over sgml commands. It currently does not handle nested < > unless they are in quotes. It also does it handle the null end tag (net) minimization feature of sgml such as \begin{quote} psycolagest \end{quote} then store the replacement pair \begin{quote} psycolagest -> psychologist \end{quote} The replacement pair \begin{quote} sicolagest -> psychologist \end{quote} will also get stored so that you don't have to worry about it. \chapter{Adding Support For Other Languages\label{inter}} Before you consider adding support for Aspell for your language first check if it is already done via the Aspell-Dicts project. As of the release of support for the following languages is now available: \begin{quotation} \begin{tabular}{lll} Breton& {[}br{]}& \\ Catalan& {[}ca{]}& \\ Czech& {[}cs{]}& \\ Danish& {[}da{]}& \\ Dutch& {[}nl{]}& \\ Esperanto& {[}eo{]}& \\ Faroese& {[}fo{]}& \\ French& {[}fr{]}& (Standard {[}fr\_FR{]} and Swiss French {[}fr\_CH{]} and in three sizes: small, medium and large)\\ German& {[}de{]}& (Standard {[}de\_DE{]} and Swiss German {[}de\_CH{]})\\ Italian& {[}it{]}& \\ Norwegian& {[}no{]}& \\ Polish& {[}pl{]}& \\ Portuguese& {[}pt{]}& (Standard {[}pt\_PT{]} and Brazilian {[}pt\_BR{]})\\ Russian& {[}ru{]}& \\ Spanish& {[}es{]}& \\ Swedish& {[}sv{]}& \\ \end{tabular} \end{quotation} You can find all of these packages at the aspell home page (\url{http://aspell.sourceforge.net/}). If your language is not listed above please send me a note and I will work with you on adding support. \emph{The instructions below still apply however for this version of Aspell however they may not once the Aspell is merged into Pspell (see section \ref{future}).} Adding a language to aspell is fairly straightforward. You need to at very least create the language data file, and compile a new word list. You should also create a PWLI file for each of your word lists so that your new word lists will work correctly with the LANG environmental variable and with Pspell. Please note, however, that Aspell international support is not 100\% done yet. More information on my future planes for international support in aspell can be found at \url{http://aspell.sourceforge.net/international/}. \section{The Language Data File} The basic format of the language data data is the same as it for aspell configuration file. It is named \dlt{}lang\dgt{}.dat and is located in the architecture independent data dir for aspell (option \textbf{data-dir}) which is usually \dlt{}prefix\dgt{}/share/aspell. Use {}``aspell config'' to find out where it is in your installation. The language data file has several mandatory fields, and several optional ones. All fields are case sensitive and should be in all lower case. The two mandatory fields are \textbf{name} and \textbf{charset}. \textbf{Name} is the name of the language and should be the same as the file name (without the .dat). \textbf{Charset} is the charset aspell will expect the word lists to be formatted in. You may chose from any of the iso-8859-{*} character sets as well as, koi8-f, koi8-r, and viscii. If your language can fit in the plain old ASCII character set use iso8859-1. If you use some other character set for your language other than the ones listed here drop me a note and I will look into adding support for it. The optional fields are \textbf{special, soundslike}, \textbf{keyboard} and a bunch of options to specify how run-together words are handles. \textbf{Special} is for non letter characters that can appear in your language such as the \textbf{'} and \textbf{-}. The format for the value is a list separated by spaces. Each item of the list has the following format \begin{quote} \dlt{}char\dgt{}~~\dlt{}begin\dgt{}\dlt{}middle\dgt{}\dlt{}end\dgt{} \end{quote} \dlt{}char\dgt{} is the non letter character in question. \dlt{}begin\dgt{},\dlt{}middle\dgt{},\dlt{}end\dgt{} are either a '-' or a '{*}'. A star for \dlt{}begin\dgt{} means that the character at the beginning of the word, a '-' means it can't. The same is true for \dlt{}middle\dgt{} and \dlt{}end\dgt{}. For example the entry for the \textbf{'} in english is: \begin{lyxcode} '~-{*}- \end{lyxcode} To include more than one middle character just list them one after another on the same line. for example to make both the ' and the - a middle character use the following line in the language data file: \begin{lyxcode} special~'~-{*}-~-~-{*}- \end{lyxcode} The \textbf{soundslike} option, \textbf{}if present, should be the name of the soundslike data for the language. The data is expected to be in the file \dlt{}name\dgt{}\_phonetic.dat. If the name is \textbf{generic} a really generic soundslike algorithm will be used which consists of striping all the vowels and removing all accents. I recommend first using the generic algorithm and then, after aspell is working with the new language, work on the transformation array. If the soundslike name is \textbf{none} then no soundslike lookup table will be used. This will reduce the size of the compiled word list by around 50\% but at the sacrifice of suggestion quality. If the soundslike is none than the soundslike for the word will simply be the word itself in lowercase, will all accents stripped. For languages with phonetic spelling the difference will not be very noticeable. However, for languages with non-phonetic spelling there will be a noticeable difference. The difference you notice will depend on the quality of the soundslike data file. If you do not notice much of a difference for a language with non-phonetic spelling that is a good indication that the soundslike data is not rough enough---or the words you are trying are not that badly misspelled. The keyboard option specifies the base name of the keyboard definition file to use. See section \ref{typo} for more information. The options to control how run-together words are handled are the same as the are in the normal configurations files. Please see section \ref{run-together} for more information. \section{Compiling the Word List} Once you have a working language data file installed in the right place you are ready to compile the main word list. See section \ref{manage} to find out what to do. This section also includes instructions for creating the PWLI file. \section{Phonetic Code} \textit{(The following section was written by Björn Jacke, bjoern.jacke at gmx de)} Aspell is in fact the spell checker that comes up with the best suggestions if it finds an unknown word. One reason is that it does not just compare the word with other words in the dictionary (like Ispell does) but also uses phonetic comparisons with other words. The new table driven phonetic code is very flexible and setting up phonetic transformation rules for other languages is not difficult but there can be a number of stumbling stones --- that's why I wrote this section. The main phonetic code is free of any language specific code and should be powerful enough to allow setting up rules for any language. Anything which is language specific is kept in a plain text file and can easily be edited. So it's even possible to write phonetic transformation rules if you don't have any programming skills. All you need to know is how words of the language are written and how they are pronounced. \subsection{Syntax of the transformation array} In the translation array there are two strings on each line; the first one is the search string (or switch name) and the second one is the replacement string (or switch parameter). The line \begin{lyxcode} version~~~\dlt{}version\dgt{} \end{lyxcode} is also required to appear somewhere in the translation array. The version string can be anything but it should be changed when ever the a new version of the translation array is released. This is important because it will keep Aspell from using a compiled dictionary with the wrong set of rules. For example if when coming up with suggestion for {}``hallo'' aspell will use the new rules to come up with the soundslike say {}``H{*}L{*}'' but if hello is stored in the dictionary using the old rules as {}``HL'' instead of {}``H{*}L{*}'' aspell will never be able to come up with hello. So to solve this problem Aspell checks if the version strings match and abort with an error if they don't. Thus it is important to update it when ever a new version of the translation array is releases. This is only a problem with the main word list as the personal word lists are now stored as simple word lists with a single header line (ie, no soundslike data). Each non switch line represents one replacement (transformation) rule. Words beginning with the same letter must be grouped together; the order inside this group does not depend on alphabetical issues but it gives priorities; the higher the rule the higher the priority. That's why the first rule that matches is applied. In the following example: \begin{lyxcode} GH~~~~~\_~~\\ G~~~~~~K~~\\ ~ \end{lyxcode} {}``GH \nach \_'' has higher priority than {}``G \nach K''. {}``\_'' represents the empty string {}``''. If {}``GH \nach \_'' would stand after {}``G \nach K'', the second rule would never match because the algorithm would stop searching for more rules after the first match. The above rules transform any {}``GH'' to an empty string (delete them) and transform any other {}``G'' to {}``K''. At the end of the first string of a line (the search string) there may optionally stand a number of characters in brackets. One (only one!) of these characters must fit. It's comparable with the {[} {]} brackets in regular expressions. The rule {}``DG(EIY) \nach J'' for example would match any {}``DGE'', {}``DGI'' and {}``DGY'' and replace them with {}``J''. This way you can reduce several rules to one. Behind the search string there can stand one or more dashes (-). Those search strings will be matched totally but only the beginning of the string will be replaced. Furthermore for these rules no follow-up rule will be searched (what this is will be explained later). The rule {}``TCH\doubledash{} \nach \_'' will match any word containing {}``TCH'' (like {}``match'') but will only replace the first character {}``T'' with an empty string. The number of dashes determines how many characters from the end will not be replaced. After the replacement the search for transformation rules continues with the not replaced {}``CH''! If a {}``<'' is appended to the search string, the search for replacement rules will continue with the replacement string and not with the next character of the word. The rule {}``PH< \nach F'' for example would replace {}``PH'' with {}``F'' and then again start to search for a replacement rule for {}``F\ldots{}''. If there would also be rules like {}``FO \nach O'' and {}``F \nach \_'' then words like {}``PHOXYZ'' would be transformed to {}``OXYZ'' and any occurrences of {}``PH'' that are not followed by an {}``O'' will be deleted like {}``PHIXYZ \nach IXYZ''. The second replacement however is not applied if the priority of this rule is lower than the priority of the first rule. Priorities are added to a rule by putting a number between 0 and 9 at the end of the search string, for example {}``ING6 \nach N''. The higher the number the higher is the priority. Priorities are especially important for the previously mentioned follow-up rules. Follow-up rules are searched beginning from the last string of the first search string. This is a bit complicated but I hope this example will make it more clear: \begin{lyxcode} CHS~~~~~~X~\\ CH~~~~~~~G~\\ ~\\ HAU\doubledashb{}1~~~H~\\ ~\\ SCH~~~~~~SH~\\ ~ \end{lyxcode} In this example {}``CHS' in the word {}``FUCHS'' would be transformed to {}``X''. If we take the word {}``DURCHSCHNITT'' the things look a bit different. Here {}``CH'' belongs together and {}``SCH'' belongs together and both are spoken separately. The algorithm however first finds the string {}``CHS'' which may not be transformed like in the previous word {}``FUCHS''. At this point the algorithm can find a follow up rule. It takes the last character of the first matching rule ({}``CHS'') which is {}``S'' and looks for the next match, beginning from this character. What it finds is clear: It finds {}``SCH \nach SH'', which has the same priority (no priority means standard priority, which is 5). If the priority is the same or higher the follow-up rule will be applied. Let's take a look at the word {}``SCHAUKEL''. In this word {}``SCH'' belongs together and may not be torn apart. After the algorithm has found \char`\"{}SCH \nach SH\char`\"{} it searches for a follow-up rule for \char`\"{}H\char`\"{}+\char`\"{}AUKEL\char`\"{}. It finds \char`\"{}HAU\doubledash{}1 \nach H\char`\"{}, but does not apply it because its priority is lower than the one of the first rule. You see that this is a very powerful feature but it also can easily lead to mistakes. If you really don't need this feature you can turn it off by putting the line \begin{lyxcode} followup~~~~~~0~ \end{lyxcode} at the beginning of the phonetic table file. As mentioned, for rules containing a `-' no follow-up rules are searched but giving such rules a priority is not totally senseless because they self can be follow-up rules and in that case the priority makes sense again. Follow-up rules of follow-up rules are not searched because this is in fact not needed very often. The control character \char`\"{}\hoch\char`\"{} says that the search string only matches at the beginning of words so that the rule \char`\"{}RH\hoch \nach R\char`\"{} will only apply to words like \char`\"{}RHESUS\char`\"{} but not \char`\"{}PERHAPS\char`\"{}. You can append another \char`\"{}\hoch\char`\"{} to the search string. In that case the algorithm treats the rest of the word totally separately from first matched string in at beginning. This is useful for prefixes whose pronunciation does not depend on the rest of the word and vice versa like \char`\"{}OVER\hoch\hoch\char`\"{} in English for example. The same way as \char`\"{}\hoch\char`\"{} works does \char`\"{}\$\char`\"{} only apply on words that end with the search string. \char`\"{}GN\$ \nach N\char`\"{} only matches on words like \char`\"{}SIGN\char`\"{} but not \char`\"{}SIGNUM\char`\"{}. If you use \char`\"{}\hoch\char`\"{} and \char`\"{}\$\char`\"{} together, both of them must fit \char`\"{}ENOUGH\hoch\$ \nach NF\char`\"{} will only match the word \char`\"{}ENOUGH\char`\"{} and nothing else. Of course you can combine all of the mentioned control characters but they must occur in this order: \texttt{< - priority \^{} \$}. All characters must be written in CAPITAL letters. If absolutely no rule can be found --- might happen if you use strange characters for which you don't have any replacement rule --- the next character will simply be skipped and the search for replacement rules will continue with the rest of the word. If you want double letters to be reduced to one you must set up a rule like \char`\"{}LL- \nach L\char`\"{}. If double letters in the resulting phonetic word should be allowed, you must place the line \begin{lyxcode} collapse\_result~~~~~0~ \end{lyxcode} at the beginning of your transformation table file; otherwise set the value to `1'. The English rules for example strip all vowels from words and so the word \char`\"{}GOGO\char`\"{} would be transformed to \char`\"{}K\char`\"{} and not to \char`\"{}KK\char`\"{} (as desired) if \texttt{collapse\_result} is set to 1. That's why the English rules have \texttt{collapse\_result} set to \texttt{0}. \subsection{How do I start finally?} Before you start to write an array of transformation rules, you should be aware that you have to do some work to make sure that things you do will result in correct transformation rules. \subsubsection{Things that come in handy} First of all you need to have a large word list of the language you want to make phonetics for. It should contain about as many words as the dictionary of the spell checker. If you don't have such a list, you will probably find an Ispell dictionary at \url{http://fmg-www.cs.ucla.edu/geoff/ispell-dictionaries.html} which will help you. You can then make affix expansion via \texttt{ispell -e} and then pipe it trough \texttt{\textbackslash{}tr \char`\"{} \char`\"{} \char`\"{}\textbackslash{}n\char`\"{}} to put one word on each line. After that you eventually have to convert special characters like `é' from Ispell's internal representation to latin1 encoding. \texttt{sed s/e'/é/g} for example would replace all e' with é. The second is that you know how to use regular expressions and know how to use \texttt{grep}. You should for example know that \begin{lyxcode} grep~\^{}{[}\^{}aeiou{]}qu{[}io{]}~wordlist~|~less \end{lyxcode} will show you all words that begin with any character but a, e, i, o or u and then continue with `qui' or `quo'. This stuff is important for example to find out if a phonetic replacement rule you want to set up is valid for all words which match the expression you want to replace. Taking a look at the regex(7) man page is a good idea. \subsubsection{What the phonetic code should do} Normal text comparison works well as long as the typer misspells a word because he pressed one key he didn't really want to press. In this cases mostly one character differs from the original word. In cases where the writer didn't know about the correct spelling of the word however the word may have several characters that differ from the original word but usually the word would still sound like the original word. Someone might think for example that `tough' is spelled `taff'. No spell checker without phonetic code will come to the idea that this might be `tough' but a spell checker who knows that `taff' would be pronounced like `tough' will make good suggestions to the user. Another example could be `funetik' and `phonetic'. From this examples you can see that the phonetic transformation should not be too fussy and too precise. If you implement a whole phonetic dictionary as you can find it in books this will not be very useful because then there could still be many characters differing from the misspelled and the desired word. What you should do if you implement the phonetic transformation table is to reduce the number of used letters to the only really necessary ones. Characters that sound similar should be reduced to one. In English language for example `Z' sounds like `S' and that's why the transformation rule {}``Z \nach S'' is present in the replacement table. `PH' is spoken like `F' and so we have a {}``PH \nach F'' rule. If you take a closer look you will even see that vowels sound very similar in English language: `contradiction', `cuntradiction', `cantradiction' or `centradiction' in fact sound nearly the same, don't they? Therefore the English phonetic replacement rules not only reduce all vowels to one but even remove them all (removing is done by just setting up no rule for those letters). The phonetic code of `contradiction' is `KNTRTKXN' and if you try to read this letter-monster loud you will hear that it still sound a bit like `contradiction'. You also see that `D' is transformed to `T' because they nearly sound the same. If you think you have found a regularity you should \emph{always} take your word list and grep for the corresponding regular expression you want to make a transformation rule for. An example: If you come to the idea that all English words ending on `ough' sound like `AF' at the end because you think of `enough' and `tough'. If you then grep for the corresponding regular expression by {}``grep -i ough\$ wordlist'' you will see that the rule you wanted to set up is not correct because the rule doesn't fit to words like `although' or `bough'. So you have to define your rule more precisely or you have to set up exceptions if the number of words that differ from the desired rule is not so big. Don't forget about follow-up rules which can help in many cases but which also can lead to many confusions and side effects. It's also important to write exceptions in front of the more general rules (\char`\"{}GH\char`\"{} before \char`\"{}G\char`\"{} etc.). If you think you have set up a number of rules that may produce some good results try them out! If you run Aspell as \texttt{}{}``aspell \doubledash{}lang=\dlt{}your language\dgt{} pipe'' you get a prompt at which you can type in words. If you just type words Aspell checks them and eventually makes suggestions if they are misspelled. If you type in {}``\$\$Sw \dlt{}word\dgt{}'' you will see the phonetic transformation and you can test out if your work does what you want. Another good way to control if changes you apply to your rules don't have any evil side effects is to create another list from your word list which contains not only the word of the word list but also the corresponding phonetic version of this word on the same line. If you do this one time before the change and one time after the change you can make a diff (see \texttt{man diff}) to see what \emph{really} changed. To do this use the command {}``aspell \doubledash{}lang=\dlt{}your language\dgt{} soundslike''. In this mode aspell will output the the original word and then its soundslike separated by a tab character for each word you give it. If you are interested in seeing how the algorithm works you can download a set of useful programs from \url{http://members.xoom.com/maccy/spell/phonet-utils.tar.gz}. This includes a program that produces a list as mentioned above and another program which illustrates how the algorithm works. It uses the same transformation table as Aspell and so it helps a lot during the process of creating a phonetic transformation table for Aspell. During your work you should write down your basic ideas so that other people are able to understand what you did (and you still know about it after a few weeks). The English table has a huge documentation appended for example. Now you can start experimenting with all the things you just read and perhaps set up a nice phonetic transformation table for your language to help Aspell to come up with the best correction suggestions ever seen also for your language. Take a look at the Aspell homepage to see if there is already a transformation table for your language. If there is one you might also take a look at it to see if it could be improved. If you think that this section helped you or if you think that this is just a waste of time you can send any feedback to \emph{}\textit{\emph{bjoern.jacke@gmx.de}}\emph{.} \section{Controlling the Behavior of Run-together Words\label{run-together}} Aspell has support for either unconditionally accepting run-together words or only accepting certain words in compound formation. Support for unconditionally accepting run-together words can either be turned on in the language data file or as a normal option via the \textbf{run-together} option. The \textbf{run-together-limit} options controls the maximum number of words that can be strung together, the default is normally 255. The \textbf{run-together-min} options controls the minimal length the individual components of the run together word can be, the default is normally 3. Both the run-together-limit and run-together-min option may be specified in both the language data file or as a normal. The \textbf{run-together-mid} option, which may only be specified in the language data file, may be used to specify up to three optional characters that may appear between individual words. In order for aspell to conditionally only accept certain words in compounds those words must be flagged when the compiled word list is being created. The format for each entry is \begin{quote} \dlt{}word\dgt{}:C{[}1{]}{[}2{]}{[}3{]}\dlt{}middle char\dgt{} \end{quote} The 1, 2, and 3 control if the word is allowed to appear in the begging, middle, or end of the compound, respectfully. More than one position flag may be specified. If none of them are specified it as assumed that the word may appear anywhere. The C is optional if 1, 2, or 3 is specified. The \dlt{}middle char\dgt{} represents an optional character that may appear after the word in the formation of the compound if the word is not at the end of the compound. If the letter is lowercase than the character may appear after the word, if it is in uppercase then that letter must appear after the compound. Only one letter may be specified and it must also be in the list of middle letters specified via the \textbf{run-together-mid} option. \textbf{}The \textbf{run-together-limit} option may also be used to specify the maximum number of words to string together. For example the word list: \begin{quote} beg:1\\ mid:2 \\ end:3 \\ any:C \\ never \\ must:CM \\ maybe:Cm \end{quote} Means that the word {}``beg'' may only appear at the begging of a word, the word {}``mid'' at the middle, the word {}``end'' at the end, and the word {}``any'' any place. The word {}``never'' is never accepted in a compound unless the \textbf{run-together} option is set. The word {}``must'' may appear anywhere however it must be followed by an {}``m'', while the word maybe may be followed by an {}``m''. Given the above word list the following compounds or legal: \begin{quote} begmidend\\ begany\\ mustmend \\ maybeend \\ maybemend \end{quote} are all legal, but the following are not: \begin{quote} begmid\\ mustend\\ neverany \end{quote} Individual words such as {}``beg'' are always accepted. When the \textbf{run-together} option is not set Aspell will only accept words that have been flagged in a run-together word. When the \textbf{run-together} option is set aspell will accept words which are as least as long as the value specified in the \textbf{run-together-min} option. If the words length is less than \textbf{run-together-min} then it will only accept the word if it has been flagged. When the \textbf{run-together} option is not set the \textbf{run-together-min} option is ignored all together. Currently Aspell only supports run-together words when checking if a word is in the dictionary. When coming up with suggestions Aspell treats the word as a normal word and does not do anything special. This means that the suggestions will be virtually meaningless when the actual word is a run-together. I plan on more intelligently supporting run-together words when coming up with suggestions in a future version of Aspell. \chapter{How Aspell Works\label{works}} The magic behind my spell checker comes from merging Lawrence Philips excellent metaphone algorithm and Ispell's near miss strategy which is inserting a space or hyphen, interchanging two adjacent letters, changing one letter, deleting a letter, or adding a letter. The process goes something like this. \begin{enumerate} \item Convert the misspelled word to its soundslike equivalent (its metaphone for English words). \item Find all words that have a soundslike within one or two edit distances from the original words soundslike. The edit distance is the total number of deletions, insertions, exchanges, or adjacent swaps needed to make one string equivalent to the other. When set to only look for soundslikes within one edit distance it tries all possible soundslike combinations and check if each one is in the dictionary. When set to find all soundslike within two edit distance it scans through the entire dictionary and quickly scores each soundslike. The scoring is quick because it will give up if the two soundslikes are more than two edit distances apart. \item Find misspelled words that have a correctly spelled replacement by the same criteria of step number 2 and 3. That is the misspelled word in the word pair (such as teh -> the) would appear in the suggestions list as if it was a correct spelling. \item Score the result list and return the words with the lowest score. The score is roughly the weighed average of the weighed edit distance of the word to the misspelled word and the soundslike equivalent of the two words. The weighted edit distance is like the edit distance except that the various edits have weights attached to them. \item Replace the misspelled words that have correctly spelled replacements with their replacements and remove any duplicates that might arise because of this. \end{enumerate} Please note that the soundslike equivalent is a rough approximation of how the words sounds. It is not the phoneme of the word by any means. For more details about exactly how each step is performed please see the file \textbf{suggest.cc}. For more information on the metaphone algorithm please see the data file \textbf{english\_phonet.dat}. \appendix \chapter{Changelog} \section*{Changes from .33.7 to .33.7.1 (Aug 20, 2001)} \begin{itemize} \item Minor manual fixes. \item Compile fix for Gcc 3.0 and Solaris. \end{itemize} \section*{Changes from .33.6.3 to .33.7 (Aug 2, 2001)} \begin{itemize} \item Updates to Autoconf 2.50 and switched to the HEAD branch of libtools. \item Fixed a bug which caused Aspell to crash when typo-analysis is not used such as when sug-mode is \textbf{fast} or \textbf{bad spellers}. \item Added support for typo-analysis even when a soundslike is not used. \item Fixed a bug which causes extended charters to display incorrectly on some platforms \item Compile fixes so that it will compile with Gcc 3.0. \item Compile fixed which should allow Aspell to compile with Egcs 1.1. I have not been able to actually test it though. Please let me know at kevina@users.sourceforge.net if you have have tried with Egcs 1.1. \item Compile and configuration script fixes so that USE\_FILE\_INO will properly be defined and Aspell will compile correctly when it is defined. \item More ANSI C++ compliance fixes. \end{itemize} \section*{Changes from .33.6.2 to .33.6.3 (June 3, 2001)} \begin{itemize} \item Fixed a build problem in the manual/ directory by including manual-text and manual-html in the distribution. \end{itemize} \section*{Changes from .33.6.1 to .33.6.2 (June 3, 2001)} \begin{itemize} \item Compile fix so that Aspell will work correctly when not installed in /usr/local. \item Avoided regenerating the manual unless configured with enable-maintainer-mode. \item Added the missing documentation files in the scowl directory. \end{itemize} \section*{Changes from .33.6 to .33.6.1 (May 29, 2001)} \begin{itemize} \item Fixed a formating problem with the manual involving \dlt{}. \item Added a note about creating PWLI files. \item Removed the space after between the -L and the directory name in the pspell-module/ Makefile which caused problems on some platforms. \item Added the configure option AM\_MAINTAINER\_MODE to avoid enabling rules which often causes generated build files to be rebuild with the wrong version of Libtool by default. I don't know why I didn't think to do this a long time ago. \end{itemize} \section*{Changes from .33.5 to .33.6 (May 18, 2001)} \begin{itemize} \item Fixed a minor bug where some words would have random compound tags attached to them. \item Fixed a compile problem on many platforms where fileno is defined as a macro. \item Updated the description for a few of Aspell's options. \item Removed the note of Aspell not being able to run when compiled with the upcoming Gcc 3.0 compiler as things seam to work now. \item Added a note about Aspell not being able to compile with Egcs 1.1. \item Added hack to deal with Libtool's interdependencies problem. See bug \#416981 for Pspell for more info. \end{itemize} \section*{Changes from .33 to .33.5 (April 5, 2001)} \begin{itemize} \item {}``dump master'' correctly detects which dictionary and language to use based on the LANG environmental variable. \item Fixed a problem on Win32 which involves path names that begin with \dlt{}Drive Letter\dgt{}:. \item Bug fixes and enhancements so that Aspell can once again run under MinGW. You can even use the new full screen interface if Aspell is compiled with PDCurses. \item Some major modifications to make Aspell more C++ compliant in order to get Aspell to compile under the upcoming Gcc 3.0 compiler. This included only using STL features found in the standard version of C++. (Which means Aspell will no longer require using the SGI version of the STL) This should also make compiling C++ under non-gcc compilers a lot simpler. Please not that Aspell still has some problems with the upcoming Gcc 3.0 compiler (see section \ref{gcc3.0} for more info). \item Minor changes to remove some -Wall warnings. \item Added a hack to that Aspell will properly compile as a shared library under Solaris. \item Added a few import missing words to the English word list. \end{itemize} \section*{Changes from .32.6 to .33 (January 28, 2001)} \begin{itemize} \item Added a new new curses based interface to replace the dumb terminal interface everyone has been bitching about. \item Added the ability to give higher priority to words such as \char`\"{}the\char`\"{} instead of \char`\"{}teh\char`\"{} which are likely to be due to typos. \item Reorganized the manual so that it is hopefully easier to follow. \item Ability to automatically select the best dictionary to used based on the setting of the LANG environmental variable. \item Expanded the medium dictionary size to include more words which included the original words found in ispell and eliminated the large size for now. \item Added three special variant add-on dictionaries. \item Switched to the multi-language branch of the CVS version of libtool. \end{itemize} \section*{Changes from .32.5 to .32.6 (Nov 8, 2000)} \begin{itemize} \item Fixed a bug that where Aspell will crash when reading in accented characters on some platforms. This fixes bug \# 112435. \item Fixed some other bugs so that it will run under Win32 under CygWin. Unfortunately it still won't run properly under Mingw. \item Fixed the mmap test in configure so that it won't fail on some platforms that use munmap(char {*}, int) instead of munmap(void {*}, int). \item Upgraded to the latest CVS version of libtool which fixed the problem with using GNU Make under Solaris. \item Added an option to copy files instead of using symbolic links for the special {}``multi'' dictionary files. \end{itemize} \section*{Changes from .32.1 to .32.5 (August 18, 2000)} \begin{itemize} \item Changed my email from kevinatk at home com to kevina at users sourceforge net please make a note of the new email address. \item Added an option to control if the personal replacement dictionary is saved when the save\_all\_wls method is called. \item Brought back the ability to dump the master word list even in the case of the special {}``multi'' lists. \item Added a large number of hacker related words and some other slang terms to the medium size word list. \item Added an {}``ispell'' and {}``spell'' compatibility script for systems which don't have ispell installed. They are located in the scripts/ directory and are not installed by default. \item Manual fixes. \item Added a note on not using GNU Make on Solaris. \end{itemize} \section*{Changes from .32 to .32.1 (August 5, 2000)} \begin{itemize} \item Minor compile fixes for recent gcc snapshot. \item Fixed naming of pwli files. \item Fixed a bug when aspell will crash when used with certain single letter flags. This bug was most noticeable when used with Emacs. \item Word list changes, see SCOWL Readme. \item Other miscellaneous changes. \end{itemize} \section*{Changes from .31.1 to .32 (July 23, 2000)} \begin{itemize} \item Added support for optionally doing without the soundslike data. \item Greatly reduced the amount of memory used when creating word lists. \item Added support for ignoring accents when coming up with suggestions. \item Added support for local-data-dir which is searched before data-dir. \item Added support for specifying which words may be used in compounds and where they may be used. \item Added support for having more than one main word list as well as a special {}``multi'' word list files which will allow multiple word lists to be treated as one. \item Aspell now uses a completely new word list. \item The apostrophe (') is no longer considered part of the word when it as at the end of the word such as in {}`` dogs' ''. \end{itemize} \section*{Changes from .31 to .31.1 (June 18, 2000)} \begin{itemize} \item Fixed a bug where Aspell would not create a complete dictionary file on some platforms when the data is 8-bit. \item Added a workaround so Aspell will work with ispell.el 3.3. \item Minor compile fixes so it would compile better with the very latest gcc (CVS Version). \item Removed note about compiling in Win32 as I was now able to get it to work. \end{itemize} \section*{Changes from .30.1 to .31 (June 11, 2000)} \begin{itemize} \item Added support for spell checking run together words. \item Added an option to produce a list of misspelled words from standard input. \item More robust error reporting when reading in language data files. \item Fixed a bug when that will cause Aspell to crash if the {}``special'' line was not defined in the language data file. \item Update Pspell Module. \item Minor bug fixes. \item Added cross references in {}``The Aspell utility Chapter'' for easier use. \end{itemize} \section*{Changes from .30 to .30.1 (April 29, 2000)} \begin{itemize} \item Ported Aspell to Win32 platforms. \item Portability fixes which may help aspell compile on other platforms. \item Aspell will no longer fail if for some reason the mmap fails, instead it will just read the file in as normal and free the memory when done. \item Minor changes in the format of the main word list as a result of the changes, the old format should still work in most cases. \item Fixed a bug when aspell was ignoring the extension of file names such as .html or .tex when checking files. \item Fixed a bug when aspell will go into an infinite loop when creating the main word list from a word list which has duplicates in it. \item Minor changes to the manual for better clarity. \end{itemize} \section*{Changes from .29.1 to .30 (April 2, 2000)} \begin{itemize} \item Fixed many of the capitalization bugs found in previous versions of Aspell. \item Changed the format of the main word list yet again. \item Fixed a bug so that {}``aspell check'' will work on the PowerPC. \item Added ability to change configuration options in the middle of a session. \item Added words from /usr/dict/words found on most Linux systems as well as a bunch of commonly used abbreviation to the word list. \item Fixed a bug when aspell will dump core after reporting certain errors when compiled with gcc 2.95 or higher. This involved reworked the Exception heritage to get around a bug in gcc 2.95. \item Added a few more commands to the list of default commands the \TeX{} filter knows about. \item Aspell will now check if a word only contains valid characters before adding it to any dictionaries. This might mean that you have to manually delete a few words from your personal word list. \item Added option to ignore case when checking a document. \item Adjusted the parameters of the {}``normal'' suggest mode to so that significantly less far fetched results are returned in cases such as tomatoe, which went from 100 suggestions down to 32, at the expense of getting slightly lower results (less than 1\%), \item Improved the edit distance algorithm for slightly faster results. \item Removed the \$\$m command in pipe mode, you should now use {}``\$\$cs mode,\dlt{}mode\dgt{}'' to set the mode and {}``\$\$cr mode'' to find out the current mode. \item Reworked parts of Aspell to use Pspell services to avoid duplicating code. \item Added a module for the newly released Pspell. It will get installed with the rest of aspell. \item Miscellaneous other bug fixes. \end{itemize} \section*{Changes from .29 to .29.1 (Feb 18, 2000)} \begin{itemize} \item Improved the \TeX{} filter so that it will accept '@' at the begging of a command name and ignore trailing '{*}'s. It also now has better defaults for which parameters to skip. \item Reworked the main dictionary so that it can be memory mapped in. This decreases startup time and allows multiple aspell processes to use the same memory for the main word list. This also also made Aspell 64 bit clean so that it should work on an alpha now. \item Fix so that aspell can compile on platforms that gnu as is not available for. \item Fixed issue with flock so it would compile on FreeBSD. \item Minor changes in the code to make it more C++ compliant although I am sure there will still be problems when using some other compiler other than gcc or egcs. \item Added some comments to the header files to better document a few of the classes. \end{itemize} \section*{Changes from .28.3 to .29 (Feb 6, 2000)} \begin{itemize} \item Fixed a bug in the pipe mode with lines that start with {}``\^{}\$\$''. \item Added support for ignoring all words less than or equal to a specified length \item New soundslike code based thanks to the contribution of Björn Jacke. It now gets all of its data from a table making it easier for other people to add soundslike code for their native language. He also converted the metaphone algorithm to table form, eliminating the need for the old metaphone code. \item Major redesign of the suggestion code for better results. \item Changed the format of the personal word lists. In most cases it should be converted automatically. \item Changed the format of the main word list. \item Name space cleanup for more consistent naming. I now use name spaces which means that gcc 2.8.{*} and egcs 1.0.{*} will no longer cut it. \item Used file locks when reading and saving the personal dictionaries so that it truly multiprocess safe. \item Added rudimentary filter support. \item Reworked the configuration system once again. However, the changes to the end user who does not directly use my library should be minimal. \item Rewrote my code that handles parsing command line parameters so that it no longer uses popt as it was causing to many problems and didn't integrate well with my new configuration system. \item Fixed pipe mode so that it will properly ignore lines starting with '\~{}' for better ispell compatibility. \item Aspell now has a new home page at \url{http://aspell.sourceforge.net/}. Please make note of the new URL. \item Miscellaneous manual fixes and clarifications. \end{itemize} \section*{Changes from .28.2.1 to .28.3 (Nov 20, 1999)} \begin{itemize} \item Fixed a bug that caused aspell to crash when spell checking words over 60 characters long. \item Reworked {}``aspell check'' so that \begin{enumerate} \item You no longer have to hit enter when making a choice. \item It will now overwrite the original file instead of creating a new file. An optional backup can be made by using the -b option. \end{enumerate} \item Fixed a few bugs in data.cc. \end{itemize} \section*{Changes from .28.2 to .28.2.1 (Aug 25, 1999)} \begin{itemize} \item Fixed the version number for the shared library. \item Fixed a problem with undefined references when linking to the shared library. \end{itemize} \section*{Changes from .28.1 to .28.2 (Aug 25, 1999)} \begin{itemize} \item Fixed a bunch of bugs in the language and configuration classes. \item Minor changed in the code so that it could compile with the new gcc 2.95(.1). \item Changed the output of {}``dump config'' so that default values are given the value \char`\"{}\char`\"{}. This means that the output can be used to created a configuration file. \item Added notes on using aspell with VIM. \end{itemize} \section*{Changes from .28 to .28.1 (July 27, 1999)} \begin{itemize} \item Removed some debug output \item Changed notes on compiling with gcc 2.8.{*} as I managed to get it to compile on my school account \item Avoided included \textbf{stdexcept} in \textbf{const\_string.hh} so that I could get to compile on my schools account with gcc 2.8.1. \end{itemize} \section*{Changes from .27.2 to .28 (July 25, 1999)} \begin{itemize} \item Provided an iterator for the replacement classes. \item Added support for dumping and creating and merging the personal and replacement word lists. \item Changed the aspell utility command line a bit, it now used popt. \item Totally reworked aspell configuration system. Now aspell can get configuration from any of 5 sources: the command line, the environmental variable ASPELL\_CONF, the personal configuration file, the global configuration file, and finally the compiled in defaults. \item Totally reworked the language class in preparation for my new language code. See \url{http://aspell.sourceforge.net/international/} for more information of what I have in store. \item Added some options to the configure script: \doubledash{}enable-dict-dir=DIR, \doubledash{}enable-doc-dir=DIR, \doubledash{}enable-debug, and \doubledash{}enable-opt \item Removed some old header files. \item Reorganized the directory structure a bit \item Made the text version of the manual pages slightly easier to read \item Used the \textbackslash{}url command for urls for better formating of the printed version. \end{itemize} \section*{Changes from .27.1 to .27.2 (Mar 1, 1999)} \begin{itemize} \item Fixed a major bug that caused aspell to dump core when used without any arguments \item Fixed another major bug that caused aspell to do nothing when used in interactive mode. \item Added an option to exit in Aspell's interactive mode. \item Removed some old documentation files from the distribution. \item Minor changes on to the section on using Aspell with egcs. \item Minor changes to remove -Wall warnings. \end{itemize} \section*{Changes from .27 to .27.1 (Feb 24, 1999)} \begin{itemize} \item Fixed a minor compile problem. \item Updated the section on using Aspell with egcs to it it more clear why the patch is necessary. \end{itemize} \section*{Changes from .26.2 to .27 (Feb 22, 1999)} \begin{itemize} \item \textbf{Totally reworked the C++ library which means you may need to change some things in your code.} \item Added support for detachable and multiple personal dictionaries in the C++ class library. \item The C++ class library now throws exceptions. \item Reworked aspell ability to learn from users misspellings a bit so that it now has a memory. See section \ref{replpair} for more information. \item Upgraded autoconf to version 2.13 and automake to version 1.4 for better portability. \item Fixed the configuration so the {}``make dist'' will work. For now on aspell will be distributed with {}``make dist''. \item Added support to skip over URL's, email addresses and host names. \item Added support for dumping the master and personal word list. You can now also merge a personal word list. Type aspell -help for help on using this feature. \item Reorganized the source code. \item Started using proper version numbers for the shared library. \item Fixed a bug that caused a spell to crash when adding certain replacement pairs. \item Fixed the problem with duplicate lines when exiting pipe mode for good. \end{itemize} \section*{Changed from .26.1 to .26.2 (Jan 3, 1998)} \begin{itemize} \item Fixed another compile problem. Hopefully this time it will really compile OK on other peoples machines. \end{itemize} \section*{Changed from .26 to .26.1 (Jan 3, 1998)} \begin{itemize} \item Fixed a small compile problem in \textbf{as\_data.cc}. \end{itemize} \section*{Changed from .25.1 to .26 (Jan 3, 1999)} \begin{itemize} \item Fixed a bug that causes duplicates items to be displayed in the suggestion list for good. (If it still does it please send be email.) \item Added the ability for aspell to learn form the users misspellings. \item Library Interface changes. Still more to come.... \item Is now multiprocess safe. When a personal dictionary (or replacement list) is saved it will now first update he list against the dictionary on disk in case another process modified it. \item Fixed the bug that caused duplicate output when used non interactively in pipe mode. \item Dropped support for gcc 2.7.2 as the C++ compiler. \item Updated the How Aspell Works (\ref{works}) Chapter. \item Added support for the ASPELL\_DATA\_DIR environmental variable. \end{itemize} \section*{Changes from .25 to .25.1 (Dec 10, 1998)} \begin{itemize} \item Fixed the version number so that Aspell reports the correct version number. \item Changed the note on gcc 2.7.2 compilers to make it clear that only the C++ compiler can not be gcc 2.7.2, it is ok if the C compiler is gcc 2.7.2. \item Updated the TODO list and reorganized it a bit. \item Fixed the directory so that all the documentation will get installed in \$\{prefix\}/doc/aspell instead of half of it in \$\{prefix\}/doc/aspell and half of it in \$\{prefix\}/doc/kspell. \end{itemize} \section*{Changes from .24 to .25 (Nov 23, 1998)} \begin{itemize} \item Total rework of how the main word list is stored. Start up time decreased to about 1/3 of what it was in .24 and memory usage decreased to about 2/3. (When used with the provided word list on a Linux system). \textbf{}Also the \textbf{format and default locations of the main word list data files changed} in the process and the data \textbf{is now machine dependent}. The personal word list format, however, stayed the same. \item Changed the scoring method to produce slightly better results with words like the vs. teh. And other simpler misspellings where two letters are swapped. \item Fixed the very unpredictable behavior of the '{*}', '\&', '@' commands in the pipe mode. \item Added documentations for Aspell pipe mode (also known as ispell -a compatibility mode) \item Added a bunch of Aspell specific extensions to the pipe mode and documented them. \item Documented the \textbf{to\_soundslike} and \textbf{soundslike} methods for the \textbf{aspell} class. \item Changed the scoring method to produce better results for words like {}``fone'' vs {}``phone'' and other words that have a spelling that doesn't directly relate to how the word sounds by using the phoneme equivalent of the word in the scoring of it. \item Added the \textbf{to\_phoneme} and \textbf{have\_phoneme} methods to the \textbf{SC\_Language} class. \item Added the \textbf{to\_phoneme} method to the \textbf{aspell} class. \item Added the framework for being able to learn from the users misspelling. Right now it just keep a log of replacements. \item Re did \textbf{stl\_rope-30.diff}. \textbf{}For some reason the version of patch on my system refused it. \item Rewrite of the {}``Using as a replacement for Ispell'' section and added the \textbf{run-with-aspell} utility as a replacement of the old method of mapping Ispell to Aspell. \item Fixed a bug that caused duplicate words to appear in the suggestion list. \end{itemize} \section*{Changes from .23 to .24 (Nov 8, 1998)} \begin{itemize} \item Fixed my code so that it can once again compile with g++ 2.7.2. \item Rewrote the How It Works chapter. \item Rewrote the Requirement section and added noted on compiling with g++ 2.7.2. \item Added a To Do chapter. \item Added a Glossary and References chapter. \item Other minor documentation improvements. \item Internal code documentation improvements. \end{itemize} \section*{Changes from .22.1 to .23 (Oct 31, 1998)} \begin{itemize} \item Minor documentation fixes. \item Changed the scoring strategy for words with 3 or less letters. This cut the number of words returned for these roughly in half. \item Expanded the word list to also include \textbf{american.0} and \textbf{american.1} from the Ispell distribution. It now includes \textbf{english.0}, \textbf{english.1}, \textbf{american.0} and \textbf{american.1} from the directory \textbf{languages/english} provided with Ispell 3.1.20. \item Added a link to the location of the latest Ispell.el in the documentation. \item Started a C interface and added some rough documentation for it. \end{itemize} \section*{Changes from .22 to .22.1 (Oct 27, 1998)} \begin{itemize} \item Minor bug fixes. I was deleting arrays with delete rather than delete{[}{]}. I was suppressed that this had not created a problem. \item Added a simple test program to test for a memory leak present on some systems. (Only debian slink at the moment.) See the file memleak-test.cc for more info. \end{itemize} \section*{Changes from .21 to .22 (Oct 26, 1998)} \begin{itemize} \item Major redesign or the scoring method. It now uses absolute distances rather than relative scores for more consistent results. See suggest.cc for more info. \item Suggest code rewritten is several places however the core process stayed the same. \item The suggest\_ultra method temporally does nothing. It should be working again by the next release. \end{itemize} \section*{Changes from .20 to .21 (Oct 13, 1998)} \begin{itemize} \item Added documentation for aspell::Error \item \textbf{Changed the library name from libspell to libaspell.} It should never have been libspell in the first place. Sorry for the incompatibility. \item Added \textbf{as\_error.hh} to the list of files copied to the include directory so that you can actually use the library outside of the source dir. \item Fixed bug that caused a segmentation fault with words where the only suggestions was inserting a space or hyphen such as in {}``ledgerline''. \item Added the \textbf{score} method to \textbf{aspell}. \item Changed the scoring method to deal with word when the user uses \char`\"{}f\char`\"{} in place of \char`\"{}ph\char`\"{} a lot better. \end{itemize} \section*{Changes from .11 to .20 (Oct 10, 1998)} \begin{itemize} \item \textbf{Name change.} \textbf{\emph{}}\textbf{Everything that was Kspell is now Aspell.} Sorry, the name Kspell was already used by KDE and I didn't want to cause any confusion. \item Fixed a bug that causes a segmentation fault when the HOME environmental variable doesn't exist. \end{itemize} \section*{Changes from .10 to .11 (Sep 12, 1998)} \begin{itemize} \item Overhaul of the SC\_Language class \item Added documentation for international support \item Added documentation for the C++ library \item Other minor bug fixes. \end{itemize} \chapter{To Do} Words in bold indicate how you should refer to the item when discussing it with me or others. \section{Things that will be done real soon} These items should get done within a release or two. \begin{itemize} \item Totally rewrite the aspell international support. See \url{http://aspell.sourceforge.net/international/} for more information. \item Rework the aspell \textbf{check} function to provide support for using any number of filters which will be needed for international support. \item Add support for more intelligently coming up with suggestions for words that are run-togethers. \end{itemize} \section{Things that need to be done} Things items will eventually be implemented as I know they are important however I am not sure when they will get done. \begin{itemize} \item Figure out a way for Aspell to work better with \textbf{extremely large dictionaries}. \end{itemize} \section{Things that I would like to get done} These items will eventually be implemented. I hope to have them all done before I move aspell to beta testing. They are in the approximate order of when they will get done. \section{Things that will be done eventually} I plan on doing these things eventually. It is just a matter of getting around to it. \section{Good ideas that are worth implementing} These items all sound like good ideas however I am not sure when I will get to implementing then if ever. If you are looking for a way to contribute picking up on one of these ideas would be a great way to start. They are presented in no particular order. \begin{itemize} \item Use Lawrence Philips' new Double Metaphone algorithm. See \url{http://aspell.sourceforge.net/metaphone/}. \item Add support for \textbf{affix compression.} \item Come up with a plug-in for \textbf{gEdit} the gnome text editor. \item Change languages (and thus dictionaries) based on the information in the actual document. \item Come up with a \textbf{nroff mode} for spell checking. \item Come up with a mode that will skip words based on the symbols that (almost) always surround the word. (\textbf{Word skipping by context}) \item Create two \textbf{server modes} for Aspell. One that uses the \textbf{DICT} protocol and one that uses \textbf{ispell -a} method of communication of some arbitrary port. \item Come up with \textbf{thread safe personal dictionaries}. \item Use the \textbf{Hidden Markov Model} to base the suggestions on not only the word itself but on the context around the word. \item Having a way to \textbf{email} \textbf{the personal dictionary} and/or replacement list to a particular address either periodical or when it grows to a certain size. \item Be able to \textbf{}accept \textbf{words with spaces in them} as many languages have words such as as a word in a foreign phrases which only make sense when followed by other words. \end{itemize} The following good ideas where found in the ispell WISHES file so I thought I would pass them on. \begin{itemize} \item Ispell should be smart enough to ignore hyphenation signs, such as the \TeX{} \textbackslash{}- hyphenation indicator. \item (Jeff Edmonds) The personal dictionary should be able to remove certain words from the master dictionary, so that obscure words like \char`\"{}wether\char`\"{} wouldn't mask favorite typos. \item (Jeff Edmonds) It would be wonderful if ispell could correct inserted spaces such as \char`\"{}th e\char`\"{} for \char`\"{}the\char`\"{} or even \char`\"{}can not\char`\"{} for \char`\"{}cannot\char`\"{}. \item Since ispell has dictionaries available to it, it is conceivable that it could automatically determine the language of a particular file by choosing the dictionary that produced the fewest spelling errors on the first few lines. \end{itemize} \section{Things that are not likely to get implemented} Theses ideas are not likely to get implemented any time soon. \begin{itemize} \item (None Yet) \end{itemize} \section{Notes and Status of various items} \subsection{Affix Compression\label{affixcomp}} Due to the current way my spell checker works implementing affix compression would be next to impossible. Nevertheless, I do realize that for some languages affix compression is very important. So to solve this dilemma I plan on having two different modes of my spell checker: One with affix compression that does not use soundslike pairs (much like ispell) and one without affix compression that does use soundslike. I plan to extract the affix manipulation code from Ispell with the help of an Ispell author. The tricky part would be getting this to getting this all to work properly at tun time bases on the dictionary used. \subsection{Extremely Large Dictionaries} This problem extends back to the fact of the way words are index is Aspell. This problem will get resolved when I implanted the affix compression mode as only one index would be used. \subsection{General region skipping} I want to implement this give other people an idea of how it should be done and because I am really sick of having to spell check through url and email address. \subsection{Word skipping by context} This was posted on the Aspell mailing list on January 1, 1999: I had an idea on a great general way to determine if a word should be skipped. Determine the words to skip based on the symbols that (almost) always surround the word. For example when asked to check the following C++ code: \begin{lyxcode} cout~<\/<~\char`\"{}My~age~is:~\char`\"{}~<\/<~num~<\/<~endl;~~\\ cout~<\/<~\char`\"{}Next~year~I~will~be~\char`\"{}~<\/<~num~+~1~<\/<~endl;~ \end{lyxcode} cout, num, and endl will all be skipped. \char`\"{}cout\char`\"{} will be skipped because it is always preceded by a <\/<. \char`\"{}num\char`\"{} will be skipped because it is always preceded by a <\/<. And \char`\"{}endl\char`\"{} will be skipped because it is always between a <\/< and a ;. Given the following html code. \begin{lyxcode} ~~\\ OneTwoThree~~\\ 123~~\\ ~\\ ~\\ ~~\\ \end{lyxcode} table, width cellspacing, cellpadding, tr, td will all be skipped because they are always enclosed in \char`\"{}<>\char`\"{}. Now of course table and width would be marked as correct anyway however there is no harm in skipping them. So I was wondering if anyone on this list has any experience in writing this sort of context recognition code or could give me some pointers in the right direction. This sort of word skipping will be very powerful if done right. I imagine that it could replace specific spell checker modes for Tex, Nroff, SGML etc because it will automatically be able to figure out where it should skip words. It could also probably do a very good job on programming languages code. If you are interested in helping be out with this or just have general comments about the idea please let me know. \subsection{Hidden Markov Model} Knud Haugaard Sørensen suggested this one. From his email on the Aspell mailing list: \begin{quote} consider this examples. a fone number. -> a phone number. \\ a fone dress. -> a fine dress. the example illustrates that the right correction might depend on the context of the word. So I suggest that you take a look on HMM to solve this problem. This might also provide a good base to include grammar correction in aspell. see this link \url{http://www.cse.ogi.edu/CSLU/HLTsurvey/ch1node7.html} \end{quote} I think it is a great idea. However unfortunately it will probably be very complicated to implement. Perhaps in the far future. \subsection{Email the Personal Dictionary} Some one suggest in a personal email: \begin{quote} Have you thought of adding a function to aspell, that - when the personal dictionary has grown significantly - sends the user's personal dictionary to the maintainer of the corresponding aspell dictionary? (if the user allows it) It would be a very useful service to the dictionary maintainers, and I think most users can see their benefit in it too. \end{quote} And I replied: \begin{quote} Yes I have considered something like that but not for the personal dictionaries but rather the replacement word list in order to get better test data for \url{http://aspell.sourceforge.net/test/}. The problem is I don't know of a good way to do this sense Aspell can also be used as a library. It also is not a real high priority, especially sense I would first need to learn how to send email within a C++ program. \end{quote} \subsection{Words With Spaces in Them} While this is something I would like to do it is not a simple task. The basic problem is that when tokenizing a string there is no good way to keep phrases together. So the solution is to some how add special conditions to certain words which will dictate which words can come before/after it. Then there is also a problem of how to come up with intelligent suggestions. What further complicates things is that many applications send words to Aspell a word at a time. So even if Aspell did support such a thing many applications that would use Aspell will not. So, in order for this to work applications will need to send text to Aspell a document or at least a sentence at a time. Unfortunately the framework for doing this is not there yet. It will be once I finish the filter interface. Another possible is to provide call back functions in which Aspell will be able to request the previous or next word on request. Yet again the framework for doing this is not there. Perhaps sometime in the near future. \chapter{Support for Gcc 2.7.2} (and other non-standard compliance compilers) My original plan was to program in such a way that it would Aspell would compile under Gcc 2.7.2 however after releasing a rather nasty bug in 2.7.2 with nested types I have desired to drop all support for Gcc 2.7.2. As of Aspell .27 all hope for being able to compile under Gcc 2.7.2 is lost as I am now using many modern C++ features which are simply not present in Gcc 2.7.2, most notably template specialization and template members. Egcs 1.1.1 is a very good standards compliant compiler and that is now the officially supported compiler. However as Egcs 1.1.1 is relatively new and, except for namespaces, provides little new functionality I will continue to support Egcs 1.0.3. Gcc 2.8.1 should in theory work however it is so buggy I have yet to get it to compile with it nor has anyone else that I know of. Yes, my code could be rewritten so that is could compile under Gcc 2.7.2 but Why? Using modern C++ has probably accelerated the development of this library my at least 50\%. And for that matter why stop at Gcc 2.7.2 why not go all out and totally rewrite my code pure C. I hope you see my point. However that does not mean I want to sacrifice portability unnecessarily. If you see any of part of my code that in not Standard C++ please let me know. My hope is that my code could compile on all Standard compliance C++ compilers with the addition of a few extra header files from SGI's STL. As a side note I think that Mozilla's C++ portability guide (\url{http://www.mozilla.org/docs/tplist/catBuild/portable-cpp.html}) could be summed up in one sentence: Program in the dark ages of C++. \chapter{Credits} \begin{itemize} \item To the many authors of Ispell (including R. E. Gorin, Pace Willisson, and Geoff Kuenning) for providing me with a good word list as well as giving me a few good ideas. \item Alan Beale for going well beyond the call of duty with helping me create a better word list. \item Lawrence Philips for coming up with the original metaphone algorithm and Michael Kuhn for writing C code for the algorithm. \item Björn Jacke for coming up with a generic soundslike algorithm which gets all of its data from a file, thus eliminating almost all need for language specific code from aspell. \item To the authors of SGI STL version 3.0 and up for proving a great set of generic container classes with cut the development time of this program in half at least. \item To the \LyX{} development team for giving me a great tool for the development of this manual. \end{itemize} \chapter{Glossary and References} \begin{description} \item [affix]in grammar, a word element that, when added to a word, modifies its meaning or function; prefix, infix, or suffix. \item [Debian]A 100\% Open Source Linux distribution \url{http://www.debian.org} \item [DICT~protocol]A TCP transaction based query/response protocol that allows a client to access dictionary definitions from a set of natural language dictionary databases. \url{http://www.dict.org/} \item [Gnome]A project to build a complete, user-friendly desktop based entirely on free software. \url{http://www.gnome.org/} \item [GTK+]A library for creating graphical user interfaces for the X Window System. It is designed to be small, efficient, and flexible. \url{http://www.gtk.org/} \item [GUI]Graphics User Interface \item [Ispell.el]Emacs interface for ispell. \url{http://www.kdstevens.com/~stevens/ispell-page.html} \item [Ispell]An international spell checker which is just about the only decent Open Source spell checker out there. (except Aspell of course). \url{http://fmg-www.cs.ucla.edu/geoff/ispell.html} \item [Ispell~-a]An Ispell mode that is designed to be used by other applications though a pipe. \item [KDE]A powerful graphical desktop environment for Unix workstations. \url{http://www.kde.org} \item [Linux]A Open Source version of Unix which runs on many platforms. \url{http://www.linux.org/} \item [\LyX{}]An What You See is What You Mean document editor for the X environment, \url{http://www.lyx.org} \item [Open~Source]Software where the source code is available for anyone to extend or modify. \url{http://www.opensource.org/} \item [Red~Hat]A commercial Linux distribution. \url{http://www.redhat.com} \item [RPM]Red Hat's packing format also used by other Linux distributions. \item [STL]Standard Temple Library. A C++ library of container classes, algorithms, and iterators. \item [SGI~STL]Silicon Graphics implantation of the STL. \url{http://www.sgi.com/Technology/STL/} \item [STLPort]A port of SGI STL designed to run on compilers that don't support all the latest features of C++. \url{http://corp.metabyte.com/~fbp/stl/} \end{description} \chapter{Copyright} The library and utility program is Copyrighted 2000 by Kevin Atkinson under the LGPL. Certain parts of the library, as indicated at the top of the source file, are under a weaker license. The provided word list is extracted from the word lists provided with Ispell thus it is distributed under the same terms of Ispell. The two licenses follow: \section{LGPL} {\centering GNU LIBRARY GENERAL PUBLIC LICENSE \\ Version 2, June 1991\par} Copyright (C) 1991 Free Software Foundation, Inc. \begin{quote} 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA \end{quote} Everyone is permitted to copy and distribute verbatim copies of this license document, but changing it is not allowed. {[}This is the first released version of the library GPL. It is numbered 2 because it goes with version 2 of the ordinary GPL.{]} {\centering Preamble\par} The licenses for most software are designed to take away your freedom to share and change it. By contrast, the GNU General Public Licenses are intended to guarantee your freedom to share and change free software --- to make sure the software is free for all its users. This license, the Library General Public License, applies to some specially designated Free Software Foundation software, and to any other libraries whose authors decide to use it. You can use it for your libraries, too. When we speak of free software, we are referring to freedom, not price. Our General Public Licenses are designed to make sure that you have the freedom to distribute copies of free software (and charge for this service if you wish), that you receive source code or can get it if you want it, that you can change the software or use pieces of it in new free programs; and that you know you can do these things. To protect your rights, we need to make restrictions that forbid anyone to deny you these rights or to ask you to surrender the rights. These restrictions translate to certain responsibilities for you if you distribute copies of the library, or if you modify it. For example, if you distribute copies of the library, whether gratis or for a fee, you must give the recipients all the rights that we gave you. You must make sure that they, too, receive or can get the source code. If you link a program with the library, you must provide complete object files to the recipients so that they can relink them with the library, after making changes to the library and recompiling it. And you must show them these terms so they know their rights. Our method of protecting your rights has two steps: (1) copyright the library, and (2) offer you this license which gives you legal permission to copy, distribute and/or modify the library. Also, for each distributor's protection, we want to make certain that everyone understands that there is no warranty for this free library. If the library is modified by someone else and passed on, we want its recipients to know that what they have is not the original version, so that any problems introduced by others will not reflect on the original authors' reputations. Finally, any free program is threatened constantly by software patents. We wish to avoid the danger that companies distributing free software will individually obtain patent licenses, thus in effect transforming the program into proprietary software. To prevent this, we have made it clear that any patent must be licensed for everyone's free use or not licensed at all. Most GNU software, including some libraries, is covered by the ordinary GNU General Public License, which was designed for utility programs. This license, the GNU Library General Public License, applies to certain designated libraries. This license is quite different from the ordinary one; be sure to read it in full, and don't assume that anything in it is the same as in the ordinary license. The reason we have a separate public license for some libraries is that they blur the distinction we usually make between modifying or adding to a program and simply using it. Linking a program with a library, without changing the library, is in some sense simply using the library, and is analogous to running a utility program or application program. However, in a textual and legal sense, the linked executable is a combined work, a derivative of the original library, and the ordinary General Public License treats it as such. Because of this blurred distinction, using the ordinary General Public License for libraries did not effectively promote software sharing, because most developers did not use the libraries. We concluded that weaker conditions might promote sharing better. However, unrestricted linking of non-free programs would deprive the users of those programs of all benefit from the free status of the libraries themselves. This Library General Public License is intended to permit developers of non-free programs to use free libraries, while preserving your freedom as a user of such programs to change the free libraries that are incorporated in them. (We have not seen how to achieve this as regards changes in header files, but we have achieved it as regards changes in the actual functions of the Library.) The hope is that this will lead to faster development of free libraries. The precise terms and conditions for copying, distribution and modification follow. Pay close attention to the difference between a \char`\"{}work based on the library\char`\"{} and a \char`\"{}work that uses the library\char`\"{}. The former contains code derived from the library, while the latter only works together with the library. Note that it is possible for a library to be covered by the ordinary General Public License rather than by this special one. {\centering GNU LIBRARY GENERAL PUBLIC LICENSE\\ TERMS AND CONDITIONS FOR COPYING, DISTRIBUTION AND MODIFICATION\par} 0. This License Agreement applies to any software library which contains a notice placed by the copyright holder or other authorized party saying it may be distributed under the terms of this Library General Public License (also called \char`\"{}this License\char`\"{}). Each licensee is addressed as \char`\"{}you\char`\"{}. A \char`\"{}library\char`\"{} means a collection of software functions and/or data prepared so as to be conveniently linked with application programs (which use some of those functions and data) to form executables. The \char`\"{}Library\char`\"{}, below, refers to any such software library or work which has been distributed under these terms. A \char`\"{}work based on the Library\char`\"{} means either the Library or any derivative work under copyright law: that is to say, a work containing the Library or a portion of it, either verbatim or with modifications and/or translated straightforwardly into another language. (Hereinafter, translation is included without limitation in the term \char`\"{}modification\char`\"{}.) \char`\"{}Source code\char`\"{} for a work means the preferred form of the work for making modifications to it. For a library, complete source code means all the source code for all modules it contains, plus any associated interface definition files, plus the scripts used to control compilation and installation of the library. Activities other than copying, distribution and modification are not covered by this License; they are outside its scope. The act of running a program using the Library is not restricted, and output from such a program is covered only if its contents constitute a work based on the Library (independent of the use of the Library in a tool for writing it). Whether that is true depends on what the Library does and what the program that uses the Library does. 1. You may copy and distribute verbatim copies of the Library's complete source code as you receive it, in any medium, provided that you conspicuously and appropriately publish on each copy an appropriate copyright notice and disclaimer of warranty; keep intact all the notices that refer to this License and to the absence of any warranty; and distribute a copy of this License along with the Library. You may charge a fee for the physical act of transferring a copy, and you may at your option offer warranty protection in exchange for a fee. 2. You may modify your copy or copies of the Library or any portion of it, thus forming a work based on the Library, and copy and distribute such modifications or work under the terms of Section 1 above, provided that you also meet all of these conditions: \begin{LyXParagraphIndent}{0.5in} a) The modified work must itself be a software library. b) You must cause the files modified to carry prominent notices stating that you changed the files and the date of any change. c) You must cause the whole of the work to be licensed at no charge to all third parties under the terms of this License. d) If a facility in the modified Library refers to a function or a table of data to be supplied by an application program that uses the facility, other than as an argument passed when the facility is invoked, then you must make a good faith effort to ensure that, in the event an application does not supply such function or table, the facility still operates, and performs whatever part of its purpose remains meaningful. \end{LyXParagraphIndent} (For example, a function in a library to compute square roots has a purpose that is entirely well-defined independent of the application. Therefore, Subsection 2d requires that any application-supplied function or table used by this function must be optional: if the application does not supply it, the square root function must still compute square roots.) These requirements apply to the modified work as a whole. If identifiable sections of that work are not derived from the Library, and can be reasonably considered independent and separate works in themselves, then this License, and its terms, do not apply to those sections when you distribute them as separate works. But when you distribute the same sections as part of a whole which is a work based on the Library, the distribution of the whole must be on the terms of this License, whose permissions for other licensees extend to the entire whole, and thus to each and every part regardless of who wrote it. Thus, it is not the intent of this section to claim rights or contest your rights to work written entirely by you; rather, the intent is to exercise the right to control the distribution of derivative or collective works based on the Library. In addition, mere aggregation of another work not based on the Library with the Library (or with a work based on the Library) on a volume of a storage or distribution medium does not bring the other work under the scope of this License. 3. You may opt to apply the terms of the ordinary GNU General Public License instead of this License to a given copy of the Library. To do this, you must alter all the notices that refer to this License, so that they refer to the ordinary GNU General Public License, version 2, instead of to this License. (If a newer version than version 2 of the ordinary GNU General Public License has appeared, then you can specify that version instead if you wish.) Do not make any other change in these notices. Once this change is made in a given copy, it is irreversible for that copy, so the ordinary GNU General Public License applies to all subsequent copies and derivative works made from that copy. This option is useful when you wish to copy part of the code of the Library into a program that is not a library. 4. You may copy and distribute the Library (or a portion or derivative of it, under Section 2) in object code or executable form under the terms of Sections 1 and 2 above provided that you accompany it with the complete corresponding machine-readable source code, which must be distributed under the terms of Sections 1 and 2 above on a medium customarily used for software interchange. If distribution of object code is made by offering access to copy from a designated place, then offering equivalent access to copy the source code from the same place satisfies the requirement to distribute the source code, even though third parties are not compelled to copy the source along with the object code. 5. A program that contains no derivative of any portion of the Library, but is designed to work with the Library by being compiled or linked with it, is called a \char`\"{}work that uses the Library\char`\"{}. Such a work, in isolation, is not a derivative work of the Library, and therefore falls outside the scope of this License. However, linking a \char`\"{}work that uses the Library\char`\"{} with the Library creates an executable that is a derivative of the Library (because it contains portions of the Library), rather than a \char`\"{}work that uses the library\char`\"{}. The executable is therefore covered by this License. Section 6 states terms for distribution of such executables. When a \char`\"{}work that uses the Library\char`\"{} uses material from a header file that is part of the Library, the object code for the work may be a derivative work of the Library even though the source code is not. Whether this is true is especially significant if the work can be linked without the Library, or if the work is itself a library. The threshold for this to be true is not precisely defined by law. If such an object file uses only numerical parameters, data structure layouts and accessors, and small macros and small inline functions (ten lines or less in length), then the use of the object file is unrestricted, regardless of whether it is legally a derivative work. (Executables containing this object code plus portions of the Library will still fall under Section 6.) Otherwise, if the work is a derivative of the Library, you may distribute the object code for the work under the terms of Section 6. Any executables containing that work also fall under Section 6, whether or not they are linked directly with the Library itself. 6. As an exception to the Sections above, you may also compile or link a \char`\"{}work that uses the Library\char`\"{} with the Library to produce a work containing portions of the Library, and distribute that work under terms of your choice, provided that the terms permit modification of the work for the customer's own use and reverse engineering for debugging such modifications. You must give prominent notice with each copy of the work that the Library is used in it and that the Library and its use are covered by this License. You must supply a copy of this License. If the work during execution displays copyright notices, you must include the copyright notice for the Library among them, as well as a reference directing the user to the copy of this License. Also, you must do one of these things: \begin{LyXParagraphIndent}{0.5in} a) Accompany the work with the complete corresponding machine-readable source code for the Library including whatever changes were used in the work (which must be distributed under Sections 1 and 2 above); and, if the work is an executable linked with the Library, with the complete machine-readable \char`\"{}work that uses the Library\char`\"{}, as object code and/or source code, so that the user can modify the Library and then relink to produce a modified executable containing the modified Library. (It is understood that the user who changes the contents of definitions files in the Library will not necessarily be able to recompile the application to use the modified definitions.) b) Accompany the work with a written offer, valid for at least three years, to give the same user the materials specified in Subsection 6a, above, for a charge no more than the cost of performing this distribution. c) If distribution of the work is made by offering access to copy from a designated place, offer equivalent access to copy the above specified materials from the same place. d) Verify that the user has already received a copy of these materials or that you have already sent this user a copy. \end{LyXParagraphIndent} For an executable, the required form of the \char`\"{}work that uses the Library\char`\"{} must include any data and utility programs needed for reproducing the executable from it. However, as a special exception, the source code distributed need not include anything that is normally distributed (in either source or binary form) with the major components (compiler, kernel, and so on) of the operating system on which the executable runs, unless that component itself accompanies the executable. It may happen that this requirement contradicts the license restrictions of other proprietary libraries that do not normally accompany the operating system. Such a contradiction means you cannot use both them and the Library together in an executable that you distribute. 7. You may place library facilities that are a work based on the Library side-by-side in a single library together with other library facilities not covered by this License, and distribute such a combined library, provided that the separate distribution of the work based on the Library and of the other library facilities is otherwise permitted, and provided that you do these two things: \begin{LyXParagraphIndent}{0.5in} a) Accompany the combined library with a copy of the same work based on the Library, uncombined with any other library facilities. This must be distributed under the terms of the Sections above. b) Give prominent notice with the combined library of the fact that part of it is a work based on the Library, and explaining where to find the accompanying uncombined form of the same work. \end{LyXParagraphIndent} 8. You may not copy, modify, sublicense, link with, or distribute the Library except as expressly provided under this License. Any attempt otherwise to copy, modify, sublicense, link with, or distribute the Library is void, and will automatically terminate your rights under this License. However, parties who have received copies, or rights, from you under this License will not have their licenses terminated so long as such parties remain in full compliance. 9. You are not required to accept this License, since you have not signed it. However, nothing else grants you permission to modify or distribute the Library or its derivative works. These actions are prohibited by law if you do not accept this License. Therefore, by modifying or distributing the Library (or any work based on the Library), you indicate your acceptance of this License to do so, and all its terms and conditions for copying, distributing or modifying the Library or works based on it. 10. Each time you redistribute the Library (or any work based on the Library), the recipient automatically receives a license from the original licensor to copy, distribute, link with or modify the Library subject to these terms and conditions. You may not impose any further restrictions on the recipients' exercise of the rights granted herein. You are not responsible for enforcing compliance by third parties to this License. 11. If, as a consequence of a court judgment or allegation of patent infringement or for any other reason (not limited to patent issues), conditions are imposed on you (whether by court order, agreement or otherwise) that contradict the conditions of this License, they do not excuse you from the conditions of this License. If you cannot distribute so as to satisfy simultaneously your obligations under this License and any other pertinent obligations, then as a consequence you may not distribute the Library at all. For example, if a patent license would not permit royalty-free redistribution of the Library by all those who receive copies directly or indirectly through you, then the only way you could satisfy both it and this License would be to refrain entirely from distribution of the Library. If any portion of this section is held invalid or unenforceable under any particular circumstance, the balance of the section is intended to apply, and the section as a whole is intended to apply in other circumstances. It is not the purpose of this section to induce you to infringe any patents or other property right claims or to contest validity of any such claims; this section has the sole purpose of protecting the integrity of the free software distribution system which is implemented by public license practices. Many people have made generous contributions to the wide range of software distributed through that system in reliance on consistent application of that system; it is up to the author/donor to decide if he or she is willing to distribute software through any other system and a licensee cannot impose that choice. This section is intended to make thoroughly clear what is believed to be a consequence of the rest of this License. 12. If the distribution and/or use of the Library is restricted in certain countries either by patents or by copyrighted interfaces, the original copyright holder who places the Library under this License may add an explicit geographical distribution limitation excluding those countries, so that distribution is permitted only in or among countries not thus excluded. In such case, this License incorporates the limitation as if written in the body of this License. 13. The Free Software Foundation may publish revised and/or new versions of the Library General Public License from time to time. Such new versions will be similar in spirit to the present version, but may differ in detail to address new problems or concerns. Each version is given a distinguishing version number. If the Library specifies a version number of this License which applies to it and \char`\"{}any later version\char`\"{}, you have the option of following the terms and conditions either of that version or of any later version published by the Free Software Foundation. If the Library does not specify a license version number, you may choose any version ever published by the Free Software Foundation. 14. If you wish to incorporate parts of the Library into other free programs whose distribution conditions are incompatible with these, write to the author to ask for permission. For software which is copyrighted by the Free Software Foundation, write to the Free Software Foundation; we sometimes make exceptions for this. Our decision will be guided by the two goals of preserving the free status of all derivatives of our free software and of promoting the sharing and reuse of software generally. {\centering NO WARRANTY\par} 15. BECAUSE THE LIBRARY IS LICENSED FREE OF CHARGE, THERE IS NO WARRANTY FOR THE LIBRARY, TO THE EXTENT PERMITTED BY APPLICABLE LAW. EXCEPT WHEN OTHERWISE STATED IN WRITING THE COPYRIGHT HOLDERS AND/OR OTHER PARTIES PROVIDE THE LIBRARY \char`\"{}AS IS\char`\"{} WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE. THE ENTIRE RISK AS TO THE QUALITY AND PERFORMANCE OF THE LIBRARY IS WITH YOU. SHOULD THE LIBRARY PROVE DEFECTIVE, YOU ASSUME THE COST OF ALL NECESSARY SERVICING, REPAIR OR CORRECTION. 16. IN NO EVENT UNLESS REQUIRED BY APPLICABLE LAW OR AGREED TO IN WRITING WILL ANY COPYRIGHT HOLDER, OR ANY OTHER PARTY WHO MAY MODIFY AND/OR REDISTRIBUTE THE LIBRARY AS PERMITTED ABOVE, BE LIABLE TO YOU FOR DAMAGES, INCLUDING ANY GENERAL, SPECIAL, INCIDENTAL OR CONSEQUENTIAL DAMAGES ARISING OUT OF THE USE OR INABILITY TO USE THE LIBRARY (INCLUDING BUT NOT LIMITED TO LOSS OF DATA OR DATA BEING RENDERED INACCURATE OR LOSSES SUSTAINED BY YOU OR THIRD PARTIES OR A FAILURE OF THE LIBRARY TO OPERATE WITH ANY OTHER SOFTWARE), EVEN IF SUCH HOLDER OR OTHER PARTY HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH DAMAGES. {\centering END OF TERMS AND CONDITIONS\par} {\centering How to Apply These Terms to Your New Libraries\par} If you develop a new library, and you want it to be of the greatest possible use to the public, we recommend making it free software that everyone can redistribute and change. You can do so by permitting redistribution under these terms (or, alternatively, under the terms of the ordinary General Public License). To apply these terms, attach the following notices to the library. It is safest to attach them to the start of each source file to most effectively convey the exclusion of warranty; and each file should have at least the \char`\"{}copyright\char`\"{} line and a pointer to where the full notice is found. \begin{quote} \\ Copyright (C) This library is free software; you can redistribute it and/or modify it under the terms of the GNU Library General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version. This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Library General Public License for more details. You should have received a copy of the GNU Library General Public License along with this library; if not, write to the Free Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA \end{quote} Also add information on how to contact you by electronic and paper mail. You should also get your employer (if you work as a programmer) or your school, if any, to sign a \char`\"{}copyright disclaimer\char`\"{} for the library, if necessary. Here is a sample; alter the names: \begin{quote} Yoyodyne, Inc., hereby disclaims all copyright interest in the library `Frob' (a library for tweaking knobs) written by James Random Hacker. , 1 April 1990 Ty Coon, President of Vice \end{quote} That's all there is to it! \section{Ispell Copyright} Copyright (c), 1983, by Pace Willisson Copyright 1992, 1993, Geoff Kuenning, Granada Hills, CA All rights reserved. Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met: 1. Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer. 2. Redistributions in binary form must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and/or other materials provided with the distribution. 3. All modifications to the source code must be clearly marked as such. Binary redistributions based on modified source code must be clearly marked as modified versions in the documentation and/or other materials provided with the distribution. 4. All advertising materials mentioning features or use of this software must display the following acknowledgment: \begin{quote} This product includes software developed by Geoff Kuenning and other unpaid contributors. \end{quote} 5. The name of Geoff Kuenning may not be used to endorse or promote products derived from this software without specific prior written permission. THIS SOFTWARE IS PROVIDED BY GEOFF KUENNING AND CONTRIBUTORS ``AS IS'' AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL GEOFF KUENNING OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. \end{document}