• Home
  • History
  • Annotate
Name Date Size #Lines LOC

..03-May-2022-

doc/H03-May-2022-

misc/H22-Jun-2017-4341

test/H07-May-2022-9064

COPYINGH A D22-Jan-200833.7 KiB662544

INSTALLH A D18-Jan-20125.4 KiB12092

LICENSEH A D22-Jun-2017461 107

Makefile.inH A D22-Jun-201713.1 KiB351202

PrefixBits.plH A D17-Jul-2004710 167

READMEH A D24-Jun-20177 KiB149115

WHATSNEWH A D22-Jun-201721.5 KiB465352

binarize-desc.plH A D17-May-20109 KiB275222

biza-nouns.hifH A D08-Oct-2005483 2221

configureH A D25-Feb-2012136.7 KiB4,8214,009

configure.inH A D25-Feb-20125.3 KiB145118

corlist.cH A D15-Dec-2003686 3727

dict_radix.cH A D25-Feb-201220.9 KiB722547

dict_radix.hH A D21-Dec-2003815 2512

find_sizes.cH A D21-Dec-2003370 2315

genprefixes.plH A D31-Aug-20032.3 KiB9273

gimatria.cH A D30-Jan-20105 KiB196153

gzbuffered.hH A D25-Feb-20123.1 KiB11876

hash.hH A D25-Feb-20124.7 KiB17192

hspell.1H A D22-Jun-201711.7 KiB372323

hspell.3H A D22-Jun-20179.9 KiB267235

hspell.cH A D22-Jun-201721.1 KiB737530

hspell.hH A D22-Jun-20172.6 KiB7736

libhspell.cH A D25-Feb-201213.1 KiB502341

linginfo.cH A D25-Feb-20128.1 KiB331282

linginfo.hH A D14-Apr-2006949 3010

mk_he_affix.cH A D22-Jun-20178.2 KiB242157

multispellH A D03-May-20229.4 KiB358241

pack-desc.plH A D25-Feb-20123 KiB11180

pmergeH A D03-May-20222 KiB8155

specfilter.cH A D28-Dec-20096.8 KiB18578

statsH A D05-Jan-20112.1 KiB5945

tclHash.cH A D04-Mar-201222.8 KiB819357

tclHash.hH A D23-Dec-20096 KiB13562

wolig.datH A D22-Jun-2017389.3 KiB16,02515,795

wolig.plH A D07-Feb-201530.3 KiB699432

wooH A D14-Feb-201543.5 KiB1,382940

woo.datH A D22-Jun-201761.5 KiB2,6032,592

wunzip.cH A D25-Feb-2012797 4133

wzipH A D25-Feb-2012254 1815

README

1This is version 1.4 of Hspell, the free Hebrew spellchecker and morphology
2engine.
3
4You can get Hspell from:
5	http://hspell.ivrix.org.il/
6
7Hspell was written by Nadav Har'El and Dan Kenigsberg:
8	nyh    @ math.technion.ac.il
9	danken @   cs.technion.ac.il
10
11Hspell is free software, released under the GNU Affero General Public License
12(AGPL) version 3. Note that not only the programs in the distribution, but
13also the dictionary files and the generated word lists, are licensed under
14the AGPL.
15There is no warranty of any kind for the contents of this distribution.
16See the LICENSE file for more information and the exact license terms.
17
18The rest of this README file explains Hspell's spelling standard (niqqud-less),
19a bit about the technology behind Hspell, how to use the "hspell" program
20(but see the manual page for more current information), and lists a few future
21directions. See the separate INSTALL file for instructions on how to install
22Hspell.
23
24
25About Hspell's spelling standard
26--------------------------------
27
28Hspell was designed to be 100% and strictly compliant with the official
29niqqud-less spelling rules ("Ha-ktiv Khasar Ha-niqqud", colloquially known as
30"Ktiv Male", or "plene spelling" in English), published by the Academy of
31the Hebrew Language. This is both an advantage and a disadvantage, depending
32on your viewpoint. It's an advantage because it encourages a *correct* and
33consistent spelling style throughout your writing. It is a disadvantage,
34because a few of the Academia's official spelling decisions are relatively
35unknown to the general public.
36
37Users of Hspell (and all Hebrew writers, for that matter) are encouraged to
38read the Academia's official niqqud-less spelling rules (which are printed at
39the end of most modern Hebrew dictionaries), and to refer to Hebrew
40dictionaries which use the niqqud-less spelling (such as Millon Ha-hove or
41Rav Milim). We also provide in docs/niqqudless.odt a document (in Hebrew)
42which describes in detail Hspell's spelling standard, and why certain words
43are spelled the way they are.
44
45
46The technology behind Hspell
47----------------------------
48
49The "hspell" program itself is mostly a simple (but efficient) program
50that checks input words against a long list of valid words. The real "brains"
51behind it are the word lists (lexicon) provided by the Hspell project.
52
53In order for it to be completely free of other people's copyright restrictions,
54the Hspell project is a clean-room implementation, not based on other
55companies' word lists, on other companies' spell checkers, or on copying of
56printed dictionaries. The word list is also not based on automatic scanning
57of available Hebrew documents (such as online newspapers), because there is
58no way to guarantee that such a list will be correct, complete, or consistent
59with regard to spelling rules.
60
61Instead, our idea was to write programs which know how to correctly inflect
62Hebrew nouns and conjugate Hebrew verbs. The inputs to these programs are
63lists of noun stems and of verb roots, plus hints needed for the correct
64inflection when these cannot be figured out automatically. These input files
65are obviously an important part of the Hspell project. The "word list
66generators" (written in Perl, and are also part of the Hspell project) then
67create the complete word-list for use by the spellchecking program, hspell.
68The generated lists are useful for much more than spellchecking, by the
69way - see more on that below ("the future").
70
71Although we wrote all of Hspell's code ourselves, we are truly indebted to
72the old-style "open source" pioneers - people who wrote books about the
73knowledge they developed, instead of hiding it in proprietary software.
74For the correct noun inflections, Dr. Shaul Barkali's "The Complete Noun Book"
75has been a great help. Prof. Uzzi Ornan's booklet "Verb Conjugation in Flow
76Charts" has been instrumental in the implementation of verb conjugation,
77and Barkali's "The Complete Verb Book" was used too.
78
79During our work we have extensively used a number of Hebrew dictionaries,
80including Even Shoshan, Millon Ha-hove and Rav-Milim, to ensure the correctness
81of certain words. Various Hebrew newspapers and books, both printed and online,
82were used for inspiration and for finding words we still do not recognize.
83We wish to thank Cilla Tuviana and Dr. Zvi Har'El for their assistance with
84some grammatical questions.
85
86
87Using hspell
88------------
89
90After unpacking the distribution and running "configure", "make" and
91"make install" (see the INSTALL file for more information), the hspell
92executable is installed (by default) in /usr/local/bin, and the dictionary
93files are in /usr/local/share/hspell.
94
95The "hspell" program can be used on any sort of text file containing Hebrew
96and potentially non-Hebrew characters which it ignores. For example, it
97works well on Hebrew text files, TeX/LaTeX files, and HTML. Running
98
99	hspell filename
100
101Will check the spelling in filename and will output the list of incorrect
102words (just like the old-fashioned UNIX "spell" program did). If run without
103a file parameter, hspell reads from its standard input.
104
105In the current release, hspell expects ISO-8859-8-encoded files. If files
106using a different encoding (e.g., UTF8) are to be checked, they must be
107converted first to ISO-8859-8 (e.g., see iconv(1), recode(1)).
108
109If the "-c" option is given, hspell will suggest corrections for misspelled
110words, whenever it can find such corrections. The correction mechanism in this
111release is especially good at finding corrections for incorrect niqqud-less
112spellings, with missing or extra 'immot-qri'a.
113
114The "-l" (verbose) option will explain for each correct word why it was
115recognized, if Hspell was built with the "linginfo" optional feature enabled
116(a morphological analysis is shown, i.e., fully describe all possible ways to
117read the given word as an inflected word with optional prefixes).
118
119Because hspell's output (naturally) is "logical-order", it is normally
120useful to pipe it to bidiv or rev before viewing. For example
121
122	hspell -c filename | bidiv | less
123
124Another convenient alternative is to run hspell on a BiDi-enabled terminal.
125
126Instead of using the hspell program described above, users can also use
127Hspell's lexicon through one of the popular multi-lingual spell-checkers,
128aspell and hunspell. See the INSTALL file for more information on building
129these dictionaries.
130
131
132How *you* can help
133------------------
134
135By now, Hspell is fairly mature, and its lexicon of over 24,000 base words
136is fairly comprehensive, similar in breadth to some printed dictionaries.
137Careful attention has also been given to its accuracy, and its conformance
138with the spelling rules of the Academy of the Hebrew Language.
139
140Nevertheless, Hspell does not, and probably never will, cover all of modern
141Hebrew language. Also, undoubtedly, it may contain some errors as well.
142If you find such omissions or errors, please let us know.
143
144Before reporting such omissions or errors, please try to verify that the word
145you are proposing is indeed correctly spelled: Please refer to modern
146dictionaries. Please also look at doc/niqqudless.odt - the word you are
147proposing might actually be a known mispelling which we discuss in that
148document.
149