• Home
  • History
  • Annotate
Name Date Size #Lines LOC

..03-May-2022-

config/H29-Nov-2018-4,9874,376

setdefinitions/H29-Nov-2018-2,1701,721

.gitignoreH A D29-Nov-2018168 1514

AUTHORSH A D29-Nov-201851 43

COPYINGH A D29-Nov-201834.3 KiB675553

INSTALLH A D29-Nov-201815.4 KiB371289

Makefile.amH A D29-Nov-2018203 106

NEWSH A D29-Nov-20182 KiB6048

READMEH A D29-Nov-201841 21

README.mdH A D29-Nov-2018987 3628

TODOH A D29-Nov-201817 21

bootstrap.shH A D29-Nov-20182.1 KiB7939

configure.acH A D29-Nov-2018337 1512

generate-setdefinitions.pyH A D29-Nov-20181.7 KiB4839

uctodata.pc.inH A D29-Nov-2018109 75

README

1Please see README.md for for information
2

README.md

1# uctodata 0.4 CLST/ILK 2009 - 2016
2	 https://github.com/LanguageMachines/uctodata/
3
4Website and documentation: https://languagemachines.github.io/ucto
5
6uctodata provides datafiles for the tokeniser ucto for several languages. The
7language code can be supplied to ucto using the ``-L`` paramater (e.g. ``ucto
8-L nld input.txt``):
9
10 * ``eng`` - English
11 * ``nld`` - Dutch
12 * ``deu`` - German
13 * ``fra`` - French
14 * ``ita`` - Italian
15 * ``spa`` - Spanish
16 * ``por`` - Portuguese
17 * ``rus`` - Russian
18 * ``swe`` - Swedish
19 * ``tur`` - Turkish
20 * ``fry`` - Frisian
21
22uctodata is architecture independent.
23
24To install uctodata, first consult whether your distribution's
25package manager has an up-to-date package.
26If not, for easy installation of ucto and uctodata, it is included
27as part of our software distribution LaMachine:
28https://proycon.github.io/LaMachine .
29
30To compile and install manually from source instead:
31
32    $ bash bootstrap.sh
33    $ ./configure
34    $ make
35    $ make install
36