• Home
  • History
  • Annotate
Name Date Size #Lines LOC

..03-May-2022-

words/H03-May-2022-1,9851,859

4.0.affixH A D03-May-202277 72

4.0.constituent-knowledgeH A D03-May-20221.7 KiB133120

4.0.dictH A D03-May-202219.3 KiB884752

4.0.knowledgeH A D03-May-20221.3 KiB2925

4.0.regexH A D03-May-20221 KiB2017

Makefile.amH A D03-May-2022457 2114

Makefile.inH A D03-May-202222.4 KiB756666

READMEH A D03-May-20222 KiB4936

corpus-basic.batchH A D03-May-202275 42

README

1
2                   Persian Link Grammar
3                   --------------------
4                       By Jon Dehdari
5       Brigham Young University Linguistics Department
6                         2003-2005
7
8
9This README is for 'persianlg', a Persian (Farsi) implementation of the
10Link Grammar (LG) syntactic parser.  The LG Parser is a new approach to
11parsing sentences, and was developed at Carnegie Melon University.  The
12parser works by simply linking words together.  Directionality and
13distance are the primary constraints given, and are found in the 4.0.dict
14file.
15
16The Persian PersianLG implementation is the first full non-English
17implementation, to my knowledge.  It works a little differently than
18the English version, due to the more complex word structure.
19Morphological pre-parsing is required, either by hand, using the
20included stemmer (stemmer.pl) [1], or using the Persian-pckimmo
21morphology engine.  The LG-Soar framework, developed in the BYU Soar
22research group, has tied both the LG parser and the PC-Kimmo engine
23together, to provide seamless integration of morphology and syntax.
24Thus manual morphology pre-parsing is not necessary, nor is cutting
25and pasting pckimmo output to the link-parser input.
26
27For example, an unvoweled text like 'mn midAnm kh tu rfti.' (I know
28that you went), should go into the LG parser as 'mn mi dAn m kh tu rf t i.'
29See the orthography.txt file for the romanization scheme used.
30
31This package is Free Software, licensed under Lesser General Public
32License v.2.1 (www.fsf.org).  See LICENSE for details.  Note: the
33original PersianLG was made available under GPLv2; but, as of 2014,
34Jon Dehdari has consented to changing the license to LGPLv2.1 so that
35it can be included with the main Link Grammar project.
36
37
38[1]  The stemmmer is located in the /morphology/fa directory.  An
39example usage is:
40
41   ./persianparse.sh 'man midAnam keh tu rafti.'
42
43or, equivalently:
44
45   echo -e 'man midAnam keh tu rafti.\n' | ./stemmer.pl -u | link-parser fa
46
47See also:
48https://github.com/jonsafari/perstem
49