Difference between revisions of "Syntactic labels"
(Link to French page) |
|||
(11 intermediate revisions by 3 users not shown) | |||
Line 1: | Line 1: | ||
[[Étiquettes syntaxiques|En français]] |
|||
{{TOCD}} |
|||
In some language pairs, syntactic function labels are used to improve disambiguation, or allow tighter transfer rules to be written. For example, disambiguating verb phrase co-ordinators from noun phrase co-ordinators lets you write transfer rules to merge two co-ordinated NPs. |
In some language pairs, syntactic function labels are used to improve disambiguation, or allow tighter transfer rules to be written. For example, disambiguating verb phrase co-ordinators from noun phrase co-ordinators lets you write transfer rules to merge two co-ordinated NPs. |
||
Apertium processes left-to-right longest match, so if we have the following sentence, |
Apertium processes [[LRLM|left-to-right longest match]], so if we have the following sentence, |
||
* John kicked the ball and Mary caught it. |
* John kicked the ball and Mary caught it. |
||
Line 31: | Line 34: | ||
==Standard syntax tags== |
==Standard syntax tags== |
||
These are the uniform tags used in many [http://giellatekno.uit.no/english.html Giellatekno] projects. It isn't necessary to implement all of the analysis, |
These are the uniform tags used in many [http://giellatekno.uit.no/english.html Giellatekno] projects. It isn't necessary to implement all of the analysis, even implementing part of it can prove useful in writing transfer or lexical selection rules. |
||
even implementing part of it can prove useful in writing transfer or lexical selection rules. |
|||
===Direction=== |
|||
* If the tag indicates that the current word serves that function, then: |
|||
** If the word on which the current word depends is to the left, then the arrow is on the left of the tag, |
|||
*** e.g. <code>@←SUBJ</code>: A subject which depends on a verb to the left. |
|||
** If the word on which the current word depends is to the right, then the arrow is on the right of the tag, |
|||
*** e.g. <code>@OBJ→</code>: A direct object which depends on a verb to the right. |
|||
* If the tag indicates that the current word depends on a word of that function, then: |
|||
** If the word on which it depends is to the right, the tag is on the left: |
|||
*** e.g. <code>@→N</code>: A noun modifier with its head to the right |
|||
** If the word on which it depends is to the left, the tag is on the right: |
|||
*** e.g. <code>@P←</code>: A word depending on a preposition to the left. |
|||
A full example: |
|||
<pre> |
|||
We<@SUBJ→> sang<@+FMAINV> songs<@←OBJ> about<@←ADVL> a<@→N> submarine<@P←> |
|||
</pre> |
|||
===Table=== |
|||
{|class=wikitable |
{|class=wikitable |
||
Line 124: | Line 146: | ||
|- |
|- |
||
|} |
|} |
||
==See also== |
|||
* [[List of symbols]] (Morphology/POS tags) |
|||
==External links== |
==External links== |
||
* [http://giellatekno.uit.no/doc/lang/ |
* [http://giellatekno.uit.no/doc/lang/common/docu-sme-syntaxtags.html Syntax tags used in Sámi] at giellatekno.uit.no |
||
[[Category:Documentation]] |
[[Category:Documentation]] |
||
[[Category:Documentation in English]] |
Latest revision as of 08:16, 8 October 2014
In some language pairs, syntactic function labels are used to improve disambiguation, or allow tighter transfer rules to be written. For example, disambiguating verb phrase co-ordinators from noun phrase co-ordinators lets you write transfer rules to merge two co-ordinated NPs.
Apertium processes left-to-right longest match, so if we have the following sentence,
- John kicked the ball and Mary caught it.
And we have a rule for noun cc noun, then we will get the following analysis,
- John kicked the [ball] and [Mary]
But if we can tag the conjunction as being a global conjunction, then we can avoid lumping the subject of the second sentence with the object of the first sentence.
- [John kicked the ball] and [Mary caught it]
Example[edit]
$ echo "Gud talaði øll hesi orð og segði Hann:" | lt-proc fo-is.automorf.bin | cg-proc fo-is.rlx.bin ^Gud/Gud<np><al><m><sg><acc><@OBJ→>/Gud<np><al><m><sg><nom><@SUBJ→>$ ^talaði/tala<vblex><past><p2><sg><@+FMAINV>/tala<vblex><past><p3><sg><@+FMAINV>$ ^øll/allur<prn><qnt><nt><pl><acc><@←OBJ>$ ^hesi/hesin<prn><dem><nt><pl><acc><@←OBJ>$ ^orð/orð<n><nt><sg><acc><ind><@←OBJ>/orð<n><nt><pl><acc><ind><@←OBJ>$ ^og/og<cnjcoo><@CNP>/og<cnjsub><@CVP>$ ^segði/siga<vblex><past><p3><sg><@+FMAINV>$ ^Hann/Prnpers<prn><p3><m><sg><nom><@←SUBJ>$^:/:<sent>$
Here, we could for example have a rule that moves subjects of a finite main verb that are to the right, to the left. e.g. @+FMAINV @←SUBJ
to @→SUBJ @+FMAINV
as is the order in English.
Standard syntax tags[edit]
These are the uniform tags used in many Giellatekno projects. It isn't necessary to implement all of the analysis, even implementing part of it can prove useful in writing transfer or lexical selection rules.
Direction[edit]
- If the tag indicates that the current word serves that function, then:
- If the word on which the current word depends is to the left, then the arrow is on the left of the tag,
- e.g.
@←SUBJ
: A subject which depends on a verb to the left.
- e.g.
- If the word on which the current word depends is to the right, then the arrow is on the right of the tag,
- e.g.
@OBJ→
: A direct object which depends on a verb to the right.
- e.g.
- If the word on which the current word depends is to the left, then the arrow is on the left of the tag,
- If the tag indicates that the current word depends on a word of that function, then:
- If the word on which it depends is to the right, the tag is on the left:
- e.g.
@→N
: A noun modifier with its head to the right
- e.g.
- If the word on which it depends is to the left, the tag is on the right:
- e.g.
@P←
: A word depending on a preposition to the left.
- e.g.
- If the word on which it depends is to the right, the tag is on the left:
A full example:
We<@SUBJ→> sang<@+FMAINV> songs<@←OBJ> about<@←ADVL> a<@→N> submarine<@P←>
Table[edit]
Tag | Description | Examples |
---|---|---|
@←SUBJ |
Subject, head verb to the left | Í upphafi skapaði Guð himinn og jörð. |
@SUBJ→ |
Subject, head verb to the right | Ég tala við hann. |
@←OBJ |
Direct object, head verb to the left | Ég sendi þér bréfið. |
@OBJ→ |
Direct object, head verb to the right | |
@←IOBJ |
Indirect object, head verb to the left | Ég sendi þér bréfið. |
@IOBJ→ |
Indirect object, head verb to the right | |
@→N |
Noun modifier, head noun to the right | Um 1.2 milljónir manna eru heimilislausar. |
@N← |
Noun modifier, head noun to the left | Samskipti landanna tveggja eru góð. |
@→A |
Adjective modifier, head noun to the right | |
@A← |
Adjective modifier, head noun to the left | |
@IM |
||
@SPRED |
Subject predicate, | |
@←SPRED |
Subject predicate, | She is my sister |
@SPRED→ |
Subject predicate, | Blár er himinninn. |
@OPRED |
Object predicate, | |
@←OPRED |
Object predicate, | I will make you my personal slave |
@OPRED→ |
Object predicate, | |
@+FAUXV |
Finite auxiliary verb | |
@-FAUXV |
Non-finite auxiliary verb | |
@+FMAINV |
Finite main verb | |
@-FMAINV |
Non-finite main verb | |
@-FSUBJ→ |
Subject of a non-finite verb | |
@-F←OBJ |
Object of a non-finite verb | |
@-FOBJ→ |
Object of a non-finite verb | |
@SPRED←OBJ |
||
@-FADVL |
||
@←ADVL |
Adverbial modifier, head to the left | |
@ADVL→ |
Adverbial modifier, head to the right | |
@ADVL |
Adverbial modifier | |
@P← |
Complement of a preposition | |
@CNP |
Local conjunction or subjunction | |
@CVP |
Conjunction or subjunction that joins finite-verb phrases | |
@→CS |
||
@CNP-VP |
Ambiguous co-ordinator | |
@APP |
Apposition | |
@ICL-ADVL |
Non-finite subclause ... | |
@ICL-AUX← |
"right" argument of auxiliary (?) | |
@ICL-OBJ |
Non-finite subclause ... | |
@ICL-STA |
Non-finite subclause ... | |
@HNOUN |
Noun phrase fragment | |
@X |
No analysis |
See also[edit]
- List of symbols (Morphology/POS tags)
External links[edit]
- Syntax tags used in Sámi at giellatekno.uit.no