Difference between revisions of "Conjoined lexical units"
Jump to navigation
Jump to search
Popcorndude (talk | contribs) (Created page with "A single surface form can correspond to multiple lexical units. When this happens, the lexical units are connected with <code>+</code>. An example of where this might be usef...") |
Popcorndude (talk | contribs) (add Chukchi) |
||
Line 1: | Line 1: | ||
A single surface form can correspond to multiple lexical units. When this happens, the lexical units are connected with <code>+</code>. |
A single surface form can correspond to multiple lexical units. When this happens, the lexical units are connected with <code>+</code>. |
||
== Minimal Example: English Possessives == |
|||
An example of where this might be useful is English plurals and possessives: |
An example of where this might be useful is English plurals and possessives: |
||
Line 49: | Line 51: | ||
NounRoot NounNumPos |
NounRoot NounNumPos |
||
</pre> |
</pre> |
||
== More Involved Example: Chukchi Incorporation == |
|||
Chukchi can incorporate nouns into verbs. A simplified example is given below: |
|||
<pre> |
|||
LEXICON VerbRoot |
|||
амэчатык:амэчат |
|||
анӈатык:анӈат |
|||
LEXICON NounRoot |
|||
варат |
|||
ватап |
|||
PATTERN VerbStem |
|||
VerbRoot |
|||
NounRoot [<n><incorp>+:>{ы}] VerbRoot |
|||
PATTERNS |
|||
VerbStem [<v>:] |
|||
</pre> |
|||
This generates the forms |
|||
<pre> |
|||
^амэчат/амэчатык<v>$ |
|||
^анӈат/анӈатык<v>$ |
|||
^варат>{ы}амэчат/варат<n><incorp>+амэчатык<v>$ |
|||
^варат>{ы}анӈат/варат<n><incorp>+анӈатык<v>$ |
|||
^ватап>{ы}амэчат/ватап<n><incorp>+амэчатык<v>$ |
|||
^ватап>{ы}анӈат/ватап<n><incorp>+анӈатык<v>$ |
|||
</pre> |
|||
Equivalent lexc: |
|||
<pre> |
|||
LEXICON Root |
|||
NounIncorp ; |
|||
VerbRoot ; |
|||
LEXICON NounIncorp |
|||
варат:варат NounIncorpInfl ; |
|||
ватап:ватап NounIncorpInfl ; |
|||
LEXICON NounIncorpInfl |
|||
%<n%>%<incorp%>%+:%>%{ы%} VerbRoot ; |
|||
LEXICON VerbRoot |
|||
амэчатык:амэчат VerbInfl ; |
|||
анӈатык:анӈат VerbInfl ; |
|||
LEXICON VerbInfl |
|||
%<v%>: # ; |
|||
</pre> |
|||
Roughly equivalent monodix (replacing <code>>{ы}</code> with <code>ы</code> since these are for composing with [[Twol]]) |
|||
<pre> |
|||
<pardef n="noun_root"> |
|||
<e><i>варат</i></e> |
|||
<e><i>ватап</i></e> |
|||
</pardef> |
|||
<pardef n="verb_root"> |
|||
<e><p> <l>амэчат</l> <r>амэчатык</r> </p></e> |
|||
<e><p> <l>анӈат</l> <r>анӈатык</r> </p></e> |
|||
</pardef> |
|||
<pardef n="verb_infl"> |
|||
<e><p> <l></l> <r><s n="v"/></r> </p></e> |
|||
</pardef> |
|||
<pardef n="noun_incorp_infl"> |
|||
<e><p> <l>ы</l> <r><s n="n"/><s n="incorp"/><j/></r> </p></e> |
|||
</pardef> |
|||
<e> <par n="verb_root"/> <par n="verb_infl"/> </e> |
|||
<e> <par n="noun_root"/> <par n="incorp_infl"/> <par n="verb_root"/> <par n="verb_infl"/> </e> |
|||
</pre> |
|||
== See Also == |
|||
[[Apertium stream format]] |
|||
[[Category:Documentation in English]] |
Revision as of 22:53, 5 April 2021
A single surface form can correspond to multiple lexical units. When this happens, the lexical units are connected with +
.
Minimal Example: English Possessives
An example of where this might be useful is English plurals and possessives:
^dog/dog<n><sg>$ ^dogs/dog<n><pl>$ ^dog's/dog<n><sg>+'s<gen>$ ^dogs'/dog<n><pl>+'s<gen>$
In monodix this is written with <j/>
<pardef n="dog__n"> <e><p> <l></l> <r><s n="n"/><s n="sg"/></r> </p></e> <e><p> <l>s</l> <r><s n="n"/><s n="pl"/></r> </p></e> <e><p> <l>'s</l> <r><s n="n"/><s n="sg"/><j/>'s<s n="gen"/></r> </p></e> <e><p> <l>s'</l> <r><s n="n"/><s n="pl"/><j/>'s<s n="gen"/></r> </p></e> </pardef> <e lm="dog"><i>dog</i><par n="dog__n"/></e>
In lexc this is written with %+
LEXICON NounInfl %<n%>%<sg%>: # ; %<n%>%<pl%>:s # ; %<n%>%<sg%>%+'s%<gen%>:'s # ; %<n%>%<pl%>%+'s%<gen%>:s' # ; LEXICON NounRoot dog:dog NounInfl ;
In lexd this is written with +
LEXICON NounNumPos <sg>: <pl>:s <sg>+'s<gen>:'s <pl>+'s<gen>:s' LEXICON NounRoot dog:dog PATTERNS NounRoot NounNumPos
More Involved Example: Chukchi Incorporation
Chukchi can incorporate nouns into verbs. A simplified example is given below:
LEXICON VerbRoot амэчатык:амэчат анӈатык:анӈат LEXICON NounRoot варат ватап PATTERN VerbStem VerbRoot NounRoot [<n><incorp>+:>{ы}] VerbRoot PATTERNS VerbStem [<v>:]
This generates the forms
^амэчат/амэчатык<v>$ ^анӈат/анӈатык<v>$ ^варат>{ы}амэчат/варат<n><incorp>+амэчатык<v>$ ^варат>{ы}анӈат/варат<n><incorp>+анӈатык<v>$ ^ватап>{ы}амэчат/ватап<n><incorp>+амэчатык<v>$ ^ватап>{ы}анӈат/ватап<n><incorp>+анӈатык<v>$
Equivalent lexc:
LEXICON Root NounIncorp ; VerbRoot ; LEXICON NounIncorp варат:варат NounIncorpInfl ; ватап:ватап NounIncorpInfl ; LEXICON NounIncorpInfl %<n%>%<incorp%>%+:%>%{ы%} VerbRoot ; LEXICON VerbRoot амэчатык:амэчат VerbInfl ; анӈатык:анӈат VerbInfl ; LEXICON VerbInfl %<v%>: # ;
Roughly equivalent monodix (replacing >{ы}
with ы
since these are for composing with Twol)
<pardef n="noun_root"> <e><i>варат</i></e> <e><i>ватап</i></e> </pardef> <pardef n="verb_root"> <e><p> <l>амэчат</l> <r>амэчатык</r> </p></e> <e><p> <l>анӈат</l> <r>анӈатык</r> </p></e> </pardef> <pardef n="verb_infl"> <e><p> <l></l> <r><s n="v"/></r> </p></e> </pardef> <pardef n="noun_incorp_infl"> <e><p> <l>ы</l> <r><s n="n"/><s n="incorp"/><j/></r> </p></e> </pardef> <e> <par n="verb_root"/> <par n="verb_infl"/> </e> <e> <par n="noun_root"/> <par n="incorp_infl"/> <par n="verb_root"/> <par n="verb_infl"/> </e>