Difference between revisions of "Lexc and flag diacritics for prefix tagging"

From Apertium
Jump to navigation Jump to search
(Created page with "This line is ignored by begiak. This page describes lexc+flag diacritics approach to move analyses in the end (suffixing style) for prefixing languages. The most logical way ...")
 
m
Line 27: Line 27:
 
This creates surface forms: {root1, root2, root3, plroot1, plroot2, plroot3} with analyses {%<sg%>%<n%>root1, %<sg%>%<n%>root2, ...} which does not work in further steps of apertium pipeline.
 
This creates surface forms: {root1, root2, root3, plroot1, plroot2, plroot3} with analyses {%<sg%>%<n%>root1, %<sg%>%<n%>root2, ...} which does not work in further steps of apertium pipeline.
   
To make it look like it was normal european suffixing language, we need to move the tags at the end, and one way to do it in lexc is flag diacritics...
+
To make it look like it was typical european suffixing language, we need to move the tags at the end, and one way to do it in lexc is flag diacritics...
   
 
<pre>
 
<pre>

Revision as of 10:37, 8 November 2018

This line is ignored by begiak. This page describes lexc+flag diacritics approach to move analyses in the end (suffixing style) for prefixing languages.

The most logical way to tag prefixing languages in lexc would be so:

Multichar_Symbols
%<n%>
%<sg%> 
%<pl%>

LEXICON Root
0 Prefixes ;


LEXICON Prefixes
%<sg%>:0 NounRoots
%<pl%>:pl NounRoots

LEXICON NounRoots

%<n%>root1:0root1 # ;
%<n%>root2:0root2 # ;
%<n%>root3:0root3 # ;

This creates surface forms: {root1, root2, root3, plroot1, plroot2, plroot3} with analyses {%<sg%>%<n%>root1, %<sg%>%<n%>root2, ...} which does not work in further steps of apertium pipeline.

To make it look like it was typical european suffixing language, we need to move the tags at the end, and one way to do it in lexc is flag diacritics...

Multichar_Symbols
%<n%>
%<sg%>
%<pl%>

@P.NTAG.ON@
@R.NTAG.ON@
@P.SG.ON@
@R.SG.ON@
@P.PL.ON@
@R.PL.ON@

LEXICON Root
0 Prefixes ;


LEXICON Prefixes
@P.SG.ON@ NounRoots
@P.PL.ON@:@P.PL.ON@pl NounRoots

LEXICON NounRoots

@P.NTAG.ON@root1:@P.NTAG.ON@root1 ENDLEX1 ;
@P.NTAG.ON@root2:@P.NTAG.ON@root2 ENDLEX1 ;
@P.NTAG.ON@root3:@P.NTAG.ON@root3 ENDLEX1 ;

LEXICON ENDLEX1

@R.NTAG.ON@%<n%>:@R.NTAG.ON@ ENDLEX2 ;

LEXICON ENDLEX2

@R.SG.ON@%<sg%>:@R.SG.ON@ ENDLEX3 ;
@R.PL.ON@%<pl%>:@R.PL.ON@ ENDLEX3 ;

LEXICON ENDLEX3

# ;