Difference between revisions of "User:Firespeaker/Removing bidix trimming"

From Apertium
Jump to navigation Jump to search
Line 31: Line 31:
 
All of this is hypothetical (all of the above is current behaviour).
 
All of this is hypothetical (all of the above is current behaviour).
   
  +
Use source language information for
  +
* transfer rules
  +
* generation, with pseudo-lemmas from source language
 
<pre>
 
<pre>
 
$ echo "Айгүл күчүктү байкаптыр." | apertium -d . kir-eng
 
$ echo "Айгүл күчүктү байкаптыр." | apertium -d . kir-eng
Line 41: Line 44:
 
OR
 
OR
   
  +
Use source language information for
  +
* transfer rules
 
<pre>
 
<pre>
 
$ echo "Айгүл күчүктү байкаптыр." | apertium -d . kir-eng
 
$ echo "Айгүл күчүктү байкаптыр." | apertium -d . kir-eng

Revision as of 06:49, 22 June 2020

For transfer

OOV handled poorly with trimming

$ echo "Айгүл күчүктү издептир." | apertium -d . kir-eng
Aygül looked for puppy.

$ echo "Айгүл күчүктү издебептир." | apertium -d . kir-eng
Aygül did not look for puppy.
$ echo "Айгүл күчүктү байкаптыр." | apertium -d . kir-eng
#Aygül puppy *байкаптыр.

$ echo "Айгүл күчүктү байкабаптыр." | apertium -d . kir-eng
#Aygül puppy *байкабаптыр.

OOV handled much better without trimming

$ echo "^Айгүл<np><ant><f><nom>$ ^күчүк<n><acc>$ ^байка<v><tv><ifi><evid><p3><sg>$^.<sent>$^.<sent>$" | rest-of-pipeline
Aygül @байка puppy..

$ echo "^Айгүл<np><ant><f><nom>$ ^күчүк<n><acc>$ ^байка<v><tv><neg><ifi><evid><p3><sg>$^.<sent>$^.<sent>$" | rest-of-pipeline
Aygül did not @байка puppy..

Ideal OOV handling

All of this is hypothetical (all of the above is current behaviour).

Use source language information for

  • transfer rules
  • generation, with pseudo-lemmas from source language
$ echo "Айгүл күчүктү байкаптыр." | apertium -d . kir-eng
Aygül @байка-ed puppy.

$ echo "Айгүл күчүктү байкабаптыр." | apertium -d . kir-eng
Aygül did not @байка puppy.

OR

Use source language information for

  • transfer rules
$ echo "Айгүл күчүктү байкаптыр." | apertium -d . kir-eng
Aygül @байкабаптыр puppy.

$ echo "Айгүл күчүктү байкабаптыр." | apertium -d . kir-eng
Aygül did not @байкабаптыр puppy.