Weighted transfer rules

From Apertium
Revision as of 11:57, 13 April 2018 by Deltamachine (talk | contribs) (Created page with "== Related links == Idea description [[Weighted_transfer_rules_at_GSoC_2016|Nikita Medyankin's project at GSoC 201...")

Related links

Idea description

Nikita Medyankin's project at GSoC 2016

https://github.com/apertium/apertium-weights-learner/tree/629b48b306116565bc1d748c298bc28b41506f63

https://svn.code.sf.net/p/apertium/svn/branches/weighted-transfer/

Fixes

Nikita's code should now work. To run it, download three components: apertium-weights-learner (the experimental branch) from https://github.com/apertium/apertium-weights-learner/tree/experimental, the English - Spanish language pair with ambiguous rules from https://github.com/apertium/apertium-en-es/tree/ambiguous-rules, and the Apertium core with the modified transfer module from https://svn.code.sf.net/p/apertium/svn/branches/weighted-transfer/apertium/.
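The three downloads can be scripted. The following is a minimal sketch, not an official setup script: the URLs and branch names come from the links in the paragraph above, while the local directory names and the helper names (fetch_command, fetch_all) are assumptions made for illustration. By default it only prints the commands; building and installing the checked-out code is not covered.

```python
import shlex
import subprocess

# (vcs, url, branch, target directory). URLs and branches are taken from the
# links above; the target directory names are illustrative assumptions.
SOURCES = [
    ("git", "https://github.com/apertium/apertium-weights-learner",
     "experimental", "apertium-weights-learner"),
    ("git", "https://github.com/apertium/apertium-en-es",
     "ambiguous-rules", "apertium-en-es"),
    ("svn", "https://svn.code.sf.net/p/apertium/svn/branches/weighted-transfer/apertium/",
     None, "apertium-weighted-transfer"),
]


def fetch_command(vcs, url, branch, target):
    """Build the argument list that fetches one component."""
    if vcs == "git":
        return ["git", "clone", "--branch", branch, url, target]
    if vcs == "svn":
        return ["svn", "checkout", url, target]
    raise ValueError("unsupported version control system: " + vcs)


def fetch_all(dry_run=True):
    """Print every fetch command; execute it as well unless dry_run is set."""
    commands = [fetch_command(*source) for source in SOURCES]
    for command in commands:
        print(shlex.join(command))
        if not dry_run:
            subprocess.run(command, check=True)
    return commands


# Dry run: show the commands without touching the network.
commands = fetch_all(dry_run=True)
```

Calling fetch_all(dry_run=False) would actually run the commands, which requires git and svn to be installed.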

Coverages

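For each of 5 language pairs, the number of all possible coverages was calculated for 100 random sentences, and this calculation was repeated 100 times. The mean number of coverages for each language pair and corpus is given below.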

Language pair             Corpus                    Mean number of coverages
English - Spanish         Tatoeba                   3.72
English - Spanish         Europarl                  194.35
Spanish - Catalan         Tatoeba                   2.94
Spanish - Catalan         Europarl                  53.04
Basque - Spanish          Tatoeba                   9.19
Swedish - Norwegian       Europarl                  488.57
Crimean Tatar - Turkish   Crimean Tatar Wikipedia   3.12
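To make the measurement concrete, here is a small sketch of how such a mean could be computed. It is not the code from apertium-weights-learner and rests on two assumptions: a coverage is taken to be a way of splitting a sentence into consecutive spans, each matched by one transfer rule pattern, and the reported figure is taken to be the mean over repeated random samples. The function names, the tag-sequence representation of sentences and the toy rules are all hypothetical.

```python
import random
from statistics import mean


def count_coverages(tags, rules):
    """Count the ways to split `tags` into consecutive spans, each matched by a rule.

    `rules` is a list of tag patterns (tuples). A pattern listed twice stands
    for two ambiguous rules and is counted twice.
    """
    n = len(tags)
    ways = [0] * (n + 1)  # ways[i] = number of coverages of the first i tags
    ways[0] = 1
    for end in range(1, n + 1):
        for pattern in rules:
            start = end - len(pattern)
            if start >= 0 and tuple(tags[start:end]) == pattern:
                ways[end] += ways[start]
    return ways[n]


def mean_coverages(sentences, rules, sample_size=100, repeats=100, seed=0):
    """Average the per-sentence coverage count over repeated random samples."""
    rng = random.Random(seed)
    k = min(sample_size, len(sentences))
    sample_means = []
    for _ in range(repeats):
        sample = rng.sample(sentences, k)
        sample_means.append(mean(count_coverages(s, rules) for s in sample))
    return mean(sample_means)


# Toy rule set: two ambiguous rules share the pattern ("adj", "n").
toy_rules = [("det",), ("adj",), ("n",), ("adj", "n"), ("adj", "n")]
toy_sentence = ["det", "adj", "n"]
print(count_coverages(toy_sentence, toy_rules))
```

The dynamic-programming count avoids enumerating the coverages explicitly, which matters because the number of coverages can grow quickly with sentence length, as the Europarl figures in the table suggest.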