Difference between revisions of "Ideas for Google Summer of Code/Unsupervised weighting of automata"

From Apertium

Revision as of 17:05, 29 March 2017

Coding challenge

Install HFST
Install lttoolbox
Define an evaluation metric (talk to your mentor)
Perform a baseline experiment using a tagged corpus:
- Select a language
- Split the corpus into 90% training and 10% testing (or use an existing train/test split)
- Use the Apertium morphological analyser to analyse the test data
- Rank the analyses produced, using the training data to score them
- Compare this ranking to the default order from the transducer, and to a "random" ranking
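The baseline steps above can be sketched roughly as follows. This is only an illustration under several assumptions: the tagged corpus is already loaded as a list of sentences, each a list of (surface form, gold analysis) pairs; the ranking uses plain frequency counts of gold analyses in the training data; and the metric is top-1 accuracy, which is a placeholder until a metric is agreed with the mentor. All function names and `TOY_ANALYSER` are hypothetical. In a real experiment `analyse` would wrap the Apertium morphological analyser rather than a dictionary.

```python
import random
from collections import Counter

# Hypothetical stand-in for the morphological analyser: maps a surface form
# to its analyses in the transducer's default output order.
TOY_ANALYSER = {
    "saw": ["saw<n><sg>", "see<vblex><past>"],
    "the": ["the<det><def><sp>"],
}


def split_corpus(sentences, train_fraction=0.9, seed=0):
    """Shuffle the sentences and split them into training and test parts."""
    shuffled = list(sentences)
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


def count_analyses(train):
    """Count how often each gold analysis occurs in the training data."""
    counts = Counter()
    for sentence in train:
        for _surface, gold in sentence:
            counts[gold] += 1
    return counts


def rank_by_frequency(candidates, counts):
    """Order candidates by training frequency, most frequent first.

    sorted() is stable, so ties keep the transducer's default order.
    """
    return sorted(candidates, key=lambda analysis: -counts[analysis])


def rank_randomly(candidates, rng):
    """Return the candidates in a random order (the "random" baseline)."""
    shuffled = list(candidates)
    rng.shuffle(shuffled)
    return shuffled


def top1_accuracy(test, analyse, rank):
    """Fraction of test tokens whose top-ranked analysis equals the gold one."""
    correct = total = 0
    for sentence in test:
        for surface, gold in sentence:
            ranked = rank(analyse(surface))
            total += 1
            if ranked and ranked[0] == gold:
                correct += 1
    return correct / total if total else 0.0
```

Given a training and a test set, the three rankings can then be compared by calling `top1_accuracy` three times on the same test data: once with `lambda c: rank_by_frequency(c, counts)`, once with `lambda c: c` for the transducer's default order, and once with `lambda c: rank_randomly(c, rng)` using a seeded `random.Random` so the random baseline is reproducible.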

