Ideas for Google Summer of Code/Unsupervised weighting of automata

From Apertium

< Ideas for Google Summer of Code

Revision as of 17:05, 29 March 2017 by Francis Tyers (talk | contribs) (→‎Coding challenge)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Jump to navigation Jump to search

Coding challenge

Install HFST
Install lttoolbox
Define an evaluation metric --- talk to your mentor
Perform a baseline experiment using a tagged corpus:
- Select a language
- Split the corpus into 90% training, 10% testing (or use existing test/train split)
- Use the Apertium morphological analyser to analyse the test data
- Rank the analyses produced using the training data
- Compare this ranking to the default order from the transducer, and to a "random" ranking using your metric

Retrieved from "https://wiki.apertium.org/w/index.php?title=Ideas_for_Google_Summer_of_Code/Unsupervised_weighting_of_automata&oldid=62158"

Ideas for Google Summer of Code