Ideas for Google Summer of Code/UD and Apertium integration
Jump to navigation
Jump to search
Tasks
This project would involve working on a number of tasks from the following list:
- UD annotatrix
- An HTML/JS interface for treebank annotation
- lttoolbox relabelling (tagset conversion)
- Seamless conversion between Apertium and UD tagsets
- UDpipe --- lttoolbox integration
- Use Apertium morphological analysers to be soft constraints on lemmatisation and POS/MSD tagging.
- UD mode for language modiules:
- calling `apertium qtz-ud` should produce conllu file with LEMMA, POS, FEATs, MISC fields filled in
- Your idea(s) here
Coding challenge
- Fix one issue in UD annotatrix and send a pull request
- Train UDpipe for a language that is also in Apertium
- Write a tagset equivalence file for a language that is in both Apertium and UD.