Ideas for Google Summer of Code/Bidix lookup and maintenance
< Ideas for Google Summer of Code
Jump to navigation
Jump to search
Revision as of 16:12, 13 March 2020 by Popcorndude (talk | contribs) (Created page with "Things to have in the interface: * Paradigm generation ** probably not editable (yet) ** get it by expanding monolingual transducer ** See also: https://github.com/apertium/a...")
Things to have in the interface:
- Paradigm generation
- probably not editable (yet)
- get it by expanding monolingual transducer
- See also: https://github.com/apertium/apertium-paradigmatrix
- Translations
- from parsing bidix
- deciding which form to display might be an interesting challenge
- indicate default translations by parsing .lrx
- See also: https://github.com/apertium/apertium-html-tools/issues/105
- Phrases
- get these from .lsx and maybe also .lrx
Ranking translations probably doesn't exhaust the information that can be extracted from .lrx, and we might as well display the rest and make it editable as well, even if that's information that doesn't normally appear in a dictionary.
If the user edits something and doesn't have a github account, a bot should make the PR for them (maybe? are we worried about spam?). If they do have an account they should log in using OAuth.
Coding Challenge
A program that takes 2 language codes (abc and xyz) and a surface form in abc and displays a list of surface forms in xyz of all translations, sorted by part of speech.