User:Mlforcada/sandbox/GSoC
Jump to navigation
Jump to search
Task | Difficulty | Description | Rationale | Requirements | Interested mentors | ||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Easy dictionary maintenance | 2. Hard | Write code that simplifies the maintenance of the single-word part of Apertium monolingual and bilingual dictionaries. This involves building an application that parses and reads the open-class (noun, adjective, verb) single-word part of the dictionary amenable to simple, data-base-like treatment, saving the remaining (hard to treat) part of the dictionaries, allows the user to easily add words (together with their inflection paradigms) through a friendly user interface and then combines the extended single-word data with the remaining data into Apertium monolingual and bilingual dictionaries ready to be compiled. Ideas and code from Apertium-dixtools could be useful. | Apertium dictionaries are very heterogeneous, but a great part of the development of a language pair consists in adding single words to monolingual and bilingual dictionaries, and, indeed, work on this part of the dictionaries is crucial for coverage and usefulness. Currently, dictionary maintenance is difficult because it involves editing an XML file. This may be slowing down the development of many language pairs. | Knowledge of XML, XSLT and one programming language that allows XML processing and easy writing of a user interface | - | what | how hard | description | rationale | requirements | - } |