Task ideas for Google Code-in/Add words from frequency list
Jump to navigation
Jump to search
Using
Using
Examples
The paradigms (inflectional classes) will be different depending on the dictionary format and the language in question. When in doubt, ask your mentor for help.
Using .dix
- See also: Starting a new language with lttoolbox
When using lttoolbox you will also need to find:
- the stem of the word, that is the part onto which inflectional endings are added.
- e.g. the stem for "wolf" is "wol" because the singular is "wol + f" and the plural is "wol + ves".
- the paradigm of the word. Paradigms in the
.dix
file come inpardef
elements. Find the one that given your stem generates all the valid surface forms of the lemma.
If a paradigm for the word does not exist then you will need to add a new one. Ask your mentor for help with this.
When adding nouns, depending on the language, you should be careful with the following:
- What gender is the noun ?
- Does the noun exist in both singular and plural ?
- Is the noun animate or inanimate ?
Before | After |
---|---|
n ^3570/3570<num>$ ^горад/горад$ n ^2491/2491<num>$ ^тэрыторыі/тэрыторыя$ n ^2409/2409<num>$ ^вайны/вайна$ n ^2316/2316<num>$ ^цэнтр/цэнтр$ |
<e lm="горад"><i>горад</i><par n="..."/></e> <e lm="тэрыторыя"><i>тэрыторы</i><par n="..."/></e> <e lm="вайна"><i>вайн</i><par n="..."/></e> <e lm="цэнтр"><i>цэнтр</i><par n="..."/></e> |