Difference between revisions of "Hindi and Urdu"
(out of date)
Line 1: | Line 1: | ||
Some pending tasks:
* Convert M. Humayoun's Urdu Morphology to [[lttoolbox]] format (probably using a full-form list and the [[speling tools]])
* Create a bilingual dictionary for all words in the Urdu morphology (some can be extracted from Wiktionary; see the <code>dev/</code> directory in the incubator module)
* Make sure tagsets are consistent between Humayoun, IIIT and Apertium (see [[List of symbols]])
* Train part-of-speech taggers for both Urdu and Hindi
* Finish the conversion of the IIIT Hindi analyser (see [[Hindi]]; verbs still need to be converted, and other categories checked)
* Write transfer rules, if any are needed
* Retrain the part-of-speech taggers with [[target-language tagger training]]
* Run quality controls (see [[Quality control]])
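The first task above (turning a full-form list into lttoolbox entries) could be sketched roughly as follows. This is only an illustration, not the actual conversion script: the tab-separated input layout (surface form, lemma, dot-separated tags), the function names and the tag names in the sample are all assumptions, and the real tags would have to follow whatever tagset is agreed on in [[List of symbols]].

```python
from xml.sax.saxutils import escape


def full_form_to_entry(surface, lemma, tags):
    """Build one lttoolbox <e> element mapping a surface form to lemma + tags."""
    # Each tag becomes an <s n="..."/> symbol; quotes are escaped for attribute safety.
    symbols = "".join('<s n="{}"/>'.format(escape(t, {'"': "&quot;"})) for t in tags)
    # In lttoolbox dictionaries a space inside <l>/<r> is written as <b/>.
    left = escape(surface).replace(" ", "<b/>")
    right = escape(lemma).replace(" ", "<b/>")
    return "<e><p><l>{}</l><r>{}{}</r></p></e>".format(left, right, symbols)


def convert(lines):
    """Convert lines of the ASSUMED format 'surface<TAB>lemma<TAB>tag.tag.tag'."""
    entries = []
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        surface, lemma, tag_field = line.split("\t")
        entries.append(full_form_to_entry(surface, lemma, tag_field.split(".")))
    return entries


if __name__ == "__main__":
    # Illustrative sample only; the tags are placeholders, not an agreed tagset.
    sample = ["لڑکا\tلڑکا\tn.m.sg.nom", "لڑکے\tلڑکا\tn.m.pl.nom"]
    for entry in convert(sample):
        print(entry)
```

The generated entries would still need to be placed inside a <code>&lt;section&gt;</code> of a dictionary file whose <code>&lt;sdefs&gt;</code> block declares every symbol used, before compiling with <code>lt-comp</code>.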
==See also==
Line 20: | Line 11: | ||
* [http://www.lama.univ-savoie.fr/~humayoun/UrduMorph/ UrduMorph]
[[Category:Hindi and Urdu]]
[[Category:Hindi and Urdu|*]]
[[Category:Hindi]]