Difference between revisions of "Hindi and Urdu"

From Apertium
(out of date)
 
Some pending tasks:


* Convert M. Humayoun's Urdu Morphology to [[lttoolbox]] format (probably using a full-form list and the [[speling tools]])
* Create a bilingual dictionary covering all words in the Urdu morphology (some entries can be extracted from Wiktionary; see the <code>dev/</code> directory in the incubator module)
* Make sure tagsets are consistent between Humayoun, IIIT and Apertium (see [[List of symbols]])
* Train part-of-speech taggers for both Urdu and Hindi.
* Finish conversion of the IIIT Hindi analyser (see [[Hindi]]): verbs still need to be converted, and the other categories need to be checked
* Write transfer rules, if any are needed
* Retrain part-of-speech taggers with [[target-language tagger training]].
* Run quality controls (see [[Quality control]])
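
As a rough illustration of the first task, a full-form list could be serialised into speling-style lines before being handed to the [[speling tools]]. The sketch below assumes the four-field layout <code>lemma; surface form; tags; part-of-speech</code>, and the function name <code>to_speling</code>, the tag names and the sample Urdu entry are all hypothetical. It is not taken from Humayoun's resource or from the incubator module.

```python
def to_speling(entries):
    """Turn (lemma, surface, tags, pos) tuples into speling-style lines.

    Assumption: each output line has four fields separated by '; ',
    with the morphological tags joined by dots.
    """
    lines = []
    for lemma, surface, tags, pos in entries:
        lines.append(f"{lemma}; {surface}; {'.'.join(tags)}; {pos}")
    return lines


# Hypothetical sample entry; the tags are illustrative only.
sample = [("لڑکا", "لڑکے", ["pl", "nom"], "n.m")]
for line in to_speling(sample):
    print(line)
```

The tag names used in a real conversion would have to match the agreed tagset (see [[List of symbols]]).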


==See also==
* [http://www.lama.univ-savoie.fr/~humayoun/UrduMorph/ UrduMorph]


[[Category:Hindi and Urdu]]
[[Category:Hindi and Urdu|*]]
[[Category:Hindi]]

Latest revision as of 12:50, 24 May 2014