Difference between revisions of "Hindi and Urdu"

Some pending tasks:
 
   
* Convert M. Humayoun's Urdu Morphology to [[lttoolbox]] format (probably using the full-form list and the [[speling tools]])
 
* Create a bilingual dictionary covering all words in the Urdu morphology (some entries can be extracted from Wiktionary; see the <code>dev/</code> directory in the incubator module)
 
* Make sure tagsets are consistent between Humayoun, IIIT and Apertium (see [[List of symbols]])
 
* Train part-of-speech taggers for both Urdu and Hindi.
 
* Finish conversion of the IIIT Hindi analyser (see [[Hindi]]); verbs still need to be converted, and the other categories checked.
 
* Write transfer rules, if any are needed
 
* Retrain part-of-speech taggers with [[target-language tagger training]].
 
* Run quality controls (see [[Quality control]])
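
As a rough illustration of the first task, the sketch below groups a full-form list by lemma and prints one semicolon-separated line per form, in a layout resembling what the [[speling tools]] consume. Everything here is an assumption made for illustration: the tab-separated input layout, the tag strings, the column order of the output, and the function names <code>parse_full_forms</code> and <code>to_speling_lines</code>. The actual export format of the Urdu morphology and the exact speling format should be checked before reusing any of this.

```python
from collections import defaultdict

# Hypothetical full-form list: surface, lemma, part of speech, tags (tab-separated).
# The real export of the Urdu morphology may look quite different.
SAMPLE = [
    "# surface\tlemma\tpos\ttags",
    "لڑکا\tلڑکا\tn\tm.sg.nom",
    "لڑکے\tلڑکا\tn\tm.pl.nom",
    "کتاب\tکتاب\tn\tf.sg.nom",
]


def parse_full_forms(lines):
    """Group (surface, tags) pairs under their (lemma, pos) key."""
    paradigms = defaultdict(list)
    for raw in lines:
        line = raw.strip()
        # Skip blank lines and comment lines.
        if not line or line.startswith("#"):
            continue
        surface, lemma, pos, tags = line.split("\t")
        paradigms[(lemma, pos)].append((surface, tags))
    return paradigms


def to_speling_lines(paradigms):
    """Emit one 'lemma; surface; tags; pos' line per form (assumed column order)."""
    out = []
    for (lemma, pos), forms in sorted(paradigms.items()):
        for surface, tags in forms:
            out.append(f"{lemma}; {surface}; {tags}; {pos}")
    return out


if __name__ == "__main__":
    for entry in to_speling_lines(parse_full_forms(SAMPLE)):
        print(entry)
```

Grouping by lemma and part of speech first makes it easier to spot lemmas with missing or duplicated forms before generating a dictionary from them.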
 
   
 
==See also==
 
 
* [http://www.lama.univ-savoie.fr/~humayoun/UrduMorph/ UrduMorph]
 
   
[[Category:Hindi and Urdu|*]]

Latest revision as of 12:50, 24 May 2014