Difference between revisions of "Kazakh and Tatar"

From Apertium
Jump to navigation Jump to search
m
Line 7: Line 7:
 
# Current output is ''сөйлесгенде'', should be ''сөйлескенде''.
 
# Current output is ''сөйлесгенде'', should be ''сөйлескенде''.
 
# Declination of Tatar nouns ending with -и.
 
# Declination of Tatar nouns ending with -и.
  +
# Set up <code>bidix-with-context.sh</code> script (see <code>apertium-kaz-tat/dev/bidix</code>; seems to be very useful, requires another script from spectie).
  +
# Add some of the short wikipedia-article-like texts I have for evaluation to <code>texts</code> (should be ~200 words).
  +
# Implement cont. class for compound/multiword nouns which already have possessive ending (<px3sp>), e.g. ''Қытай Халық Республикасы''.
  +
## This continuation class should link only to CASE (but consider that some of them can have plural form: ''ишегаллары'').
   
 
Part-of-speech related TODO's and DONE's can be found here:
 
Part-of-speech related TODO's and DONE's can be found here:

Revision as of 00:11, 6 June 2012

This is a language pair translating between Kazakh and Tatar.

General TODO

See /Work_plan.

  1. Current output is сөйлесгенде, should be сөйлескенде.
  2. Declination of Tatar nouns ending with -и.
  3. Set up bidix-with-context.sh script (see apertium-kaz-tat/dev/bidix; seems to be very useful, requires another script from spectie).
  4. Add some of the short wikipedia-article-like texts I have for evaluation to texts (should be ~200 words).
  5. Implement cont. class for compound/multiword nouns which already have possessive ending (<px3sp>), e.g. Қытай Халық Республикасы.
    1. This continuation class should link only to CASE (but consider that some of them can have plural form: ишегаллары).

Part-of-speech related TODO's and DONE's can be found here:

To run tests, use aq-regtest utility from Apertium-quality tools. E.g.

aq-regtest -d . kaz-tat http://wiki.apertium.org/wiki/Special:Export/Kazakh_and_Tatar/Postadvebs

Done

But keep an eye on this
  • Numerals
    • kaz <num><subst>(<px3>) in fractions[1] = tat <num><subst>(<px3>)
    • kaz <num><coll><advl> = tat <num><coll>
    • kaz <num><coll><subst> = tat <num><subst>

Notes

  1. Currently whether it is in fractions or not is not taken into account

See also