Difference between revisions of "Kazakh and Tatar"
Revision as of 00:11, 6 June 2012
This is a language pair translating between Kazakh and Tatar.
General TODO
See /Work_plan.
- Current output is сөйлесгенде; it should be сөйлескенде.
- Declension of Tatar nouns ending in -и.
- Set up the bidix-with-context.sh script (see apertium-kaz-tat/dev/bidix; it seems to be very useful, but requires another script from spectie).
- Add some of the short Wikipedia-article-like texts I have for evaluation to texts (should be ~200 words).
- Implement a continuation class for compound/multiword nouns which already have a possessive ending (<px3sp>), e.g. Қытай Халық Республикасы.
  - This continuation class should link only to CASE (but consider that some of them can have a plural form: ишегаллары).
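The first item above (сөйлесгенде vs. сөйлескенде) looks like a missing devoicing step: a suffix-initial г should surface as к after a stem-final voiceless consonant. The following is only a minimal Python sketch of the expected behaviour, not the pair's actual morphophonology rules; the function name `attach_suffix` and the consonant set are assumptions made for illustration.

```python
# Illustrative sketch only: NOT the actual rules used in apertium-kaz-tat.
# Assumption: suffix-initial г/ғ devoices to к/қ after a voiceless consonant.
VOICELESS = set("пфкқтсшщхцчһ")   # assumed set of voiceless consonants
DEVOICE = {"г": "к", "ғ": "қ"}


def attach_suffix(stem: str, suffix: str) -> str:
    """Concatenate stem and suffix, devoicing the suffix-initial г/ғ
    when the stem ends in a voiceless consonant."""
    if stem and suffix and stem[-1] in VOICELESS and suffix[0] in DEVOICE:
        suffix = DEVOICE[suffix[0]] + suffix[1:]
    return stem + suffix


print(attach_suffix("сөйлес", "генде"))  # stem ends in voiceless с
print(attach_suffix("кел", "генде"))     # stem ends in voiced л, no change
```

With this sketch, the stem сөйлес yields сөйлескенде, while a stem ending in a voiced consonant keeps the г.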
Part-of-speech related TODO's and DONE's can be found here:
To run the tests, use the aq-regtest utility from the Apertium-quality tools, e.g.:
aq-regtest -d . kaz-tat http://wiki.apertium.org/wiki/Special:Export/Kazakh_and_Tatar/Postadvebs
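If several test subpages need to be run, the command above can be assembled programmatically. This is a hedged sketch: the helper names `regtest_command` and `run_regtest` are hypothetical, and the only aq-regtest arguments used are the ones shown in the example command (`-d`, the pair name, and the Special:Export URL).

```python
import subprocess
from typing import List
from urllib.parse import quote

# Base of the export URL, taken from the example command above.
EXPORT_BASE = "http://wiki.apertium.org/wiki/Special:Export/"


def regtest_command(subpage: str, pair: str = "kaz-tat",
                    directory: str = ".") -> List[str]:
    """Build the aq-regtest argument list for one test subpage
    of the Kazakh_and_Tatar wiki page (hypothetical helper)."""
    url = EXPORT_BASE + quote("Kazakh_and_Tatar/" + subpage)
    return ["aq-regtest", "-d", directory, pair, url]


def run_regtest(subpage: str) -> int:
    """Run aq-regtest for the subpage; requires aq-regtest on PATH."""
    return subprocess.run(regtest_command(subpage)).returncode


print(" ".join(regtest_command("Postadvebs")))
```

Calling `run_regtest("Postadvebs")` would then execute the same command as in the example, assuming aq-regtest is installed.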
Done
- But keep an eye on this
- Numerals
  - kaz <num><subst>(<px3>) in fractions[1] = tat <num><subst>(<px3>)
  - kaz <num><coll><advl> = tat <num><coll>
  - kaz <num><coll><subst> = tat <num><subst>
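The numeral correspondences listed above can be summarised as a small lookup table. The sketch below is only an illustration in Python; in the language pair itself such mappings would live in the dictionaries or transfer rules, and the names `NUMERAL_TAG_MAP` and `map_numeral_tags` are hypothetical.

```python
# Illustration of the kaz -> tat numeral tag correspondences listed above.
# Not the pair's actual transfer rules; names here are hypothetical.
NUMERAL_TAG_MAP = {
    "<num><coll><advl>": "<num><coll>",
    "<num><coll><subst>": "<num><subst>",
}


def map_numeral_tags(kaz_tags: str) -> str:
    """Rewrite a Kazakh numeral tag prefix to its Tatar counterpart.
    Tag strings with no entry (e.g. <num><subst>) pass through unchanged."""
    for src, dst in NUMERAL_TAG_MAP.items():
        if kaz_tags.startswith(src):
            return dst + kaz_tags[len(src):]
    return kaz_tags


print(map_numeral_tags("<num><coll><advl>"))
print(map_numeral_tags("<num><subst><px3>"))
```

Tag sequences that are identical on both sides, such as <num><subst>(<px3>), need no entry and are returned as they are.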
Notes
- [1] Currently, whether the numeral occurs in a fraction or not is not taken into account.