Difference between revisions of "Kazakh and Tatar"
		
		
		
		
		
		
| Line 5: | Line 5: | ||
| See [[/Work_plan]]. | See [[/Work_plan]]. | ||
| # Current output is ''сөйлесгенде'', should be ''сөйлескенде''. | |||
| # Declension of Tatar nouns ending with -и. | # Declension of Tatar nouns ending with -и. | ||
| # Set up <code>bidix-with-context.sh</code> script (see <code>apertium-kaz-tat/dev/bidix</code>; seems to be very useful, requires another script from spectie). | # Set up <code>bidix-with-context.sh</code> script (see <code>apertium-kaz-tat/dev/bidix</code>; seems to be very useful, requires another script from spectie). | ||
| Line 16: | Line 15: | ||
| ## add them to tat.lexc too; | ## add them to tat.lexc too; | ||
| ## change <code>LEXICON NUM-ROMAN</code> to something like this: <code>%<num%>%<ord%>: # ; </code>. | ## change <code>LEXICON NUM-ROMAN</code> to something like this: <code>%<num%>%<ord%>: # ; </code>. | ||
| === Twol-related stuff === | |||
| # Current: <code>^сөйле<v><tv><coop><ger_past><loc>$ --> сөйлесгенде</code> Should be: <code>^сөйле<v><tv><coop><ger_past><loc>$ --> сөйлескенде</code> | |||
| ---- | ---- | ||
Revision as of 14:26, 8 June 2012
This is a language pair translating between Kazakh and Tatar.
General TODO
See /Work_plan.
- Declension of Tatar nouns ending with -и.
- Set up the bidix-with-context.sh script (see apertium-kaz-tat/dev/bidix; seems to be very useful, requires another script from spectie).
- Add some of the short Wikipedia-article-like texts I have for evaluation into texts (should be ~200 words).
- Implement a continuation class for compound/multiword nouns which already have a possessive ending (<px3sp>), e.g. Қытай Халық Республикасы.
  - This continuation class should link only to CASE (but consider that some of them can have a plural form: ишегаллары).
 
- Add "ярты", "ярым" and "чирек" as numerals, but don't link them to common numerals cont. class.
- (Lexical selection rule): сондай-ақ > шулай-ук
- Fix Roman numerals:
  - add them to tat.lexc too;
  - change LEXICON NUM-ROMAN to something like this: %<num%>%<ord%>: # ;.
 
Twol-related stuff
- Current: ^миллион<num><subst><dat>$ --> миллионге Should be: ^миллион<num><subst><dat>$ --> миллионға
- Current: ^сөйле<v><tv><coop><ger_past><loc>$ --> сөйлесгенде Should be: ^сөйле<v><tv><coop><ger_past><loc>$ --> сөйлескенде
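The two expected forms follow from suffix harmony: the suffix-initial velar agrees in backness with the stem's last unambiguous vowel and is voiceless after a voiceless stem-final consonant. A minimal Python sketch of that check, assuming a deliberately simplified rule set (the letter sets and function names below are illustrative and are not taken from the actual twol file):

```python
# Simplified sketch of Kazakh suffix harmony; NOT the real twol rules.
# Assumption: и and у are ambiguous for harmony and are skipped.
BACK_VOWELS = set("аоұыя")
FRONT_VOWELS = set("әеөүіэ")
VOICELESS = set("пфкқтсшщхцчһ")


def is_back(stem: str) -> bool:
    """Return True if the last unambiguous vowel of the stem is back."""
    for ch in reversed(stem.lower()):
        if ch in BACK_VOWELS:
            return True
        if ch in FRONT_VOWELS:
            return False
    return False  # assumption: default to front when nothing decides


def initial_velar(stem: str) -> str:
    """Pick the suffix-initial consonant: ғ/қ (back) or г/к (front)."""
    voiceless = stem[-1].lower() in VOICELESS
    if is_back(stem):
        return "қ" if voiceless else "ғ"
    return "к" if voiceless else "г"


def dative(stem: str) -> str:
    """Attach the dative suffix (-ға/-қа/-ге/-ке)."""
    return stem + initial_velar(stem) + ("а" if is_back(stem) else "е")


def past_gerund_locative(stem: str) -> str:
    """Attach the past gerund plus locative (-ған/-қан/-ген/-кен + -да/-де)."""
    back = is_back(stem)
    gerund = initial_velar(stem) + ("ан" if back else "ен")
    return stem + gerund + ("да" if back else "де")


if __name__ == "__main__":
    print(dative("миллион"))               # stem with a back last vowel
    print(past_gerund_locative("сөйлес"))  # cooperative stem сөйле + с
```

The sketch only covers the two cases listed above; stems ending in voiced stops and other special cases would need extra rules.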
Part-of-speech related TODO's and DONE's can be found here:
To run tests, use the aq-regtest utility from the Apertium-quality tools, e.g.:
aq-regtest -d . kaz-tat http://wiki.apertium.org/wiki/Special:Export/Kazakh_and_Tatar/Postadvebs
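When several test subpages have to be checked, the command line above can be assembled programmatically. A minimal Python sketch, assuming the same argument order as the example above (the helper name `regtest_command` and its defaults are hypothetical; only the `-d` flag shown above is used):

```python
from urllib.parse import quote

# Base of the wiki export URL, as used in the command above.
EXPORT_BASE = "http://wiki.apertium.org/wiki/Special:Export/"


def regtest_command(subpage, pair="kaz-tat", directory=".",
                    page="Kazakh_and_Tatar"):
    """Build the aq-regtest argument list for one test subpage.

    Hypothetical helper: it only mirrors the invocation shown above
    and does not run anything itself.
    """
    url = EXPORT_BASE + quote(f"{page}/{subpage}")
    return ["aq-regtest", "-d", directory, pair, url]


if __name__ == "__main__":
    # Print the command instead of running it, so the sketch works
    # even where aq-regtest is not installed.
    print(" ".join(regtest_command("Postadvebs")))
```

With aq-regtest installed, the returned list can be passed to `subprocess.run(cmd, check=True)` from the language pair directory.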
Done
- But keep an eye on this
- Numerals
  - kaz <num><subst>(<px3>) in fractions [1] = tat <num><subst>(<px3>)
  - kaz <num><coll><advl> = tat <num><coll>
  - kaz <num><coll><subst> = tat <num><subst>
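The numeral correspondences above amount to a small tag-rewriting table. A minimal Python sketch of that mapping, assuming tags are compared as a prefix of the tag sequence and any trailing tags are carried over unchanged (the names `NUMERAL_TAG_MAP` and `map_numeral_tags` are illustrative and do not come from the transfer rules):

```python
import re

# Kazakh tag prefix -> Tatar tag prefix, taken from the list above.
# <num><subst>(<px3>) is identical on both sides, so it needs no entry.
NUMERAL_TAG_MAP = {
    ("num", "coll", "advl"): ("num", "coll"),
    ("num", "coll", "subst"): ("num", "subst"),
}


def map_numeral_tags(kaz_tags: str) -> str:
    """Rewrite a Kazakh numeral tag string into its Tatar counterpart.

    Assumption: tags after the matched prefix are kept as they are.
    """
    tags = re.findall(r"<([^<>]+)>", kaz_tags)
    for src, dst in NUMERAL_TAG_MAP.items():
        if tuple(tags[:len(src)]) == src:
            tags = list(dst) + tags[len(src):]
            break
    return "".join(f"<{t}>" for t in tags)


if __name__ == "__main__":
    for example in ("<num><coll><advl>", "<num><coll><subst>", "<num><subst><px3>"):
        print(example, "=", map_numeral_tags(example))
```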
 
Notes
- [1] Currently, whether it is in fractions or not is not taken into account.

