Difference between revisions of "Kazakh and Tatar/Diary"

From Apertium
Jump to navigation Jump to search
m
Line 3: Line 3:
; Checking & refactoring clitics
; Checking & refactoring clitics


Some of the clitics appear only after certain forms (e.g. "шы<mod_foc>" in Kazakh, which expresses politeness, joins only 2nd person singular). And vice versa - some of the forms can get only certain clitics (imperative forms get only "чы" and "сана" in Tatar e.g.)
Some of the clitics appear only after certain forms (e.g. "шы<mod_foc>" in Kazakh, which expresses politeness, joins only 2nd person singular). And vice versa - some of the forms can get only certain clitics (imperative forms get only "чы" and "сана" in Tatar)


I moved the above clitics into a separate lexicon, and linked imperative forms to it, so that there is no overgeneration now (and a bit easier life for spectie's "testvocing" PC's).
I moved the above clitics into a separate lexicon, and linked imperative forms to it, so that there is no overgeneration now (and a bit easier life for spectie's "testvocing" PC's).


In Tatar some new clitics were added as well.
In Tatar some new clitics were added as well.

=== Monday, 28th May 2012 ===

; Checking & refactoring clitics (cont.)

A question whether <code>%+ғана%<postadv%>:% %{G%}ана # ; ! "only"</code> in <code>CLIT</code> continuation class was correct produced a discussion about whether we should handle harmonizing of such words in transducer (means matching them to the previous word) or post-generator can take care of that.

Another thing is that some Tatar modal particles do not vary depending of the previous word (e.g. "бит"), but I have put them into <code>CLIT</code> continuation class (as all other modal particles were there). This might be wrong.

I learned a lot of new stuff :), but the possible changes in CLIT lexicon were kept for later.

; Some work on postadverbs

See [[../Postadverbs]]

Revision as of 22:25, 29 May 2012

Monday, 28th May 2012

Checking & refactoring clitics

Some of the clitics appear only after certain forms (e.g. "шы<mod_foc>" in Kazakh, which expresses politeness, joins only 2nd person singular). And vice versa - some of the forms can get only certain clitics (imperative forms get only "чы" and "сана" in Tatar)

I moved the above clitics into a separate lexicon, and linked imperative forms to it, so that there is no overgeneration now (and a bit easier life for spectie's "testvocing" PC's).

In Tatar some new clitics were added as well.

Monday, 28th May 2012

Checking & refactoring clitics (cont.)

A question whether %+ғана%<postadv%>:% %{G%}ана # ; ! "only" in CLIT continuation class was correct produced a discussion about whether we should handle harmonizing of such words in transducer (means matching them to the previous word) or post-generator can take care of that.

Another thing is that some Tatar modal particles do not vary depending of the previous word (e.g. "бит"), but I have put them into CLIT continuation class (as all other modal particles were there). This might be wrong.

I learned a lot of new stuff :), but the possible changes in CLIT lexicon were kept for later.

Some work on postadverbs

See Kazakh and Tatar/Postadverbs