Difference between revisions of "User:Firespeaker/TODO"

From Apertium
Jump to navigation Jump to search
Line 10: Line 10:
:: Good enough? —[[User:Firespeaker|Firespeaker]] 15:52, 21 September 2012 (UTC)
:: Good enough? —[[User:Firespeaker|Firespeaker]] 15:52, 21 September 2012 (UTC)
::: Could we get examples/categories for kir/kaz ? - [[User:Francis Tyers|Francis Tyers]] 17:39, 21 September 2012 (UTC)
::: Could we get examples/categories for kir/kaz ? - [[User:Francis Tyers|Francis Tyers]] 17:39, 21 September 2012 (UTC)

== Things for hector2 ==


====New list 2012.09.21====
* Vowel harmony: tests/автан.yaml. There is a strange change of %{Ӑ%} to и. --[[User:Hectoralos|Hèctor Alòs i Font]] 19:05, 21 September 2012 (UTC)


=== questions/requests for hector2 ===
* <s>What is special about [[пуртӑ]]? As far as I can tell, it behaves as one would expect for a noun ending in ӑ that has gemination. —[[User:Firespeaker|Firespeaker]] 07:56, 18 September 2012 (UTC)</s>
** The gemination of т. There is no gemination in CCӑ, only in VCӑ. The exception is пуртӑ.--[[User:Hectoralos|Hèctor Alòs i Font]] 09:14, 18 September 2012 (UTC)
::: Ah. I think we're going to move gemination to something conditioned by lexc. Trying to set up phonological triggers for it is too complex and has been causing too many problems. Marking nouns that get gemination in lexc is simple. —[[User:Firespeaker|Firespeaker]] 07:07, 19 September 2012 (UTC)
* Add forms and correct ones at [[Morphology_of_Chuvash#Nouns_ending_in_.E2.80.B9.D0.BE.E2.80.BA|Nouns ending in о]] —[[User:Firespeaker|Firespeaker]] 08:05, 18 September 2012 (UTC)
** I'll do. In fact, the rule is quite simple: words ending in unstressed ''o'', ''u'' or ''a'' behave as words ending in ''ӑ'', except in gemination (and the "dictionary" form, which conserves the Russian orthography). A subrule, which complicates this general one, is that the 3rd person differs for words invariable in Russian, and also радио seems to be special (it behaves as ''o'' was stressed - it seems there some Russian words are assimilated changing the stress to the final vowel). I'll write down the examples, change the wiki, etc.--[[User:Hectoralos|Hèctor Alòs i Font]] 19:59, 18 September 2012 (UTC)
*** Done. There are two columns I don't understand: ''σ́]'' and ''о→ов?'', so I'm not sure what to put there. I've added a new column, about the genitive form. I've changed some words and added new ones, in order to have all types of words (I think) and to use these words that are in the wiki. All words have their yaml file.--[[User:Hectoralos|Hèctor Alòs i Font]] 06:41, 22 September 2012 (UTC)
* Could we have pages for the following irregular nouns, especially the first two? —[[User:Firespeaker|Firespeaker]] 08:30, 18 September 2012 (UTC)
** [[пичче]]
** [[аппа]]
** [[кӗрӳ]]
** [[мучи]]
** [[кинемей]]
** [[кукаҫи]]
** [[ен]]
** [[хӗрри]]
* <s>Could we have a page for the following regular noun? —[[User:Firespeaker|Firespeaker]] 08:48, 18 September 2012 (UTC)</s>
** <s>[[хӗв]]</s>
*** вот! There's also a yaml file. I am not sure yet about px2sg.loc and px2sg.abl, but all other forms should be correct.
**** Awesome, thanks!
***** In principle the forms px2sg are already revised. However I'm not really sure about px2sg.dat. The forms px2sg are seldom used and there were zillions of errors in px2sg.loc and px2sg.abl, even after 2-3 revisions (i.a. by philology teachers). In px2sg.dat in some cases {У} falls, but I'm not sure which is the rule (I haven't analyzed it yet) and I guess the informants may have made mistakes. In these cases, rarely used, where a few cases without any difference with others don't follow the rules, I finally decided to follow the general rule and not believe the informants.--[[User:Hectoralos|Hèctor Alòs i Font]] 19:03, 20 September 2012 (UTC)
* Could I have yaml files for the following words? —[[User:Firespeaker|Firespeaker]] 08:46, 18 September 2012 (UTC)
** <s>[[йӗп]]</s>
** <s>[[пуртӑ]]</s>
** <s>[[атте]]</s>
** [[пӳ]]
** [[лаша]]
** <s>[[утӑ]]</s>
*** You can use the script tests/gen_yaml_mot.sh, which creates a yaml for a given noun from the wiki.yaml file (which I regularly read from the wiki, when I change something in it). As above, I still have some doubts about px2sg.loc and px2sg.abl, but all other forms should be correct. Атте should be correct in all forms.--[[User:Hectoralos|Hèctor Alòs i Font]] 19:07, 18 September 2012 (UTC)
**** See above about px2sg (by the way, I see that both атте and анне have the forms px2sg.loc and px2sg.abl which I corrected everywhere. As they are irregular words I don't dare correct them... It may be said that really in these words personal suffixes are used, so this gives some more confidence).--[[User:Hectoralos|Hèctor Alòs i Font]] 19:03, 20 September 2012 (UTC)
* Given the following line in the twol, is the gemination in утӑ right? —[[User:Firespeaker|Firespeaker]] 07:39, 21 September 2012 (UTC)
: <pre>утӑ:ут%{nodup%}ӑ N1 ; ! "сено" ! exception according to И.П. Павлов 1974: 18</pre>
** I'm sorry. I didn't notice it was "especial". I've corrected the third person forms, which don't have to regular gemination.
* As you requested, I've put the files tests/v_кил_del.yaml and tests/v_кала_del.yaml where you can find examples of %{Ӑdel%}, %{ӗdel%}, %{иdel%}. кала is not working because of the vowel harmony problem you can find e.g. in tests/автан.yaml (my first request of the new list). --[[User:Hectoralos|Hèctor Alòs i Font]] 20:05, 21 September 2012 (UTC)


== General cv.twol TODO list ==
== General cv.twol TODO list ==

Revision as of 03:22, 28 September 2012

Things for selimcan

Things for spectie

Good enough? —Firespeaker 15:52, 21 September 2012 (UTC)
Could we get examples/categories for kir/kaz ? - Francis Tyers 17:39, 21 September 2012 (UTC)

Things for hector2

New list 2012.09.21

  • Vowel harmony: tests/автан.yaml. There is a strange change of %{Ӑ%} to и. --Hèctor Alòs i Font 19:05, 21 September 2012 (UTC)


questions/requests for hector2

  • What is special about пуртӑ? As far as I can tell, it behaves as one would expect for a noun ending in ӑ that has gemination. —Firespeaker 07:56, 18 September 2012 (UTC)
    • The gemination of т. There is no gemination in CCӑ, only in VCӑ. The exception is пуртӑ.--Hèctor Alòs i Font 09:14, 18 September 2012 (UTC)
Ah. I think we're going to move gemination to something conditioned by lexc. Trying to set up phonological triggers for it is too complex and has been causing too many problems. Marking nouns that get gemination in lexc is simple. —Firespeaker 07:07, 19 September 2012 (UTC)
  • Add forms and correct ones at Nouns ending in оFirespeaker 08:05, 18 September 2012 (UTC)
    • I'll do. In fact, the rule is quite simple: words ending in unstressed o, u or a behave as words ending in ӑ, except in gemination (and the "dictionary" form, which conserves the Russian orthography). A subrule, which complicates this general one, is that the 3rd person differs for words invariable in Russian, and also радио seems to be special (it behaves as o was stressed - it seems there some Russian words are assimilated changing the stress to the final vowel). I'll write down the examples, change the wiki, etc.--Hèctor Alòs i Font 19:59, 18 September 2012 (UTC)
      • Done. There are two columns I don't understand: σ́] and о→ов?, so I'm not sure what to put there. I've added a new column, about the genitive form. I've changed some words and added new ones, in order to have all types of words (I think) and to use these words that are in the wiki. All words have their yaml file.--Hèctor Alòs i Font 06:41, 22 September 2012 (UTC)
  • Could we have pages for the following irregular nouns, especially the first two? —Firespeaker 08:30, 18 September 2012 (UTC)
  • Could we have a page for the following regular noun? —Firespeaker 08:48, 18 September 2012 (UTC)
    • хӗв
      • вот! There's also a yaml file. I am not sure yet about px2sg.loc and px2sg.abl, but all other forms should be correct.
        • Awesome, thanks!
          • In principle the forms px2sg are already revised. However I'm not really sure about px2sg.dat. The forms px2sg are seldom used and there were zillions of errors in px2sg.loc and px2sg.abl, even after 2-3 revisions (i.a. by philology teachers). In px2sg.dat in some cases {У} falls, but I'm not sure which is the rule (I haven't analyzed it yet) and I guess the informants may have made mistakes. In these cases, rarely used, where a few cases without any difference with others don't follow the rules, I finally decided to follow the general rule and not believe the informants.--Hèctor Alòs i Font 19:03, 20 September 2012 (UTC)
  • Could I have yaml files for the following words? —Firespeaker 08:46, 18 September 2012 (UTC)
    • йӗп
    • пуртӑ
    • атте
    • пӳ
    • лаша
    • утӑ
      • You can use the script tests/gen_yaml_mot.sh, which creates a yaml for a given noun from the wiki.yaml file (which I regularly read from the wiki, when I change something in it). As above, I still have some doubts about px2sg.loc and px2sg.abl, but all other forms should be correct. Атте should be correct in all forms.--Hèctor Alòs i Font 19:07, 18 September 2012 (UTC)
        • See above about px2sg (by the way, I see that both атте and анне have the forms px2sg.loc and px2sg.abl which I corrected everywhere. As they are irregular words I don't dare correct them... It may be said that really in these words personal suffixes are used, so this gives some more confidence).--Hèctor Alòs i Font 19:03, 20 September 2012 (UTC)
  • Given the following line in the twol, is the gemination in утӑ right? —Firespeaker 07:39, 21 September 2012 (UTC)
утӑ:ут%{nodup%}ӑ N1 ; ! "сено"	! exception according to И.П. Павлов 1974: 18
    • I'm sorry. I didn't notice it was "especial". I've corrected the third person forms, which don't have to regular gemination.
  • As you requested, I've put the files tests/v_кил_del.yaml and tests/v_кала_del.yaml where you can find examples of %{Ӑdel%}, %{ӗdel%}, %{иdel%}. кала is not working because of the vowel harmony problem you can find e.g. in tests/автан.yaml (my first request of the new list). --Hèctor Alòs i Font 20:05, 21 September 2012 (UTC)

General cv.twol TODO list

  • gemination
  • ӳ:ӗв, у:ӑв
  • Nouns ending in о
  • <px2sg><dat> of nouns
  • "Irregular" nouns (family relations that have different stems):w
  • clean up twol conflicts