Difference between revisions of "User:Qavan"
(frame[inf,int,wp start]) |
(add Category:GSoC 2019) |
||
Line 53: | Line 53: | ||
articles <br /> |
articles <br /> |
||
and some others(coming soon) <br /> |
and some others(coming soon) <br /> |
||
[[Category:GSoC 2019 student proposals]] |
Revision as of 21:09, 8 April 2019
Contents
- 1 Information
- 2 Why is it that you are interested in machine translation and in Apertium?
- 3 Which of the published tasks are you interested in? What do you plan to do?
- 4 Reasons why Google and Apertium should sponsor it
- 5 A description of how and who it will benefit in society
- 6 Current state:
- 7 Work plan
Information
Name: Nikolay Aleksandrov University: I. N. Ulianov Chuvash State University E-mail address: pro100qavan@gmail.com IRC: qavan Location: Cheboxary, Russia Timezone: UTC+3 Telegram: @qavan GitHub: https://github.com/qavan
Why is it that you are interested in machine translation and in Apertium?
Apertium as an open source system is of great interest to me because of its ability to take into account many factors of the language that other interpreters do not take into account. I will work for the benefit of society, that's great. I'm surprised by the system itself, because before that I did not see translators with Chuvash language who could work so flexibly.
Which of the published tasks are you interested in? What do you plan to do?
I am planning to developing an existing language pair Chuvash-Russian (Chuvash <-> Russian), which is now in nursery. Also I will working on improving Chuvash-Tatar translator.
Reasons why Google and Apertium should sponsor it
The sponsorship of Google gives an excellent motivation to work on projects in which, unfortunately, few enthusiasts come to work. And for Apertium, this is an excellent opportunity to notice people who can help make their work better and bring something new.
A description of how and who it will benefit in society
Currently, there is a problem of ousting the Chuvash language in Chuvashia. Since the chance to find among themselves a "common language" in Russian is much more likely, even though the Chuvash language speakers are much more. This is a big problem at the moment.
Current state:
apertium-rus: coverage ~89.6%, stems ~170,000(?)
apertium-chv: coverage ~90%, stems ~45,000(?)
apertium-chv-rus.chv-rus.dix: about ~75 words
Work plan
Available resources
Dictionaries
Russian-Chuvash and Chuvash-Russian online dictionaries
http://ru.samah.chv.su
https://chuvash-republic.ru
https://translate.academic.ru/
http://www.classes.ru
https://glosbe.com/ru/cv/
https://glosbe.com/ru/cv
printed dictionaries
and some others(coming soon)
Resources for corpus
wiki dumps
Resources for parallel corpus
online newspapers both in Chuvash and Russian, for example: https://ru.wikipedia.org/wiki/Районные_газеты_Чувашии
articles
and some others(coming soon)