Apertium has moved from SourceForge to GitHub.
If you have any questions, please come and talk to us on #apertium on irc.freenode.net or contact the GitHub migration team.

User:Qavan

From Apertium
(Difference between revisions)
Jump to: navigation, search
(frame[inf,int,wp start])
 
(add Category:GSoC 2019)
Line 53: Line 53:
 
articles <br />
 
articles <br />
 
and some others(coming soon) <br />
 
and some others(coming soon) <br />
  +
  +
[[Category:GSoC 2019 student proposals]]

Revision as of 23:09, 8 April 2019

Contents

Information

Name: Nikolay Aleksandrov University: I. N. Ulianov Chuvash State University E-mail address: pro100qavan@gmail.com IRC: qavan Location: Cheboxary, Russia Timezone: UTC+3 Telegram: @qavan GitHub: https://github.com/qavan


Why is it that you are interested in machine translation and in Apertium?

Apertium as an open source system is of great interest to me because of its ability to take into account many factors of the language that other interpreters do not take into account. I will work for the benefit of society, that's great. I'm surprised by the system itself, because before that I did not see translators with Chuvash language who could work so flexibly.


Which of the published tasks are you interested in? What do you plan to do?

I am planning to developing an existing language pair Chuvash-Russian (Chuvash <-> Russian), which is now in nursery. Also I will working on improving Chuvash-Tatar translator.


Reasons why Google and Apertium should sponsor it

The sponsorship of Google gives an excellent motivation to work on projects in which, unfortunately, few enthusiasts come to work. And for Apertium, this is an excellent opportunity to notice people who can help make their work better and bring something new.


A description of how and who it will benefit in society

Currently, there is a problem of ousting the Chuvash language in Chuvashia. Since the chance to find among themselves a "common language" in Russian is much more likely, even though the Chuvash language speakers are much more. This is a big problem at the moment.

Current state:

apertium-rus: coverage ~89.6%, stems ~170,000(?)
apertium-chv: coverage ~90%, stems ~45,000(?)
apertium-chv-rus.chv-rus.dix: about ~75 words

Work plan

Available resources

Dictionaries

Russian-Chuvash and Chuvash-Russian online dictionaries
http://ru.samah.chv.su
https://chuvash-republic.ru
https://translate.academic.ru/
http://www.classes.ru
https://glosbe.com/ru/cv/
https://glosbe.com/ru/cv
printed dictionaries
and some others(coming soon)

Resources for corpus

wiki dumps

Resources for parallel corpus

online newspapers both in Chuvash and Russian, for example: https://ru.wikipedia.org/wiki/Районные_газеты_Чувашии
articles
and some others(coming soon)

Personal tools