Difference between revisions of "User:Sokureo"

From Apertium
Jump to navigation Jump to search
Line 53: Line 53:
<p>5. Estimated the coverage of Udmurt and Komi wikis: 77% for Udmurt and 90% for Komi.</p>
<p>5. Estimated the coverage of Udmurt and Komi wikis: 77% for Udmurt and 90% for Komi.</p>
<p>6. Counted non-disambiguated words: 30% in Udmurt and 62% in Komi.</p>
<p>6. Counted non-disambiguated words: 30% in Udmurt and 62% in Komi.</p>

== Non-Summer-of-Code plans for the Summer ==
From 17th June to 2nd July I will be in the Khanty expedition, but there is the Internet connection here, so I will be able to cooperate.

Revision as of 20:14, 26 March 2018

Contact information

Name: Elena Sokur
E-mail address: pelmenium.sokurium@gmail.com
IRC: sokureo
Location: Moscow, Russia
Timezone: UTC+3
GitHub: https://github.com/Sokureo

Skills and qualifications

I am a 3rd-year student of the Bachelor's programme "Fundamental and Computational Linguistics" in National Research University Higher School of Economics (NRU HSE), Russia.

Main university courses:

  • Programming (Python, R)
  • Computer Tools for Linguistic Research
  • Theory of Language (Phonetics, Morphology, Syntax, Semantics, Discourse)
  • Machine Learning
  • Math (Discrete Math, Linear Algebra and Calculus, Probability Theory and Mathematical Statistics)
  • Theory of Algorithms

Technical skills:

  • Programming languages: Python, R
  • Web-design: HTML, CSS
  • Frameworks: Flask, Django
  • Databases: SQLite, MySQL

Field work experience:

2 years of expeditions in:

  • Beserman dialect of Udmurt
  • Hill Mari

Languages: Russian (native), English (advanced), German (intermediate), Udmurt (reading), Komi-Zyrian (basic knowledge of grammar).

Coding Challenge

1. Installed Apertium tools.

2. Installed kpv-udm language pair using this instruction: http://wiki.apertium.org/wiki/Udmurt_and_Komi. Morphological analyzers for Komi-Zyrian and Udmurt exist.

3. Found kpv-udm parallel texts (only one is aligned).

4. Added some words from the aligned text to the Bilingual dictionary and translated one sentence.

5. Estimated the coverage of Udmurt and Komi wikis: 77% for Udmurt and 90% for Komi.

6. Counted non-disambiguated words: 30% in Udmurt and 62% in Komi.

Non-Summer-of-Code plans for the Summer

From 17th June to 2nd July I will be in the Khanty expedition, but there is the Internet connection here, so I will be able to cooperate.