User:Sokureo

From Apertium
Jump to navigation Jump to search

Contact information

Name: Elena Sokur
E-mail address: pelmenium.sokurium@gmail.com
IRC: sokureo
Location: Moscow, Russia
Timezone: UTC+3
GitHub: https://github.com/Sokureo

Skills and qualifications

I am a 3rd-year student of the Bachelor's programme "Fundamental and Computational Linguistics" in National Research University Higher School of Economics (NRU HSE), Russia.

Main university courses:

  • Programming (Python, R)
  • Computer Tools for Linguistic Research
  • Theory of Language (Phonetics, Morphology, Syntax, Semantics, Discourse)
  • Machine Learning
  • Math (Discrete Math, Linear Algebra and Calculus, Probability Theory and Mathematical Statistics)
  • Theory of Algorithms

Technical skills:

  • Programming languages: Python, R
  • Web-design: HTML, CSS
  • Frameworks: Flask, Django
  • Databases: SQLite, MySQL

Field work experience:

2 years of expeditions in:

  • Beserman dialect of Udmurt
  • Hill Mari

Languages: Russian (native), English (advanced), German (intermediate), Udmurt (reading), Komi-Zyrian (basic knowledge of grammar).

CODING CHALLENGE

1. Installed Apertium tools.

2. Installed kpv-udm language pair using this instruction: http://wiki.apertium.org/wiki/Udmurt_and_Komi. Morphological analyzers for Komi-Zyrian and Udmurt exist.

3. Found kpv-udm parallel texts (only one is aligned).

4. Added some words from the aligned text and translated one sentence.

5. Estimated the coverage of Udmurt and Komi wikis: 77% for Udmurt and 90% for Komi.

6. Counted non-disambiguated words: 30% in Udmurt and 62% in Komi.