User:Sokureo
Contact information
Name: Elena Sokur |
E-mail address: pelmenium.sokurium@gmail.com |
IRC: sokureo |
Location: Moscow, Russia |
Timezone: UTC+3 |
GitHub: https://github.com/Sokureo |
Skills and qualifications
I am a 3rd-year student of the Bachelor's programme "Fundamental and Computational Linguistics" in National Research University Higher School of Economics (NRU HSE), Russia.
Main university courses:
- Programming (Python, R)
- Computer Tools for Linguistic Research
- Theory of Language (Phonetics, Morphology, Syntax, Semantics, Discourse)
- Machine Learning
- Math (Discrete Math, Linear Algebra and Calculus, Probability Theory and Mathematical Statistics)
- Theory of Algorithms
Technical skills:
- Programming languages: Python, R
- Web-design: HTML, CSS
- Frameworks: Flask, Django
- Databases: SQLite, MySQL
Field work experience:
2 years of expeditions in:
- Beserman dialect of Udmurt
- Hill Mari
Languages: Russian (native), English (advanced), German (intermediate), Udmurt (reading), Komi-Zyrian (basic knowledge of grammar).
CODING CHALLENGE
1. Installed Apertium tools.
2. Installed kpv-udm language pair using this instruction: http://wiki.apertium.org/wiki/Udmurt_and_Komi. Morphological analyzers for Komi-Zyrian and Udmurt exist.
3. Found kpv-udm parallel texts (only one is aligned).
4. Added some words from the aligned text and translated one sentence.
5. Estimated the coverage of Udmurt and Komi wikis: 77% for Udmurt and 90% for Komi.
6. Counted non-disambiguated words: 30% in Udmurt and 62% in Komi.