Difference between revisions of "User:Uliana/gsoc-propuesta"
Line 41: | Line 41: | ||
'''Project roles:''' project manager, software developer (Python) |
'''Project roles:''' project manager, software developer (Python) |
||
− | '''Description:''' Creating a hybrid information extraction system using rule-based approach and machine learning technologies. |
+ | '''Description:''' Creating a hybrid information extraction system using rule-based approach and machine learning technologies. This system is able to extract named entities (persons, locations and organizations) and will become a part of stack technology for NLP. |
== My interest in Machine Translation == |
== My interest in Machine Translation == |
Revision as of 17:58, 17 March 2016
Contents
Contacts
Uliana Sentsova
E-mail: uliana.sentsova@gmail.com
Number: +7 (916) 774-95-30
Skype: ulyanasidorova
IRC channel: uliana at #apertium
Education
Lomonosov Moscow State University
Qualification: Bachelor in Linguistics (romance-german languages)
GPA: 10.0 / 10.0
National Research University „Higher School of Economics“
Qualification: Major in Natural Language Processing
Current GPA: 8.5 / 10.0
2015: Awardee of graduates’ competition „Natural Language Processing” (a competition for students hold by National Research University Higher School of Economics)
2014: Scholarship of Academic Council of MSU for scientific activities (a special award for top 10% students with academic excellence and scientific activity)
2013: Enhanced State Academic Scholarship for scientific activities (is awarded on the basis of academic excellence and scientific achievements)
Projects
„Building Open Source Information Extraction System for Russian Language”
Organisation: National Research „University Higher School of Economics”
Project roles: project manager, software developer (Python)
Description: Creating a hybrid information extraction system using rule-based approach and machine learning technologies. This system is able to extract named entities (persons, locations and organizations) and will become a part of stack technology for NLP.
My interest in Machine Translation
My interest in Apertium projects
I am interested in working an unreleased language pair for Sicilian - Spanish languages. As my coding challenge I created a new language package scn-spa, added basic vocabulary to the dictionary of Sicilian and translations into Sicilian-Spanisch dictionary. I also started to conduct research in the structure of Sicilian language: I have got into touch with contributors of Wikipedia in Sicilian language and thanks to spectei I also have reached computational linguist who studies in Munich and is native speaker of Sicilian.